We have several introductory tutorials to help get you started with using Fonduer.
In this tutorial, we build a Fonduer application to tackle the task of extracting maximum storage temperatures for specific transistor part numbers from their datasheets (PDF format).
In this tutorial, we use Fonduer to extract the place of of birth for American presidents from their wikipedia html pages (HTML format).
In this advanced tutorial, we build a Fonduer application
to tackle the task of extracting images for specific transistors from their
datasheets. This demonstrates how Fonduer can be used for image data in
additional to text.
docker-compose upWe use a few applications that you'll need to install and be sure are on your PATH.
For OS X using homebrew:
brew install poppler
brew install postgresql
brew install libpng freetype pkg-configOn Debian-based distros:
sudo apt install libxml2-dev libxslt-dev python3-dev libpq-dev
sudo apt build-dep python-matplotlib
sudo apt install poppler-utils
sudo apt install postgresqlFor the Python dependencies, we recommend using a virtualenv. Once you have cloned the repository, change directories to the root of the repository and run
virtualenv -p python3 .venvOnce the virtual environment is created, activate it by running
source .venv/bin/activateAny Python libraries installed will now be contained within this virtual
environment. To deactivate the environment, simply run deactivate.
Then, install Fonduer and any other python dependencies by running:
pip install -r requirements.txtAfter installing all the requirements, just run:
jupyter notebook
