A collection of example code for performing entity extraction (i.e., named entity recognition) in a variety of human languages using a variety of Java-based open source software libraries.
This project will eventually contain entity extraction examples for several different NER tools. Each example or tool will have some particular instructions for getting it up & running. Please follow these instructions carefully to ensure you are able to run this code successfully.
Additionally, the pom.xml
file contains Maven dependencies for each individual NER tool. When using them in your own projects, you will obviously only need to include the dependencies for the tools you're actually using, not all of them.
Since the creators of GATE don't distribute their plugins bundled with the JARs published on Maven Central, you'll need to download them separately before running the example code.
-
Download either the
BIN
,SRC
, orALL
package... or just download this package directly: https://sourceforge.net/projects/gate/files/gate/8.0/gate-8.0-build4825-SRC.zip/download -
Unzip the package, and copy the
plugins
directory it contains intogate-demo/src/main/resources
AnnieExtractor.java
demonstrates how to use the default English entity extractor for GATE (called ANNIE), and ArabicExtractor.java
provides another example using the Arabic version.
Note: Since certain third-party libraries used in this project are licensed via the GPL, this project as a whole must be released under the GPL as well. However, individual files will be marked with more permissive open source licenses whenever possible. For example, the GATE examples are released under the LGPL.
Entity-Extraction-Demos
Copyright (C) 2015 Charlie Greenbacker
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.