File | Date | Author | Commit |
---|---|---|---|
src | 2015-04-05 |
![]() |
[2cb69c] Switch language detection to oss-utils |
.gitignore | 2014-11-02 |
![]() |
[6bcd8d] Initial skeleton. |
CHANGES.txt | 2014-11-23 |
![]() |
[343a68] Implements #34 |
LICENSE.txt | 2014-11-23 |
![]() |
[343a68] Implements #34 |
NOTICE.txt | 2014-12-26 |
![]() |
[df39fa] Implements #37, #36, #34 |
README.md | 2015-03-29 |
![]() |
[acc80d] Integration with oss-cluster |
pom.xml | 2015-04-05 |
![]() |
[2cb69c] Switch language detection to oss-utils |
An open source RESTFul Web Service for text extraction and analysis.
oss-extractor supports various binary formats.
Check that you have installed a JAVA Runtime Environment 7 or newer
The binary archives are available at SourceForge
To follow this quickstart please download oss-extractor-1.1-exec.jar
The compilation and packaging requires Maven 3.0 or newer
Clone the source code:
git clone https://github.com/opensearchserver/oss-extractor.git
Compile and package (the binary will located in the target directory):
mvn clean package
java -jar target/oss-extractor-xxx-exec.jar
curl -XGET http://localhost:9091
curl -XGET http://localhost:9091/pdfbox
By uploading a document:
curl -XPUT --data-binary @tutorial.pdf http://localhost:9091/pdfbox
If the file is already available in the server, the follow API can be used:
curl -XGET http://localhost:9091/pdfbox?path=/home/user/myfile.pdf
Issues and milestones are tracked on GitHub:
Copyright 2014-2015 OpenSearchServer Inc.
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.