Install Git
$ git clone https://github.com/apinzonf/HtmlContentExtractor.git
Install JDK 1.8 and Maven 3.5
$ cd HtmlContentExtractor
$ mvn compile
$ mvn package
Install Java 8
$ java -jar ./target/html-content-extractor-0.0.1-SNAPSHOT.jar -input ./src/main/resources/input.txt -output ./outputResults
-input a location of a file that has a list of URLs
-output a location of a folder to write the results