Key files:

Scraping_Torah.pynb NLP.ipynb visualization.py

These files are intended to be run in order.

Scraping_Torah.ipynb

When this notebook is run, it retrieves verses from the Jewish Virtual Library, and packages them into Torah_Verses, Torah_Chapters, and Chapter_Indices. Additionally, it generates a labeling scheme based on https://en.wikipedia.org/wiki/Composition_of_the_Torah, which is stored in Verse_Labels.csv.

NLP.ipynb

When this notebook is run, it uses Torah_Verses.csv and Verse_Labels.csv to produce a trained vectorizer, topic modeler, and classification algorithm, which are stored in model.p.

visualization.py

When this is run using

streamlit run visualization.py

it runs a streamlit application that allows users to enter verses and determine what

Related Files:

Torah_Chapters.csv: Contains all Torah verses, grouped by chapter. Generated by Scraping_Torah.ipynb

Chapter_Indices.csv: For each chapter, labels it with its chapter number and the book it is from. Intended for visualization purposes. Generated by Scraping_Torah.ipynb

Torah_Verses.csv: Contains all Torah verses, as individual rows. Generated by Scraping_Torah.ipynb

Verse_Labels.csv: Simple array, containing either a 'p', 'y', or 'y', corresponding to the source for the appropriate verse. Generated by Scraping_Torah.ipynb

model.p: A pickled tuple containing a vectorizer, topic modeler, and classification algorithm. All have been trained appropriately on the torah verses. Generated by NLP.ipynb.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Key files:

Scraping_Torah.ipynb

NLP.ipynb

visualization.py

Related Files:

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Chapter Indices.csv		Chapter Indices.csv
NLP.ipynb		NLP.ipynb
Project 4 Presentation.pdf		Project 4 Presentation.pdf
README.md		README.md
Scraping_Torah.ipynb		Scraping_Torah.ipynb
Torah_Chapters.csv		Torah_Chapters.csv
Torah_Verses.csv		Torah_Verses.csv
Verse_Labels.csv		Verse_Labels.csv
model.p		model.p
visualization.py		visualization.py

computerGeologist/Project_4

Folders and files

Latest commit

History

Repository files navigation

Key files:

Scraping_Torah.ipynb

NLP.ipynb

visualization.py

Related Files:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages