A simple Naive Bayes classifier that uses n-gram language models to predict whether a given sentence carries positive or negative sentiment.
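The core idea can be sketched in a few lines: each class has an n-gram model, a sentence is scored by summing smoothed log probabilities of its n-grams under each model, and the higher-scoring class wins. This is a minimal illustration with toy bigram counts, not the repository's actual implementation; the function names and the add-one smoothing are assumptions.

```python
import math
from collections import Counter

def bigrams(tokens):
    """Return the list of bigrams (as tuples) in a token sequence."""
    return list(zip(tokens, tokens[1:]))

def score(tokens, counts, total, vocab_size):
    """Sum of add-one-smoothed log probabilities of the sentence's bigrams."""
    return sum(
        math.log((counts[ng] + 1) / (total + vocab_size))
        for ng in bigrams(tokens)
    )

def classify(sentence, pos_counts, neg_counts):
    """Pick the label whose bigram model assigns the higher log score."""
    tokens = sentence.lower().split()
    vocab = len(set(pos_counts) | set(neg_counts))
    pos = score(tokens, pos_counts, sum(pos_counts.values()), vocab)
    neg = score(tokens, neg_counts, sum(neg_counts.values()), vocab)
    return 'positive' if pos >= neg else 'negative'

# Toy counts standing in for trained models
pos_counts = Counter({('great', 'movie'): 3, ('really', 'great'): 2})
neg_counts = Counter({('terrible', 'movie'): 3, ('really', 'terrible'): 2})
print(classify('a really great movie', pos_counts, neg_counts))  # → positive
```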
Since some of the corpora are rather large, Git LFS must be installed before cloning this repository.
Written in Python 3.6.7; dependencies are listed in `requirements.txt`. Using virtualenv, they can be installed as follows:

```
user@user:~$ virtualenv env
```

The environment can be activated and the dependencies installed using:

```
user@user:~$ source env/bin/activate
user@user:~$ pip install -r requirements.txt
```

The environment can be deactivated using:

```
user@user:~$ deactivate
```

Contains different corpora whose format has been standardized as follows:
- Two .csv files for training data: one for positive and one for negative examples
- One .csv file for development data, used to evaluate the classifier
- Optionally, if the corpus is large enough: one .csv file for testing data
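Given that standardized format, reading a corpus file reduces to collecting one sentence per row. A minimal sketch, assuming the sentence sits in the first CSV column (the file names shown in the comment are placeholders, not the repository's actual paths):

```python
import csv

def load_sentences(path):
    """Read one sentence per row from a standardized corpus .csv file.
    Assumes the sentence is in the first column of each row."""
    with open(path, newline='', encoding='utf-8') as f:
        return [row[0] for row in csv.reader(f) if row]

# Usage (hypothetical file names; substitute the actual corpus files):
# train_pos = load_sentences('positive.csv')
# train_neg = load_sentences('negative.csv')
# dev = load_sentences('dev.csv')
```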
Contains the raw corpus data
Contains n-gram models that were created with specific settings and saved; loading one of these saved models when classifying avoids the cost of recreating it.
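The `.p` extension on the saved model files suggests they are Python pickles. A minimal sketch of the load-or-rebuild caching idea, under that assumption; the repository's own loading code may differ:

```python
import os
import pickle

def load_or_build(path, build):
    """Load a cached object from `path` if it exists; otherwise call
    `build()`, cache the result at `path`, and return it."""
    if os.path.exists(path):
        with open(path, 'rb') as f:
            return pickle.load(f)
    model = build()
    with open(path, 'wb') as f:
        pickle.dump(model, f)
    return model

# Usage (hypothetical builder; a real one would train a LanguageModel):
# model = load_or_build('models/positive_n2_stemmed_rottentomatoes.p',
#                       lambda: {('great', 'movie'): 3})
```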
Contains the class files that are used in the scripts
Contains preliminary results for each different training corpus
To classify a given sentence, the following bash command can be used:

```
user@user:~$ python classify_sentence.py "..."
```

where `...` denotes the sentence to be classified.
To classify a sentence in a Python script:

```python
import resources.LanguageModel as ngram
import resources.NaiveBayesClassifier as NBclassifier

# Modify the model_file argument to select another model from models/
LM_pos = ngram.LanguageModel(model_file='models/positive_n2_stemmed_rottentomatoes.p')
LM_neg = ngram.LanguageModel(model_file='models/negative_n2_stemmed_rottentomatoes.p')

# Construct the classifier from the two labeled models
classifier = NBclassifier.NaiveBayesClassifier(('positive', LM_pos), ('negative', LM_neg))

sentence = '...'
classifier.classify(sentence)
```