A curated list of resources dedicated to Natural Language Processing
Maintainers - Keon Kim
- Stanford's Coursera Course on NLP from basics
- Intro to Natural Language Processing on Coursera by U of Michigan
- Intro to Artificial Intelligence course on Udacity which also covers NLP
- Natural Language Processing - course on Coursera that was only done in 2013 but the videos are still up. Also Mike Collins is a great professor and his notes and lectures are very good.
- Statistical Machine Translation - a Machine Translation course with great assignments and slides.
- Natural Language Processing SFU - course by Prof Anoop Sarkar on Natural Language Processing. Good notes and some good lectures on youtube about HMM.
- Pre-trained word embeddings for WSJ corpus by Koc AI-Lab
- Word2vec by Mikolov
- HLBL language model by Turian
- Real-valued vector "embeddings" by Dhillon
- Improving Word Representations Via Global Context And Multiple Word Prototypes by Huang
- Dependency based word embeddings
- Global Vectors for Word Representations
- Node.js - Node.js Libaries for NLP
- Natural - general natural language facilities for node
- Python - Python NLP Libraries
- C++ - C++ Libraries
- Java - Java NLP Libraries
- word2vec - on creating vectors to represent language, useful for RNN inputs
- sense2vec - on word sense disambiguation
- Infinite Dimensional Word Embeddings - new
- Skip Thought Vectors - word representation method
- Adaptive skip-gram - similar approach, with adaptive properties
- Neural autocoder for paragraphs and documents - LTSM representation
- LTSM over tree structures
- Sequence to Sequence Learning - word vectors for machine translation
- Teaching Machines to Read and Comprehend - DeepMind paper
- Efficient Estimation of Word Representations in Vector Space
- Improving distributional similarity with lessons learned from word embeddings
- Low-Dimensional Embeddings of Logic
- Tutorial on Markov Logic Networks (based on this paper)
- Markov Logic Networks for Natural Language Question Answering
- Distant Supervision for Cancer Pathway Extraction From Text
- Privee: An Architecture for Automatically Analyzing Web Privacy Policies
- A Neural Probabilistic Language Model
- Template-Based Information Extraction without the Templates
- Retrofitting word vectors to semantic lexicons
- Unsupervised Learning of the Morphology of a Natural Language
- Natural Language Processing (Almost) from Scratch
- Computational Grounded Cognition: a new alliance between grounded cognition and computational modelling
- Learning the Structure of Biomedical Relation Extractions
- Relation extraction with matrix factorization and universal schemas
- Word2Vec
- Relation Extraction with Matrix Factorization and Universal Schemas
- Towards a Formal Distributional Semantics: Simulating Logical Calculi with Tensors
- Presentation slides for MLN tutorial
- Presentation slides for QA applications of MLNs
- Presentation slides
- Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations
- Blog Post on Deep Learning, NLP, and Representations
- Blog Post on NLP Tutorial
- TreeTagger - POSTagger
- Stanford - POSTagger
- Freeling
- ixa-pipe-pos
- Ruby Snowball Implementation
- Spaguetti POSTagger(Based on NLTK + CESS corpus
- OpenNLP - Person/Place/Organization models
- DBPedia Spotlight
- CitiusTagger - Spanish NER and POSTagger
- Word2Vec vectors for Wikipedia Spanish Articles
- DBpedia Spanish Entities Titles
- DBpedia Spanish Abstracts
- Conshuga - Galician Verb conjugator
part of the lists are from ai-reading-list and nlp-reading-group