- CurEx – A System for Extracting, Curating, and Exploring Domain-Specific Knowledge Graphs from Text [CurEx, CIKM 2018]
- Mining Structures of Factual Knowledge from Text: An Effort-Light Approach [PhD Thesis of Prof Xiang Ren]
- DeepDive: Incremental Knowledge Base Construction Using DeepDive [VLDB 2015] {Paper} {Slides}{Project link}
- Distant Supervision; Build a fator graph containing both relation phrases and features.
- Knowledge Vault: A Web-Scale Approach to Probabilistic Knowledge Fusion [Knowledge Vault, KDD 2014]
- ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering [ClusType, KDD2015] (Also Relation Clustering) croase-grained entity typing
- Label Noise Reduction in Entity Typingby Heterogeneous Partial-Label Embedding [PLE, KDD 2016] fine-grained entity typing
- AFET: Automatic Fine-Grained Entity Typing byHierarchical Partial-Label Embedding [AFET, EMNLP 2016] fine-grained entity typing
- A survey of named entity recoginition and classification {Paper}
- No Noun Phrase Left Behind: Detecting and Typing Unlinkable Entities [EMNLP-CoNLL 2012]{Paper}
Tutorial:
- A SURVEY ON RELATION EXTRACTION (CMU) {Slides}
- Relation Extraction: CSE 517: Natural Language Processing {Slides}
- Relation Extraction II: CSE 517: Natural Language Processing {Slides}
Papers:
- CoType: Joint Extraction of Typed Entities and Relations with Knowledge Bases [CoType, WWW2017] https://blog.csdn.net/hqc888688/article/details/73559365
- [g]Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations {Code}{Slides}
- Recently, researchers have developed multi- instance learning algorithms to combat the noisy training data that can come from heuristic labeling, but their models assume relations are disjoint . for example they cannot extract the pair Founded(Jobs, Apple) and CEO-of(Jobs, Apple). This paper presents a novel approach for multi-instance learning with overlapping relations that combines a sentence-level extraction model with a simple, corpus-level component for aggregating the individual facts.
- [g]Modeling missing data in distant supervision for information extraction [ACL2013] missing data problem(?)
- Neural Relation Extraction with Selective Attention over Instances [ACL 2016] {Paper}{Code}{Blog}
- Fix the problem of distant supervised relation extraction
- Employs CNN to embed the semantics of sentences, then builds sentence-level attention over multi- ple instances, which is expected to dynamically reduce the weights of those noisy instances (major contribution). Notes in group meeting.
- Multi-instance Multi-label Learning for Relation Extraction [EMMLP-CoNLL 2012]{Paper}
Notes:
- Candiate Entity Ranking: https://www.jianshu.com/p/90e2c7a5c9f5
- http://nlpprogress.com/english/entity_linking.html
Papers:
- Entity Linking wuth a Knowledge Base: Issues, Techniques, and Solutions (Survey)
- Robust Disambihguation of Named Entities in Text [EMNLP 2011]
- Liege: Link Entities in Web Lists with Knowledge Base [KDD 2012]
- Entity Lnking for Tweets [ACL 2013]
- Collective Annotation of Wikipedia Entities in Webb Text [KDD 2009]
- Local and Global Algorithms for Disambiguation to Wikipedia [ACL 2011]
Sides, Tutorials and Surveys
- Brief Introduction and Review of Open Information Extraction System {Slides}
- A Survey on Open Information Extraction {Paper}
- Open Information Extraction on Scientific Text: An Evaluation {Paper}
- https://github.com/gkiril/oie-resources
OpenIE Tools
- Open Information Extraction from the Web [TextRunner, IJCAI 2007]
- Incoherent Extractions
- Uninformative Extractions
- MinIE: Minimizing Facts in Open Information Extraction [MinIE, EMNLP 2017] Code (java)
- Represent information about polarity, modality, attribution and quantities with semantic annotations (instead of actual extraction)
- idetify and remove parts that are considered over specific
- Facts that Matter [SALIE, EMNLP 2018] {Code}
- Extract salient facts, which fulfill two requirements: (1) relevance and (2) diversity
- Use syntactic constraints to specify relation phrases (3 simple patterns). Find longest phrase matching one of the syntactic constraints.
- Find nearest noun-phrases to the left and right of relation phrase. - Not a relative pronoun or WHO-adverb or an existential there.
- To avoid "overspecified" relation phrases, a relation phrase must have many distinct args in a large corpus
- ClausIE: Clause-Based Open Information Extraction [ClausIE, WWW 2013] {Paper}{Code (Python)}{Code (Java)}
- Map the dependency relations of an input senetnce to clause constituents.
- A set of coherent clauses presenting a simple linguistic structure is derived from the input
OpenIE Triple Clustering
- Query-Driven On-The-Fly Knowledge Base Construction [QKBfly, VLDB2017] relation
- CESI: Canonicalizing Open Knowledge Bases using Embeddings and Side Information [CESI, WWW2018] Code triple
- Canonicalizing Open Knowledge Bases [CIKM 2014] triple
- Towards Practical Open Knowledge Base Canonicalization [FAC, CIKM 2018] triple
- Identifying Relations for Open Information Extraction [ReVerb, EMNLP 2011] {Paper}{Code}{Homepage} relation
- Mophological Normalization
- Open Information Extraction to KBP Relations in 3 Hours [TAC. 2013] {Paper}
- Main idea: relation phrases mapping to KB otology
- Manually define a set of rules for each relation, to conduct the mapping
- The motivation and error analysis are well written.
- ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering [ClusType, KDD2015]
- Relation Clustering: Two relation phrases tend to have similar cluster membershipd, if they have similar (1) strings; (2) context words; and (3) left and right argument type indicators
- Unsupervised Methods for Determining Object and Relation Synonyms on the Web [Resolover, JAIR 2009] relation
- Relation Extraction with Matrix Fatorization and Universal Schemes [NAACL-HLT 2013] {Paper}
- Close to relation clustering
- Create a universal scheme by unioning surface form predicates from Open IE and relations in the schemas of pre-existing databbases.
Relation Phrases Clustering (finding synonymous phrases and hypernyms)
- HARPY: Hypernyms and Alignment of Relational Paraphrases [HAPPY, COLING 2014] {Paper}{Data}
- POLY: Mining Relational Paraphrases from Multilingual Sentences [POLY, EMNLP 2016] {Paper}{Data}
- Make use of another language
- RELLY: Inferring Hypernym Relationships Between Relational Phrases [REELY, EMNLP 2015] {Paper}{Data}
- PATTY: A Taxonomy of Relational Patterns with Semantic Types [PATTY, EMNLP 2012] {Paper}{Data}
- Discovering and Exploring Relations on the Web [PATTY demo, VLDB 2012] {Paper}
- [g]Ensemble Semantics for Large-Scale Unsupervised Relation Extraction [WEBRE, EMNLP-CoNELL 2012] relation
- [g]Relation Schema Induction using Tensor Factorization with Side Information [SICTF, EMNLP 2016] relation schema induction (for building domain-specific kb from unstructured text) Code: https://github.com/malllabiisc/sictf
- [g]Constrained Information-Theoretic Tripartite Graph Clustering to Identify Semantically Similar Relations [IJCAI 2015]
Relation Linking
- Old is Gold: Linguistic Driven Approach for Entity and Relation Linking of Short Text [FALCON, NAACL 2019] {Paper} {Code}{Demo}{Notes}
- EARL: Joint Entity and Relation Linking for Question Answering [EARL, ISWC 2018] {Paper} {Code}
Others
- Intergring Local Context and Global Cohesiveness for Open Information Extraction [ReMine, WSDM 2019]
- Solving a joint optimization problem to unify (1) segmenting entity/relation phrases in individual sentences based on local context; and (2) measuring the quality of tuples extracted from individual sentences with a translating-based objective.
- DeepWalk: Online Learning of Social Representations [DeepWalk, KDD 2014] [Code] (https://github.com/phanein/deepwalk Slides: https://www.slideshare.net/bperz/14-kdddeep-walk-2)
- Use a sentence embedding model
- [g]DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning [DeepPath, EMNLP 2017] [Code](https://github.com/xwhan/DeepPath Notes: https://zhuanlan.zhihu.com/p/33536026)
- [g]Know-Evolve: Deep Temporal Reasoning for Dynamic Knowledge Graphs [Know-Evolve, ICML 2017]{Paper}{Code (C++)}
- [g]Reading and Reasoning with Knowledge Graphs [PhD Thesis of Matthew Gardner] {Thesis}
- Reasoning, Relation Extraction, Modeling Lexical Semantics
- EventKG: A Multilingual Event-Centric Temporal Knowledge Graph
- Has time and location info
- A system that integrates knowledge from different existing KBs
- Knowledge Graph Embedding: A Survey of Approaches and Applications {Paper}
- Multilingual Knowledge Graph Embedding for Cross-lingual Knowledge Alignment. {Slides}
- [g]Dynamic Word Embeddings {Paper}
- [g]DYREP: LEARNING REPRESENTATIONS OVER DYNAMIC GRAPHS [ICLR 2019] {Paper}
- [g]Know-Evolve: Deep Temporal Reasoning for Dynamic Knowledge Graphs [Know-Evolve, ICML 2017]{Paper}{Code (C++)}
- [g] Continuous-Time Dynamic Network Embeddings [WWW 2018] {Paper}
Survey:
- Knowledge Graph Refinement:A Survey of Approaches and Evaluation Methods
Knowledge Base Completion:
- Knowledge base completion via search-based question answering [WWW 2014]{Paper}
- Knowledge base completion via coupled path ranking [ACL 2016]{Paper}
Knowledge Base Population:
- Overview: https://nlp.stanford.edu/projects/kbp/
- Slides of Summary: https://pdfs.semanticscholar.org/8193/1b57a1760a4fc9e8b42065bebcab4cbf164e.pdf
- A Fresh Look on Knowledge Bases Distilling Named [CIKM 2014]
- Event KB. Each news article is regarded as a event. Build the semantic similarity relations and the tmporal relations between evernts.
- CERES: Distantly Supervised Relation Extractionfrom the Semi-Structured Web [CERES, VLDB 2018]
- When Open Information Extraction Meets the Semi-Structured Web [OpenCERES, NAACL 2019]
- How to Keep a Knowledge Base Synchronized with Its Encyclopedia Source [IJCAI 2017] {Notes}
- https://kgtutorial.github.io An introduction to knowledge graph and knowledge extraction from unstructured text.
- https://github.com/impillar/knowledge_graph/blob/master/README.md
- https://github.com/BrambleXu/knowledge-graph-learning
- https://github.com/Pelhans/Z_knowledge_graph
- https://zhuanlan.zhihu.com/p/44904796
- Information Extraction by Niranjan Balasubramanian {Slides in my Mac}
- Probabilistic Graphical Models: Lagrangian Relaxation Algorithms for Natural Language Processing {Slides}
- Introduction to Conditional Random Fields {Blog}
- Network Community Detection: A Review and Visual Survey {Paper}
- Section 2.3. Community Detection Techniques
- Fast unfolding of communities in large networks {Paper}
- A discussion of the Louvain method: https://www.quora.com/Is-there-a-simple-explanation-of-the-Louvain-Method-of-community-detection, wiki of the Louvein Modularity: https://en.wikipedia.org/wiki/Louvain_Modularity
- How do they design the function Q: Finding and evaluating community structure in networks {Paper}
- A compendium of NP optimization problems http://www.nada.kth.se/~viggo/wwwcompendium/
- Notes about LSH: https://blog.csdn.net/yc461515457/article/details/48845775
- Survey about Min Hash Sketch: http://www.cohenwang.com/edith/Surveys/minhash.pdf
- MinHash Tutorial with Python Code: https://mccormickml.com/2015/06/12/minhash-tutorial-with-python-code/ https://github.com/chrisjmccormick/MinHash
- GNN: https://github.com/thunlp/GNNPapers
- Wikidata Integrator
- Stanford KBP
- DBpeida Spotlight
- OpenTapioca {Link}
- spaCy {[Link](https://spacy.io/api/annotation#section-named-entities https://towardsdatascience.com/named-entity-recognition-with-nltk-and-spacy-8c4a7d88e7da)}
- NLTK
- DBpedia Spotlight {Link}
- BOOKNLP https://github.com/dbamman/book-nlp (a natural language processing pipeline that scales to books and other long documents (in English))
- From Freebase to Wikidata: The Great Migration {Paper and useful links}
- SPASQL tutorial {Link}
- Installing and running ElasticSearch {Link}
- Open KG on COVID-19 [Link]