Skip to content
View jeradf's full-sized avatar

Block or report jeradf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

65 stars written in Java
Clear filter

The official home of the Presto distributed SQL query engine for big data

Java 16,557 5,503 Updated Nov 7, 2025

OpenRefine is a free, open source power tool for working with messy data and improving it

Java 11,582 2,090 Updated Nov 6, 2025

No clever tagline needed.

Java 11,424 1,382 Updated Nov 7, 2025

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 8,360 4,429 Updated Nov 7, 2025

Upserts, Deletes And Incremental Processing on Big Data.

Java 6,009 2,438 Updated Nov 7, 2025

Apache Ignite

Java 5,001 1,924 Updated Nov 6, 2025

A machine learning software for extracting information from scholarly documents

Java 4,415 517 Updated Nov 6, 2025

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

Java 3,615 799 Updated Jun 7, 2024

Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages.

Java 2,844 664 Updated Dec 17, 2015

an open source geocoder for openstreetmap data

Java 2,452 332 Updated Nov 6, 2025

Apache Drill is a distributed MPP query layer for self describing data

Java 1,996 988 Updated Nov 6, 2025

Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch

Java 1,519 374 Updated Oct 30, 2025

MacroBase: A Search Engine for Fast Data

Java 670 126 Updated Dec 14, 2022

A Question Answering system built on top of the Apache UIMA framework.

Java 622 205 Updated Aug 5, 2018

Generating Vectors for DBpedia Entities via Word2Vec and Wikipedia Dumps. Questions? https://gitter.im/idio-opensource/Lobby

Java 601 134 Updated Jan 11, 2018

Web-Scale Open Information Extraction

Java 542 132 Updated Mar 6, 2019

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer…

Java 479 144 Updated Jul 7, 2023

Fast Entity Linker Toolkit for training models to link entities to KnowledgeBase (Wikipedia) in documents and queries.

Java 339 82 Updated Feb 12, 2021

Java 8 Recommender Systems framework for novelty, diversity and much more

Java 278 57 Updated Jan 21, 2022

Terrier IR Platform

Java 268 61 Updated Jul 12, 2025

A machine learning tool for fishing entities

Java 263 24 Updated May 23, 2025

GERBIL - General Entity annotatoR Benchmark

Java 230 57 Updated Oct 14, 2025

Dexter is a framework that implements some popular algorithms and provides all the tools needed to develop any entity linking technique.

Java 211 56 Updated Apr 9, 2017

K# - Knowledge Sharing Platform

Java 186 44 Updated Oct 1, 2020

Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)

Java 178 59 Updated May 8, 2017

Distributed Model Serving Framework

Java 178 78 Updated Sep 30, 2025

A text tagger based on Lucene / Solr, using FST technology

Java 177 37 Updated Dec 18, 2023
Java 175 44 Updated Apr 10, 2023

A probabilistic approach from an Improbabilistic company

Java 152 33 Updated Mar 18, 2024
Next