C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Sparsity-aware deep learning inference runtime for CPUs
Industrial-strength Natural Language Processing (NLP)
Han Language Processing
Efficient Retrieval Augmentation and Generation Framework
Unified embedding model
Large Language Model Text Generation Inference
Data and tools for generating and inspecting OLMo pre-training data
Obsei is a low-code, AI-powered automation tool
Pretrained model hub for Keras 3
Toolkit for conversational AI
The Classical Language Toolkit
Evaluation code for various unsupervised automated metrics
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
A tool for learning vector representations of words and entities
A full spaCy pipeline and models for scientific/biomedical documents
Transformers4Rec is a flexible and efficient library
Easy-to-use and powerful NLP library with an awesome model zoo
Training data (data labeling, annotation, workflow) for all data types
Extract schema, statistics and entities from datasets
A Repo For Document AI
ReFT: Representation Finetuning for Language Models
Local Lambda debug, CodeWhisperer, SAM/CFN syntax, etc.
Semantic search and workflows for medical/scientific papers
A Heterogeneous Benchmark for Information Retrieval