Lists (1)
Sort Name ascending (A-Z)
Stars
A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
A multi-factor equity risk model for quantitative trading.
Build Real-Time Knowledge Graphs for AI Agents
Hierarchical Generation of Molecular Graphs using Structural Motifs
Precision Medicine Knowledge Graph (PrimeKG)
A simple screen parsing tool towards pure vision based GUI agent
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
Code and other material for the book "Deep Learning and the Game of Go"
Minimal reproduction of DeepSeek R1-Zero
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Protein language model customized for antibodies
[ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
AFusion: AlphaFold 3 GUI & Toolkit with Visualization
A course on aligning smol models.
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Awesome speech/audio LLMs, representation learning, and codec models
first base model for full-duplex conversational audio
Improved Sentence Alignment in Linear Time and Space
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
A comprehensive library for computational molecular biology
MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an excellent balance between performance and efficiency.
Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"