Highlights
Stars
An open-source, cross-platform terminal for seamless workflows
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.
Lightweight coding agent that runs in your terminal
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…
Use the NVIDIA Audio2Face headless server and interact with it through a requests API. Generate animation sequences for Unreal Engine 5, Maya and MetaHumans
An open solution for AI-powered photorealistic digital humans.
Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms
🛡️ ⚛️ A simple, scalable, and powerful architecture for building production-ready React applications.
🚀 Bring your favorite shell wherever you go through SSH: xonsh, fish, zsh, osquery, and so on.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Apache Superset is a Data Visualization and Data Exploration Platform
A macOS setup guide specific to front-end development.
Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models
Data Labeling, Tracking and Annotation with AI
Voice activity detector (VAD) for the browser with a simple API
A multi-speaker, multilingual speech generation tool
Code for the article - Development of A Novel Robot-Assisted Vocabulary Learning System Using Pure Synthetic Data
A guidance language for controlling large language models.
A document version of my "Vipassana for Hackers" talk
⚛️ React.js components 💯% compatible with 🪐 Jupyter.
A guided tour on how to use HuggingFace large language models on Macs with Apple Silicon
Visualize Your Ideas With Code
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Pythonic AI generation of images and videos


