Lists (10)
Sort Name ascending (A-Z)
Stars
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM …
Fully open data curation for reasoning models
The Open All-in-One Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.
Create Epic Math and Physics Animations From Text.
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or look…
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
🍏 + 🎯 + 🐍 = Query Apple's FindMy Network with Python!
Examples and guides for using the Gemini API
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
On-device Speech Recognition for Apple Silicon
This SDK is now deprecated, use the unified Firebase SDK.
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Deep Learning Visualization Toolkit(『飞桨』深度学习可视化工具 )
This repo contains the sample code of the Azure Search and Cognitive Services used to provide insights and analysis around the JFK Files.
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
a machine learning image inpainting task that instinctively removes watermarks from image indistinguishable from the ground truth image
[AAAI 2021] Split then Refine: Stacked Attention-guided ResUNets for Blind Single Image Visible Watermark Removal
An open-source RAG-based tool for chatting with your documents.
Speech To Speech: an effort for an open-sourced and modular GPT4-o
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
🚀 Awesome list of open source applications for macOS. https://t.me/s/opensourcemacosapps