Stars
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
High-performance In-browser LLM Inference Engine
Start building LLM-empowered multi-agent applications in an easier way.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
一个基于 Android 调试 API + 百度地图实现的虚拟定位工具,并且同时实现了一个可以自由移动的摇杆
Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a mu…
aider is AI pair programming in your terminal
Run any ComfyUI workflow w/ ZERO setup.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
我的 ComfyUI 工作流合集 | My ComfyUI workflows collection
A custom node set for Video Frame Interpolation in ComfyUI.
[WIP] Layer Diffusion for WebUI (via Forge)
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
A neural network that transforms a design mock-up into a static website.
i茅台app自动预约,每日自动预约,支持docker一键部署(本项目不提供成品,使用的是已淘汰的算法)
Workflow-to-APP、ScreenShare&FloatingVideo、GPT & 3D、SpeechRecognition&TTS
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A native, user-mode, multi-process, graphical debugger.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Instant voice cloning by MIT and MyShell. Audio foundation model.
TikTok 发布/喜欢/合辑/直播/视频/图集/音乐;抖音发布/喜欢/收藏/收藏夹/视频/图集/实况/直播/音乐/合集/评论/账号/搜索/热榜数据采集工具/下载工具