Rich Zhao zRich

🎯 Focusing

Rich Zhao in Shenzhen, China

24 followers · 37 following

Stars

lit / lit

Lit is a simple library for building fast, lightweight web components.

TypeScript · 19,883 stars · 975 forks · Updated Jun 24, 2025

microsoft / data-formulator

🪄 Create rich visualizations with AI

TypeScript · 12,639 stars · 1,020 forks · Updated Jul 2, 2025

hwjiang1510 / MegaSynth

Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025)

Python · 185 stars · 5 forks · Updated May 20, 2025

microsoft / TRELLIS

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python · 9,997 stars · 862 forks · Updated May 30, 2025

Jixuan-Fan / Momentum-GS

[ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction

Python · 147 stars · 6 forks · Updated Jun 26, 2025

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook · 22,225 stars · 2,375 forks · Updated Mar 13, 2025

docling-project / docling

Get your documents ready for gen AI

Python · 33,413 stars · 2,204 forks · Updated Jul 1, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python · 48,296 stars · 5,311 forks · Updated Jun 27, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python · 41,091 stars · 5,317 forks · Updated Aug 16, 2024

aigc-apps / EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python · 2,173 stars · 166 forks · Updated Mar 6, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python · 12,503 stars · 1,803 forks · Updated Jun 24, 2025

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python · 9,206 stars · 724 forks · Updated May 27, 2025

CyberAgentAILab / TANGO

[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation

Python · 1,063 stars · 137 forks · Updated Jun 26, 2025

johndpope / VASA-1-hack

Using Claude Sonnet 3.5 to forward (reverse) engineer code from VASA white paper - WIP - (this is for La Raza 🎷)

Python · 294 stars · 36 forks · Updated Nov 9, 2024

antgroup / echomimic

[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python · 3,944 stars · 438 forks · Updated Dec 10, 2024

xyflow / xyflow

React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https://svelteflow.dev). Ready out-of-the-box and infinitely cust…

TypeScript · 30,364 stars · 1,970 forks · Updated Jun 27, 2025