Stars
cvJie / Awesome-Vision-Language-Action-Models
Forked from nicehiro/Awesome-Vision-Language-Action-ModelsNew repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
[CVPR'25 Highlight] You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Hackable and optimized Transformers building blocks, supporting a composable construction.
The reinforcement learning training code for AgiBot X1.
[CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"
VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
Aligning pretrained language models with instruction data generated by themselves.
🦜🔗 Build context-aware reasoning applications
An arbitrary face-swapping framework on images and videos with one single trained model!
Tracking and collecting papers/projects/others related to Segment Anything.
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
ImageBind One Embedding Space to Bind Them All
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Open-vocabulary Object Segmentation with Diffusion Models
Fast and memory-efficient exact attention
Official repo for consistency models.
GLIDE: a diffusion-based text-conditional image synthesis model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Denoising Diffusion Probabilistic Models
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Stable Diffusion web UI
This is a Pytorch implementation of deep image blending