Stars
A curated collection of fun and creative examples generated with Nano Banana🍌, a Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's development…
Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana), a state-of-the-art image generation and editing model. Explore AI generated visuals created with…
Official code for our ICCV2025 paper "SDMatte: Grafting Diffusion Models for Interactive Matting"
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
[CVPR 2025] Towards In-the-wild 3D Plane Reconstruction from a Single Image
A ComfyUI custom node designed for advanced image background removal and object, face, clothes, and fashion segmentation, utilizing multiple models including RMBG-2.0, INSPYRENET, BEN, BEN2, BiRefN…
Code for "gen2seg: Generative Models Enable Generalizable Instance Segmentation"
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
This repository is dedicated to maintaining, updating, and fixing bugs in my inpainting ComfyUI workflow, previously hosted on CivitAI under the name: Proper Flux Control-Net inpainting and…
Reference PyTorch implementation and models for DINOv3
ObjectClear: Complete Object Removal via Object-Effect Attention
[NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception
LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨ (ICCV 2025 Highlight)
[SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views
Official repo for paper "Sparse Representation and Construction for High-Resolution 3D Shapes Modeling".
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
[NeurIPS 2024] GSDF: 3DGS Meets SDF for Improved Rendering and Reconstruction
3DGS-to-PC: 3D Gaussian Splatting to Dense Point Clouds [3D-VAST: ICCVW 2025]
The official implementation of ICCV'25 paper "FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution"
Ray tracing and hybrid rasterization of Gaussian particles
Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive, semi-automatic image annotation tool.
Exporting Segment Anything, MobileSAM, and Segment Anything 2 into ONNX format for easy deployment
This repository shows how to solve the ONNX export issue in the Segment Anything model
Effortless data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), and MobileSAM!