Skip to content
View dereyly's full-sized avatar

Block or report dereyly

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,522 606 Updated Nov 20, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,763 116 Updated Sep 16, 2025

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 489 22 Updated Mar 17, 2025

Official code for ICLR 2024 paper, "A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation"

Python 83 1 Updated Apr 21, 2024

[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,905 312 Updated Feb 19, 2025

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Python 4,764 774 Updated Mar 7, 2025
Python 4,412 422 Updated Sep 14, 2025

Emotional FusionBrain Challenge 4.0 - dev

Python 5 Updated Oct 2, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 91,254 11,448 Updated Sep 8, 2025

High-resolution models for human tasks.

Python 5,223 305 Updated Nov 18, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,327 407 Updated Jun 28, 2024

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,401 50 Updated Oct 17, 2025

Collection of awesome parameter-efficient fine-tuning resources.

579 17 Updated Oct 6, 2025

Collection of AWESOME vision-language models for vision tasks

3,010 225 Updated Oct 14, 2025

EVA Series: Visual Representation Fantasies from BAAI

Python 2,608 188 Updated Aug 1, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,435 1,594 Updated Aug 9, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,848 2,674 Updated Jul 3, 2025

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,586 233 Updated Jun 14, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,036 2,665 Updated Aug 12, 2024

AI Journey 2023: Russian Sign Language Recognition (Equal AI Track)

Jupyter Notebook 5 Updated Oct 24, 2023

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,822 1,320 Updated Aug 14, 2024

1st place solution to the Google - American Sign Language Fingerspelling Recognition competition

Python 172 32 Updated Aug 31, 2023
Jupyter Notebook 22 1 Updated Oct 4, 2023

①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

Jupyter Notebook 280 13 Updated Aug 12, 2024

[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"

Python 67 1 Updated Oct 15, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

68,137 7,722 Updated Jun 4, 2025

The definitive Web UI for local AI, with powerful features and easy setup.

Python 45,479 5,837 Updated Nov 24, 2025
Next