Skip to content
View yukke42's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report yukke42

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

Python 617 26 Updated Dec 16, 2023

Explore the Multimodal “Aha Moment” on 2B Model

Python 595 21 Updated Mar 18, 2025

Collect every awesome work about r1!

Python 392 12 Updated May 2, 2025

A library for advanced large language model reasoning

Python 2,156 191 Updated Jun 10, 2025

Build multimodal language agents for fast prototype and production

Python 2,517 281 Updated Mar 19, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,612 160 Updated May 21, 2025

Everything you need to know to build your own RAG application

Jupyter Notebook 2,924 292 Updated Mar 26, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,340 172 Updated Mar 28, 2025

One for All Modalities Evaluation Toolkit - including text, image, video, audio tasks.

Python 2,690 318 Updated Jun 29, 2025

Deep Learning tools and applications for NVIDIA AGX platforms.

Shell 229 48 Updated Jun 19, 2025

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

Jupyter Notebook 6,331 695 Updated Jun 27, 2025

Awesome Data-Driven Autonomous Driving Solutions. Also the official repository of our survey paper: Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Min…

172 8 Updated Mar 20, 2024

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 41,270 3,285 Updated Jun 30, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,759 380 Updated Jun 18, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 15,973 1,878 Updated Dec 25, 2024

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 5,670 176 Updated May 31, 2025

A unified framework for 3D content generation.

Jupyter Notebook 6,801 527 Updated Dec 16, 2024

TripoSR: Fast 3D Object Reconstruction from a Single Image

Python 5,518 647 Updated Aug 16, 2024

Generative Models by Stability AI

Python 26,092 2,904 Updated May 20, 2025

RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)

Python 356 31 Updated Aug 31, 2024

An open source implementation of CLIP.

Python 12,047 1,120 Updated Jun 10, 2025

A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…

Python 1,019 92 Updated Jun 27, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,440 650 Updated May 29, 2025

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 964 133 Updated Apr 12, 2024

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 2,573 211 Updated Jun 26, 2025

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Python 3,445 653 Updated Aug 23, 2024

The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multi…

Python 270 39 Updated May 30, 2025

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 2,093 627 Updated Aug 9, 2023

Python class for calculating confusion matrix for object detection task

Python 90 20 Updated Apr 21, 2022
Next