clownrat6

🤡

A holistic joke

🤡🐀 clownrat6

🤡 "V i L L A i N" without dawn

82 followers · 104 following

🤡 School
🤡 Gotham
clownrat6.github.io
@clownrat66

Achievements

Highlights

Pro

Organizations

Lists (12)

3D

📚 Dataset & Benchmark

⚡Efficient Attentions

🔮 Future ideas

Generation

⭐LLM

💵 Trade

Video Instance Segmentation

Video LLM

⭐Vision-Language Pretraining

Visual Reasoning MLLM

Visual Tokenizer

Stars

yjsunnn / DLoRAL

[NeurIPS'25] One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution

Python 298 16 Updated Nov 3, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,810 427 Updated Nov 8, 2025

SandAI-org / MagiAttention

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 550 33 Updated Nov 8, 2025

Xnhyacinth / Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,810 76 Updated Nov 6, 2025

xiaobai1217 / Awesome-Video-Datasets

Video datasets

1,547 110 Updated Mar 8, 2023

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 628 42 Updated Jun 5, 2025

joez17 / VideoNIAH

VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

Python 50 1 Updated Mar 9, 2025

Nemo2011 / bilibili-api

Common Bilibili API calls. Supports video, anime series, user, channel, audio, and other features. Original repository: https://github.com/MoyuScript/bilibili-api

Python 3,109 293 Updated Nov 9, 2025

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,240 4,779 Updated Jun 2, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,131 1,916 Updated Nov 1, 2025

ShaoQiBNU / YouTube_get_video

Scrape videos from YouTube

Python 93 22 Updated Mar 24, 2020

SihengLi99 / RePO

RePO: Replay-Enhanced Policy Optimization

Python 22 1 Updated Jun 12, 2025

bytedance / video-SALMONN-2

video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions, which is developed by the Department of Electronic Engineering at Tsin…

Python 108 8 Updated Oct 21, 2025

OpenGVLab / V2PE

[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Python 57 2 Updated Dec 13, 2024

daixiangzi / Awesome-Token-Compress

A paper list of recent work on token compression for ViT and VLM

729 30 Updated Nov 5, 2025

MoonshotAI / Kimi-K2

Kimi K2 is the large language model series developed by Moonshot AI team

8,719 582 Updated Nov 7, 2025

Kwai-Keye / Keye

Python 693 12 Updated Nov 1, 2025

Zhao-Yian / GraCo

[CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.

Python 60 2 Updated Mar 11, 2025

yuezih / Movie101

Narrative movie understanding benchmark

Python 76 Updated Jun 11, 2025

mutonix / Vript

Python 155 3 Updated Jan 16, 2025

jssprz / video_captioning_datasets

Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*

Jupyter Notebook 130 11 Updated Oct 27, 2023

mathllm / MathCoder

[MathCoder, MathCoder-VL] Family of LLMs/LMMs for mathematical reasoning.

Python 329 26 Updated Oct 18, 2025

NVlabs / RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,384 49 Updated Oct 17, 2025

snap-research / DELTA_densetrack3d

DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)

Python 131 3 Updated Apr 6, 2025

OpenNLPLab / FAVDBench

[CVPR 2023] Official implementation of the paper: Fine-grained Audible Video Description

Python 74 5 Updated Dec 4, 2023

intelpro / CBMNet

Official repository of "Event-based Video Frame Interpolation with Cross-Modal Asymmetric Bidirectional Motion Fields", CVPR 2023 paper(highlight)

Python 79 4 Updated Apr 6, 2025

xuxw98 / ESAM

[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time

Python 580 27 Updated May 7, 2025

apple / ml-cross-entropy

Python 545 53 Updated Sep 23, 2025

fla-org / native-sparse-attention

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 919 47 Updated Mar 19, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,873 302 Updated Nov 8, 2025