Skip to content
View WhiteDOU's full-sized avatar
🎯
Focusing
🎯
Focusing
  • ZheJiang University
  • Hang Zhou, Zhe Jiang

Block or report WhiteDOU

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model

Python 183 10 Updated May 27, 2025

Awesome RL Reasoning Recipes ("Triple R")

713 42 Updated Jun 16, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

937 43 Updated Jun 18, 2025

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

449 20 Updated Jun 5, 2025

a-m-team's exploration in large language modeling

162 3 Updated May 29, 2025

A Survey of Direct Preference Optimization (DPO)

43 Updated Jun 28, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,105 1,666 Updated Jun 29, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,215 700 Updated Jun 19, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,757 380 Updated Jun 18, 2025

Official Repo for Open-Reasoner-Zero

Python 1,977 106 Updated Jun 2, 2025

Fully open reproduction of DeepSeek-R1

Python 24,912 2,312 Updated Jun 26, 2025

This is the official implementation of VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

Python 16 Updated Mar 4, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,542 1,530 Updated Jun 13, 2025

Official repository of IDEA-Bench

Python 35 2 Updated Jan 24, 2025
Python 50 Updated Dec 20, 2024
Python 14 4 Updated Dec 16, 2022

Official repository of In-Context LoRA for Diffusion Transformers

1,932 93 Updated Dec 20, 2024

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,356 201 Updated Jun 2, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 50,985 8,391 Updated Jun 29, 2025

Ongoing research training transformer models at scale

Python 12,696 2,878 Updated Jun 28, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,515 443 Updated Jun 28, 2025

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,621 350 Updated May 29, 2025
Python 7 Updated Jun 18, 2024

Pandora: Towards General World Model with Natural Language Actions and Video States

Python 504 34 Updated Sep 23, 2024

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 899 35 Updated Jun 27, 2025

A curated list of awesome Multimodal studies.

217 20 Updated Jun 27, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,252 1,502 Updated Jun 26, 2025

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,199 91 Updated Feb 16, 2025

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 480 28 Updated Apr 4, 2025

A collection of awesome video generation studies.

TeX 566 22 Updated Jun 16, 2025
Next