Starred repositories

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,341 718 Updated Jun 28, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

404 19 Updated Jun 23, 2025

The official implementation of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"

Python 190 12 Updated May 30, 2025

Geologic models from Llama 4 language model + GemPy!

Jupyter Notebook 54 18 Updated May 25, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 345 15 Updated Jun 13, 2025

OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement

Python 91 5 Updated May 19, 2025

An open-source implementation for fine-tuning the Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 879 121 Updated Jun 25, 2025

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 3,427 256 Updated Jun 24, 2025

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 593 38 Updated May 27, 2025

MMR1: Advancing the Frontiers of Multimodal Reasoning

161 5 Updated Mar 17, 2025

Collections of Papers and Projects for Multimodal Reasoning.

105 9 Updated Apr 25, 2025

A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.

64 7 Updated Mar 18, 2025

Explore the Multimodal “Aha Moment” on a 2B Model

Python 595 20 Updated Mar 18, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 2,025 84 Updated Jun 26, 2025

This repository provides a valuable reference for researchers in the field of multimodality. Start your exploration of RL-based reasoning MLLMs here!

937 43 Updated Jun 18, 2025

Chrome / Edge extension to turn arXiv papers into Markdown in one click.

JavaScript 78 9 Updated Mar 20, 2025

R1-Vision: Let's first take a look at the image

Python 47 1 Updated Feb 16, 2025

MM-Eureka V0, also called R1-Multimodal-Journey. The latest version is in MM-Eureka.

Python 310 9 Updated Jun 21, 2025

Fully open reproduction of DeepSeek-R1

Python 24,912 2,313 Updated Jun 26, 2025

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Python 45 Updated Dec 3, 2024

Google AI Studio Starter Apps

TypeScript 1,153 427 Updated Feb 4, 2025

This repository contains the code for the paper [HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting](https://gujiaqivadin.github.io/hybridgs/).

Python 49 1 Updated Dec 13, 2024

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

TypeScript 1,732 66 Updated Apr 1, 2025

Code for SpotLessSplats: Ignoring Distractors in 3D Gaussian Splatting built on gsplat codebase.

Cuda 188 10 Updated May 9, 2025

Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine

Python 467 48 Updated Jan 14, 2025

MoonPalace (月宫) is an API debugging tool provided by Moonshot AI (月之暗面).

Go 193 4 Updated Dec 30, 2024

A Unified Toolkit for Deep Learning-Based Table Extraction

Python 40 6 Updated Nov 21, 2024

A High-efficiency Open-source Toolkit for Table-to-Latex Task

Python 247 22 Updated Dec 12, 2024