Skip to content
View tyxiong23's full-sized avatar

Highlights

  • Pro

Block or report tyxiong23

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Data and sample evaluation codes for Multimodal Rewardbench 2

Python 82 6 Updated Dec 20, 2025
Jupyter Notebook 172 2 Updated Dec 19, 2025

The official code of "VisCoder2: Building Multi-Language Visualization Coding Agents"

Python 7 Updated Nov 17, 2025
Jupyter Notebook 554 47 Updated Nov 2, 2024

Multimodal Large Language Models for Code Generation under Multimodal Scenarios

185 6 Updated Dec 24, 2025
Python 40 6 Updated Nov 12, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,479 295 Updated Dec 19, 2025

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Gemini, Ollam…

Python 1,477 97 Updated Jul 27, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,444 816 Updated Dec 25, 2025
Python 5 Updated Dec 2, 2025

Open source codebase for PRBench

Python 9 Updated Nov 20, 2025

Computer-Use Agents as Judges for Generative UI

Python 38 5 Updated Nov 27, 2025

This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs

Jupyter Notebook 36 2 Updated Mar 9, 2025

The first Interleaved framework for textual reasoning within the visual generation process

153 1 Updated Nov 21, 2025

This is the github to open source benchmark AdvancedIF, see LAMA L1387358RCRO

Python 21 Updated Nov 26, 2025
Python 65 3 Updated Nov 5, 2025

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 476 35 Updated Dec 24, 2025

This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"

Python 255 18 Updated Oct 15, 2025

Project Page for "Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following"

JavaScript 1 Updated Dec 10, 2025
Python 13 1 Updated Dec 10, 2025

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 321 9 Updated Oct 14, 2025

PhD/MBA-level human-annotated rubrics dataset across Physics, Chemistry, Finance and Consulting

Python 26 1 Updated Oct 30, 2025

Async pipelined version of Verl

Python 125 13 Updated Apr 8, 2025

PromSketch: Approximation-First Timeseries Query at Scale

Go 24 3 Updated Nov 20, 2025

Arya: Arbitrary Graph Pattern Mining with Decomposition-based Sampling

C++ 16 3 Updated Sep 27, 2023

Fully Open Framework for Democratized Multimodal Training

Python 663 53 Updated Dec 15, 2025

VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning

Python 249 8 Updated Sep 27, 2025

Reinforcement Learning of Vision Language Models with Self Visual Perception Reward

Python 154 17 Updated Sep 23, 2025

The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"

Python 244 15 Updated Nov 16, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,290 114 Updated Dec 16, 2025
Next