Starred repositories

This user has many starred repositories - we’re only showing some of them.
[ACL 2023] Reasoning with Language Model Prompting: A Survey

989 70 Updated May 21, 2025
Python 1 Updated Jun 6, 2024

Summary of Japanese LLMs (日本語LLMまとめ) - Overview of Japanese LLMs

1 Updated Jun 6, 2024

A framework for few-shot evaluation of autoregressive language models.

Python 1 Updated Sep 7, 2024

[ACL 2023] Reasoning with Language Model Prompting: A Survey

1 Updated Dec 25, 2024

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

1 Updated Dec 25, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 1 Updated Oct 7, 2025
Python 10 Updated Dec 5, 2024
Python 16 4 Updated Jul 4, 2025
Python 5 1 Updated Dec 5, 2024

Repo for CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"

Python 140 37 Updated Jul 20, 2022
57 1 Updated Dec 6, 2024

code implementation of paper "UltraRE: Enhancing RecEraser for Recommendation Unlearning via Error Decomposition"

Python 8 1 Updated Dec 12, 2023
Python 18 Updated Dec 25, 2023

[ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image

55 2 Updated Sep 19, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 812 50 Updated Jun 16, 2025

Multimodal Models in Real World

Jupyter Notebook 551 23 Updated Feb 24, 2025

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,327 279 Updated May 4, 2024

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Python 247 13 Updated Apr 3, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,623 2,235 Updated Feb 1, 2025

[ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound

Python 148 8 Updated May 30, 2025

[ICCV'25] Official implementation of paper "Towards Performance Consistency in Multi-Level Model Collaboration"

Python 42 2 Updated Oct 23, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,438 563 Updated Nov 28, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,328 442 Updated Nov 28, 2025

[NeurIPS 2025] Thinkless: LLM Learns When to Think

Python 243 18 Updated Sep 26, 2025

Dimple, the first Discrete Diffusion Multimodal Large Language Model

Python 112 6 Updated Jul 9, 2025

This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers"

Python 53 1 Updated May 21, 2025