-
Yonsei University
- Seoul, Republic of Korea
- https://jerife.org
Highlights
- Pro
Stars
[ACL 2025 Industry Track] A Large-Scale Real-World Evaluation of an LLM-Based Virtual Teaching Assistant
Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
Awesome artificial intelligence (AI) and large language model (LLM) for education papers.
A curated list of LLM researches and applications in education.
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳
Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023
[ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts"
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
Minimal Implementation of a D3PM in pytorch
Unofficial PyTorch implementation of Discrete Denoising Diffusion Probabilistic Model(D3PM)
Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".
Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
A benchmark dataset for evaluating LLM's SVG editing capabilities
Official PyTorch implementation for "Large Language Diffusion Models"
Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step
Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"
ROOT: VLM based System for Indoor Scene Understanding and Beyond
Code release for "Deep Texture Manifold for Ground Terrain Recognition", CVPR 2018