Stars
MineContext is your proactive context-aware AI partner (Context-Engineering + ChatGPT Pulse)
A Next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
Paper Debugger is the best Overleaf companion
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Official code for Paper: "Can One Modality Model Synergize Training of Other Modality Models?" implemented in PyTorch
Multimodal Information Bottleneck: Learning Minimal Sufficient Unimodal and Multimodal Representations (MIB for multimodal sentiment analysis)
[ICLR 2025] Multi-modal representation learning of shared, unique and synergistic features between modalities
[NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Code for paper: "An Information Criterion for Controlled Disentanglement of Multimodal Data"
A searchable, filterable gallery of AI-Scientist prompts with one-click copy and reading guides, suited to research and writing workflows.
The official implementation for "Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Image Generation" (NeurIPS 2025)
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
[KDD 2025] Connecting Domains and Contrasting Samples: A Ladder for Domain Generalization (DCCL)
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
[CVPR 2024] Official implementation of <Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation>
Source code and sample data for paper "Multi-Level Feature Transmission in Dynamic Channels: A Semantic Knowledge Base and Deep Reinforcement Learning-Enabled Approach"
This is the code for the CVPR 2022 paper "Bayesian Invariant Risk Minimization".
Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment [CVPR-2024]