Stars
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
[CVPR2024 Highlight] The official repo for paper "Abductive Ego-View Accident Video Understanding for Safe Driving Perception"
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
🧙AutoDev: The AI-powered coding wizard(AI 驱动编程助手)with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Do…
[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding
[CVPR 2025] Video Narration as Vocabulary & Video as Long Document
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
[NeurIPS 2022] Egocentric Video-Language Pretraining
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
Markdown parser, done right. 100% CommonMark support, extensions, syntax plugins & high speed
Methods and Implements of Deep Clustering
computing-intelligence / ai-edu
Forked from microsoft/ai-eduAI education materials for Chinese students, teachers and IT professionals.