Starred repositories
[AAAI 2024] Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection
A Survey on Multimodal Retrieval-Augmented Generation
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Source code of PivotNet (ICCV2023, PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction)
[ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning
[ICRA2023] CoAlign: Robust Collaborative 3D Object Detection in Presence of Pose Errors
[Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Efficient Multimodal Large Language Models: A Survey
[ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric
Code of "OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments".
[ICLR2024] HEAL: An Extensible Framework for Open Heterogeneous Collaborative Perception ➡️ All You Need for Multi-Modality Collaborative Perception!
aimicm / HEAL
Forked from yifanlu0227/HEAL[ICLR2024] HEAL: An Extensible Framework for Open Heterogeneous Collaborative Perception ➡️ All You Need for Multi-Modality Collaborative Perception!
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
(IEEE TIV) A Comprehensive Framework for 3D Occupancy Estimation in Autonomous Driving
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
This repository is for CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection.
The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data
【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。
how to optimize some algorithm in cuda.
[NeurIPS Workshop 2019] Official code of the paper "Probabilistic 3D Multi-Object Tracking for Autonomous Driving." First Place of the First NuScenes Tracking Challenge in the AI Driving Olympics W…
An open-source tool-augmented conversational language model from Fudan University
Code and Data for "Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features"