Skip to content
View aimicm's full-sized avatar

Block or report aimicm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[AAAI 2024] Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection

Python 10 1 Updated Jan 24, 2025

A Survey on Multimodal Retrieval-Augmented Generation

414 17 Updated Nov 8, 2025
Python 46 2 Updated May 6, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,332 1,209 Updated Oct 28, 2025

Source code of PivotNet (ICCV2023, PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction)

Python 121 12 Updated Mar 20, 2024

[ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning

Python 202 18 Updated Aug 28, 2025

[ICRA2023] CoAlign: Robust Collaborative 3D Object Detection in Presence of Pose Errors

Python 174 11 Updated Jul 23, 2024

[Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective

517 33 Updated Nov 2, 2025

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,781 82 Updated Jul 27, 2025

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,607 1,115 Updated Sep 14, 2024

Efficient Multimodal Large Language Models: A Survey

375 20 Updated Apr 29, 2025

[ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric

Python 377 30 Updated Aug 15, 2024

Code of "OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments".

Python 349 17 Updated Jun 5, 2025

[ICLR2024] HEAL: An Extensible Framework for Open Heterogeneous Collaborative Perception ➡️ All You Need for Multi-Modality Collaborative Perception!

Python 212 20 Updated Jan 1, 2025

[ICLR2024] HEAL: An Extensible Framework for Open Heterogeneous Collaborative Perception ➡️ All You Need for Multi-Modality Collaborative Perception!

Python 1 Updated Mar 14, 2024

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,634 187 Updated Apr 20, 2024

Masked World Models for Visual Control

Python 131 9 Updated Jun 11, 2023

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

67,241 7,604 Updated Jun 4, 2025

(IEEE TIV) A Comprehensive Framework for 3D Occupancy Estimation in Autonomous Driving

Python 215 13 Updated Dec 5, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,612 528 Updated Aug 29, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 21,789 2,555 Updated Oct 19, 2025

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,147 565 Updated Aug 22, 2025

This repository is for CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection.

Python 27 6 Updated Oct 11, 2023

The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data

Python 63 10 Updated Dec 1, 2023

[CoRL 2023] Robot Parkour Learning

Python 901 136 Updated Oct 26, 2025

【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。

2,129 208 Updated Mar 30, 2024

how to optimize some algorithm in cuda.

Cuda 2,616 238 Updated Nov 7, 2025

[NeurIPS Workshop 2019] Official code of the paper "Probabilistic 3D Multi-Object Tracking for Autonomous Driving." First Place of the First NuScenes Tracking Challenge in the AI Driving Olympics W…

Python 398 79 Updated Jan 29, 2024

An open-source tool-augmented conversational language model from Fudan University

Python 12,065 1,138 Updated Jul 13, 2024

Code and Data for "Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features"

Python 142 27 Updated Mar 26, 2025
Next