BUG1989
🎯
Focusing
  • axera
  • Shenzhen, Guangdong, China

Organizations

@AXERA-TECH

Starred repositories

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 8,463 652 Updated May 29, 2025

WeChat QRCode implemented in ncnn ⚡ QR code detection & decoding ⚡ ncnn ⚡

C++ 232 44 Updated Jul 4, 2023

DJI Payload SDK Official Repository

C 361 155 Updated Jun 27, 2025

Qwen2.5-Omni-3B on Axera

Python 5 Updated Jun 23, 2025

Demo of quickly running MOTRv2 on the Axera AX650 platform

Python 2 Updated Jun 19, 2025

in preparation...

Python 415 76 Updated Oct 14, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 30,416 4,251 Updated Nov 24, 2024

One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression

Jupyter Notebook 56 5 Updated Jun 23, 2025

Demo for Qwen2.5-VL-3B-Instruct on Axera device.

Python 10 2 Updated Apr 9, 2025

FunAudioLLM CosyVoice on Axera platform

Python 1 Updated Jun 3, 2025

Demo for satrn on Axera device.

Python 1 Updated Jun 13, 2025
Python 2 Updated Jun 18, 2025

LivePortrait DEMO on Axera

Python 3 Updated May 29, 2025
Python 1 Updated Jan 23, 2025

RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.

Python 2,295 251 Updated Jun 30, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,976 190 Updated May 19, 2025

Demo for InternVL3-2B on Axera device.

Python 5 1 Updated Jun 27, 2025

Multilingual Voice Understanding Model

Python 6,038 533 Updated Mar 23, 2025

FunASR SenseVoice on Axera

Python 1 1 Updated Apr 28, 2025

Demo for Janus-Pro-1B on Axera device.

Python 4 Updated May 19, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,305 821 Updated May 15, 2025

MixFormerV2 ONNX C++, MixFormerV2 TensorRT C++ and Python versions

C++ 19 3 Updated Feb 5, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,410 2,239 Updated Feb 1, 2025

[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking

Python 165 24 Updated Apr 20, 2024
Python 7 1 Updated Jul 2, 2025

Spark-TTS Inference Code

Python 9,920 1,049 Updated Apr 9, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 14,905 1,575 Updated Jun 29, 2025

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 647 95 Updated Jul 2, 2025

YOLOv12: Attention-Centric Real-Time Object Detectors

Python 2,001 265 Updated Jul 1, 2025

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,352 698 Updated Aug 5, 2024