Skip to content
View wanghqc's full-sized avatar
  • Qualcomm
  • San Diego, CA, USA
  • 20:02 (UTC -07:00)
  • LinkedIn in/hongqiang

Block or report wanghqc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

MLX: An array framework for Apple silicon

C++ 22,522 1,361 Updated Oct 17, 2025

Distributed MoE in a Single Kernel [NeurIPS '25]

Cuda 87 11 Updated Sep 30, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,869 1,852 Updated Oct 6, 2025

LLM inference in C/C++

C++ 19 3 Updated Oct 18, 2025

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 4,787 763 Updated May 12, 2025

Beignet is an open source implementation of the OpenCL specification - a generic compute oriented API. Here is Beignet Source Code Mirror in github- This is a publish-only repository and all pull r…

C++ 101 40 Updated Jan 7, 2023

Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver

C++ 40 38 Updated Oct 17, 2025

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

C++ 4,708 423 Updated Oct 14, 2025

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Python 808 140 Updated Oct 14, 2025

pocl - Portable Computing Language

C 1,026 278 Updated Oct 17, 2025

LM Studio CLI

TypeScript 3,778 291 Updated Oct 7, 2025

Microsoft Automatic Mixed Precision Library

Python 626 48 Updated Sep 29, 2024

Print all known information about all available OpenCL platforms and devices in the system

C 362 84 Updated Jun 24, 2025

A comprehensive 10-page probability cheatsheet that covers a semester's worth of introduction to probability.

TeX 3,121 703 Updated Jun 15, 2022

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,952 2,626 Updated Oct 17, 2025

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,592 572 Updated Oct 17, 2025

Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)

316 71 Updated May 28, 2023

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,287 77 Updated Mar 6, 2025

A C++ GPU Computing Library for OpenCL

C++ 1,630 338 Updated Aug 14, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,930 1,877 Updated Jul 15, 2025

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

Python 14,716 1,305 Updated Apr 6, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 44,784 6,456 Updated Oct 18, 2025

Inference Llama 2 in one file of pure C

C 18,856 2,387 Updated Aug 6, 2024

Inference code for Llama models

Python 58,846 9,811 Updated Jan 26, 2025

A curated list of awesome computer vision resources

22,608 4,378 Updated May 17, 2024

LLM inference in C/C++

C++ 88,001 13,373 Updated Oct 18, 2025

OpenCL for Visual Studio Code

TypeScript 40 8 Updated Oct 3, 2025

Tensor Tiling Library

C 37 5 Updated Sep 23, 2025

A profiler to disclose and quantify hardware features on GPUs.

C++ 174 22 Updated May 15, 2022
Next