Stars
10
stars
written in Python
Clear filter
Stable Diffusion web UI
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Fast R-CNN Object Detection on Azure using CNTK
Hierarchical Context Pruning (HCP): A strategy to optimize real-world code completion with repository-level pre-trained code large language models





