Skip to content
View MachineLP's full-sized avatar
:octocat:
I may be slow to respond.
:octocat:
I may be slow to respond.

Block or report MachineLP

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.

Python 1,008 40 Updated Oct 27, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 7,964 523 Updated Oct 27, 2025

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

Jupyter Notebook 448 37 Updated Oct 27, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,423 2,166 Updated Dec 25, 2024

Implementation of the paper "DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients"

Jupyter Notebook 586 81 Updated Jan 9, 2025

Contexts Optical Compression

Python 17,985 1,171 Updated Oct 25, 2025

Enjoy the magic of Diffusion models!

Python 10,459 977 Updated Oct 27, 2025

Practice Code for text to image trainer

Python 501 37 Updated Oct 4, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,613 1,216 Updated Oct 27, 2025

Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"

Python 224 40 Updated Aug 31, 2022

Official Tensorflow implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection" (AAAI 2022 Oral)

Python 588 89 Updated Jul 11, 2023

吴恩达《ChatGPT Prompt Engineering for Developers》课程中英版

Jupyter Notebook 280 31 Updated Jul 23, 2023

Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"

Python 407 27 Updated Oct 10, 2025

Official implementation of HYPIR: Harnessing Diffusion-Yielded Score Priors for Image Restoration (SIGGRAPH 2025)

Python 898 60 Updated Oct 16, 2025

[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR

Jupyter Notebook 672 42 Updated Sep 26, 2024

This is official implementtaion of "VmambaIR: Visual State Space Model for Image Restoration"

Python 206 7 Updated May 7, 2025

Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)

Python 1,285 81 Updated Aug 5, 2025

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 5,302 450 Updated May 12, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,813 316 Updated Sep 30, 2025

The ultimate training toolkit for finetuning diffusion models

Python 6,646 792 Updated Oct 27, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,404 6,452 Updated Oct 28, 2025

[CVPR2024] Diffusion-based Blind Text Image Super-Resolution (Official)

Python 177 10 Updated Oct 23, 2025

The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""

Python 85 5 Updated Oct 20, 2025

Official repository of In-Context LoRA for Diffusion Transformers

2,027 95 Updated Dec 20, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 61,172 10,823 Updated Oct 28, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,682 1,164 Updated Nov 14, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,703 929 Updated Oct 27, 2025

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,378 81 Updated Sep 22, 2025

An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.

Python 428 14 Updated Aug 4, 2025

LLM inference in C/C++

C++ 88,414 13,443 Updated Oct 28, 2025
Next