Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
Models and examples built with TensorFlow
A high-throughput and memory-efficient inference and serving engine for LLMs
The simplest, fastest repository for training/finetuning medium-sized GPTs.
High-Resolution Image Synthesis with Latent Diffusion Models
ChatGLM-6B: An Open Bilingual Dialogue Language Model
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
💫 Industrial-strength Natural Language Processing (NLP) in Python
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
PyTorch Tutorial for Deep Learning Researchers
Code and documentation to train Stanford's Alpaca models, and generate the data.
Open-Sora: Democratizing Efficient Video Production for All
State-of-the-art 2D and 3D Face Analysis Project
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Official inference repo for FLUX.1 models
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Fast and memory-efficient exact attention
Janus-Series: Unified Multimodal Understanding and Generation Models
Datasets, Transforms and Models specific to Computer Vision
DALL·E Mini - Generate images from a text prompt

