Starred repositories
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal domains, for both inference and training.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
HunyuanVideo: A Systematic Framework For Large Video Generation Model
StyleGAN2 - Official TensorFlow Implementation
🐍 Geometric Computer Vision Library for Spatial AI
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
The absolute trainer to light up AI agents.
Large World Model -- Modeling Text and Video with Millions Context
Official TensorFlow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)
[Open Source] The improved version of AnimeGAN. Converts landscape photos/videos to anime style.
[CAPTCHA recognition - training] This project uses CNN/ResNet/DenseNet + GRU/LSTM + CTC/CrossEntropy to recognize CAPTCHAs. It covers model training only.
AI-based multi-label girl image classification system, implemented in TensorFlow.
Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation
Official PyTorch implementation of the preprint paper "Castle in the Sky: Dynamic Sky Replacement and Harmonization in Videos" (arXiv:2010.11800).
Official PyTorch implementation of the paper "Stylized Neural Painting" (CVPR 2021).
Chinese Pre-Trained Language Models (CPM-LM) Version-I
Code for SIGGRAPH 2020 paper "RigNet: Neural Rigging for Articulated Characters"
[CVPR 2021] Closed-Form Factorization of Latent Semantics in GANs
Generating Digital Painting Lighting Effects via RGB-space Geometry (SIGGRAPH2020/TOG2020)
Transfer the makeup style of a reference face image to a non-makeup face
Code for Deep Single-image Portrait Image Relighting