Code release for ConvNeXt V2 model
Reference implementation of the Transformer architecture optimized
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer
GLIDE: a diffusion-based text-conditional image synthesis model
Environment generation code for the paper "Emergent Tool Use"
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for the paper "Improved Techniques for Training GANs"
Code for reproducing key results in the paper
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
ICLR2024 Spotlight: curation/training code, metadata, distribution
JetBrains’ 4B parameter code model for completions
OpenAI’s compact 20B open model for fast, agentic, and local use
Vision-language-action model for robot control via images and text