- Allen Institute for Artificial Intelligence
- Seattle
- http://tanmaygupta.info
- @tanmay2099
Stars
Simple project page template for your research paper, built with Astro and Tailwind CSS
An open-source framework for training large multimodal models.
Generative Models by Stability AI
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
PyTorch implementation of the LOST unsupervised object discovery method
GIT: A Generative Image-to-text Transformer for Vision and Language
Hackable and optimized Transformers building blocks, supporting a composable construction.
This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".
An efficient video loader for deep learning with smart shuffling that's super easy to digest
LAVIS - A One-stop Library for Language-Vision Intelligence
High-Resolution Image Synthesis with Latent Diffusion Models
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation
Python bindings for FFmpeg - with complex filtering support
BigRedT / ViLT
Forked from dandelin/ViLT. Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Deep learning for molecules and materials book
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
A simple python package for multi-task training: wrappers for pytorch DataLoader and pytorch-lightning DataModule
allenai / refer
Forked from lichengunc/refer. Referring Expression Datasets API
Google Research
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Awesome Incremental Learning
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Semantic Segmentation Architectures Implemented in PyTorch