Skip to content

Commit 45b212f

Browse files
committed
add timeline and model size images
1 parent 446768b commit 45b212f

File tree

4 files changed

+87
-0
lines changed

4 files changed

+87
-0
lines changed

.gitignore

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,5 @@
1+
docs/history.twbx
2+
13
.DS_Store
24

35
# Byte-compiled / optimized / DLL files

docs/history.csv

Lines changed: 85 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,85 @@
1+
Name,Paper Date,paper,hide,type,parameters
2+
LSTM,01/11/1997,http://www.bioinf.jku.at/publications/older/2604.pdf,1,Autoregressive / Transformer,
3+
VAE,20/12/2013,https://arxiv.org/abs/1312.6114,0,Variational Autoencoder,
4+
Encoder Decoder,03/06/2014,https://arxiv.org/abs/1406.1078,1,Autoregressive / Transformer,
5+
GAN,10/06/2014,https://arxiv.org/abs/1406.2661,0,Generative Adversarial Network,
6+
Attention,01/09/2014,https://arxiv.org/abs/1409.0473,1,General,
7+
GRU,03/09/2014,https://arxiv.org/abs/1409.1259,0,Autoregressive / Transformer,
8+
CGAN,06/11/2014,https://arxiv.org/abs/1411.1784,0,Generative Adversarial Network,
9+
Diffusion Process,12/03/2015,https://arxiv.org/abs/1503.03585,1,Energy-Based / Diffusion Models,
10+
UNet,18/05/2015,https://arxiv.org/abs/1505.04597,0,General,
11+
Neural Style,26/08/2015,https://arxiv.org/abs/1508.06576,1,General,
12+
DCGAN,19/11/2015,https://arxiv.org/abs/1511.06434,0,Generative Adversarial Network,
13+
ResNet,10/12/2015,https://arxiv.org/abs/1512.03385,0,General,
14+
VAE-GAN,31/12/2015,https://arxiv.org/abs/1512.09300,0,Variational Autoencoder,
15+
Self Attention,25/01/2016,https://arxiv.org/abs/1601.06733,1,Autoregressive / Transformer,
16+
PixelRNN,25/01/2016,https://arxiv.org/abs/1601.06759,0,Autoregressive / Transformer,
17+
RealNVP,27/05/2016,https://arxiv.org/abs/1605.08803v3,0,Normalizing Flow,
18+
PixelCNN,16/06/2016,https://arxiv.org/abs/1606.05328,0,Autoregressive / Transformer,
19+
pix2pix,21/11/2016,https://arxiv.org/abs/1611.07004,0,Generative Adversarial Network,
20+
Stack GAN,10/12/2016,https://arxiv.org/abs/1612.03242,1,Generative Adversarial Network,
21+
PixelCNN++,19/01/2017,https://arxiv.org/abs/1701.05517,0,Autoregressive / Transformer,
22+
WGAN,26/01/2017,https://arxiv.org/abs/1701.07875,0,Generative Adversarial Network,
23+
CycleGAN,30/03/2017,https://arxiv.org/abs/1703.10593,0,Generative Adversarial Network,
24+
WGAN GP,31/03/2017,https://arxiv.org/abs/1704.00028,1,Generative Adversarial Network,
25+
Transformers,12/06/2017,https://arxiv.org/abs/1706.03762,0,Autoregressive / Transformer,
26+
MuseGAN,19/09/2017,https://arxiv.org/abs/1709.06298,0,Generative Adversarial Network,
27+
ProGAN,27/10/2017,https://arxiv.org/abs/1710.10196,0,Generative Adversarial Network,
28+
VQ-VAE,02/11/2017,https://arxiv.org/abs/1711.00937v2,0,Variational Autoencoder,
29+
World Models,27/03/2018,https://arxiv.org/abs/1803.10122,0,Variational Autoencoder,
30+
SAGAN,21/05/2018,https://arxiv.org/abs/1805.08318v2,0,Generative Adversarial Network,
31+
GPT,11/06/2018,https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf,0,Autoregressive / Transformer,0.117
32+
GLOW,09/07/2018,https://arxiv.org/abs/1807.03039,0,Normalizing Flow,
33+
Universal Transformer,10/07/2018,https://arxiv.org/abs/1807.03819,1,Autoregressive / Transformer,
34+
BigGAN,28/09/2018,https://arxiv.org/abs/1809.11096,0,Generative Adversarial Network,
35+
FFJORD,02/10/2018,https://arxiv.org/abs/1810.01367,0,Normalizing Flow,
36+
BERT,11/10/2018,https://arxiv.org/abs/1810.04805,0,Autoregressive / Transformer,
37+
StyleGAN,12/12/2018,https://arxiv.org/abs/1812.04948,0,Generative Adversarial Network,
38+
Music Transformer,12/12/2018,https://arxiv.org/abs/1809.04281,0,Autoregressive / Transformer,
39+
GPT-2,14/02/2019,https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf,0,Autoregressive / Transformer,1.5
40+
MuseNet,25/04/2019,https://openai.com/blog/musenet/,0,Autoregressive / Transformer,
41+
VQ-VAE-2,02/06/2019,https://arxiv.org/abs/1906.00446v1,0,Variational Autoencoder,
42+
NCSN,12/07/2019,https://arxiv.org/abs/1907.05600,0,Energy-Based / Diffusion Models,
43+
T5,23/10/2019,https://arxiv.org/abs/1910.10683,0,Autoregressive / Transformer,11
44+
StyleGAN2,03/12/2019,https://arxiv.org/abs/1912.04958,0,Generative Adversarial Network,
45+
NeRF,19/03/2020,https://arxiv.org/abs/2003.08934,0,General,
46+
GPT-3,28/05/2020,https://arxiv.org/abs/2005.14165,0,Autoregressive / Transformer,175
47+
DDPM,19/06/2020,https://arxiv.org/abs/2006.11239,0,Energy-Based / Diffusion Models,
48+
DDIM,06/10/2020,https://arxiv.org/abs/2010.02502,0,Energy-Based / Diffusion Models,
49+
Vision Transformer,22/10/2020,https://arxiv.org/abs/2010.11929,0,Autoregressive / Transformer,
50+
VQ-GAN,17/12/2020,https://arxiv.org/abs/2012.09841,0,Generative Adversarial Network,
51+
DALL.E,24/02/2021,https://arxiv.org/abs/2102.12092,0,Multimodal Models,12
52+
CLIP,26/02/2021,https://arxiv.org/abs/2103.00020,0,Multimodal Models,
53+
GPT-Neo,21/03/2021,https://github.com/EleutherAI/gpt-neo,0,Autoregressive / Transformer,2.7
54+
GPT-J,10/06/2021,https://github.com/kingoflolz/mesh-transformer-jax,0,Autoregressive / Transformer,6
55+
StyleGAN3,23/06/2021,https://arxiv.org/abs/2106.12423,0,Generative Adversarial Network,
56+
Codex,07/07/2021,https://arxiv.org/abs/2107.03374,0,Autoregressive / Transformer,
57+
ViT VQ-GAN,09/10/2021,https://arxiv.org/abs/2110.04627,0,Generative Adversarial Network,
58+
Megatron-Turing NLG,11/10/2021,https://developer.nvidia.com/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/,0,Autoregressive / Transformer,530
59+
Gopher,08/12/2021,https://arxiv.org/abs/2112.11446,0,Autoregressive / Transformer,280
60+
GLIDE,20/12/2021,https://arxiv.org/abs/2112.10741,0,Multimodal Models,5
61+
Latent Diffusion,20/12/2021,https://arxiv.org/abs/2112.10752,0,Energy-Based / Diffusion Models,
62+
LaMDA,20/01/2022,https://arxiv.org/abs/2201.08239,0,Autoregressive / Transformer,137
63+
StyleGAN-XL,01/02/2022,https://arxiv.org/abs/2202.00273v2,0,Generative Adversarial Network,0
64+
GPT-NeoX,02/02/2022,https://github.com/EleutherAI/gpt-neox,0,Autoregressive / Transformer,20
65+
Chinchilla,29/03/2022,https://arxiv.org/abs/2203.15556v1,0,Autoregressive / Transformer,70
66+
PaLM,05/04/2022,https://arxiv.org/abs/2204.02311,0,Autoregressive / Transformer,540
67+
DALL.E 2,13/04/2022,https://arxiv.org/abs/2204.06125,0,Multimodal Models,3.5
68+
Flamingo,29/04/2022,https://arxiv.org/abs/2204.14198,0,Multimodal Models,80
69+
OPT,02/05/2022,https://arxiv.org/abs/2205.01068,0,Autoregressive / Transformer,175
70+
Imagen,23/05/2022,https://arxiv.org/abs/2205.11487,0,Multimodal Models,4.6
71+
Parti,22/06/2022,https://arxiv.org/abs/2206.10789,0,Multimodal Models,20
72+
BLOOM,16/07/2022,https://arxiv.org/abs/2211.05100,0,Autoregressive / Transformer,176
73+
Stable Diffusion,22/08/2022,https://stability.ai/blog/stable-diffusion-public-release,0,Multimodal Models,0.89
74+
ChatGPT,30/11/2022,https://chat.openai.com/,0,Autoregressive / Transformer,
75+
MUSE,02/01/2023,https://arxiv.org/abs/2301.00704,0,Multimodal Models,3
76+
MusicLM,26/01/2023,https://arxiv.org/abs/2301.11325,0,Multimodal Models,
77+
Dreamix,02/02/2023,https://arxiv.org/pdf/2302.01329.pdf,0,Multimodal Models,
78+
Toolformer,09/02/2023,https://arxiv.org/pdf/2302.04761.pdf,0,Autoregressive / Transformer,
79+
ControlNet,10/02/2023,https://arxiv.org/abs/2302.05543,0,Multimodal Models,
80+
LLaMA,24/02/2023,https://arxiv.org/abs/2302.13971,0,Autoregressive / Transformer,65
81+
PaLM-E,06/03/2023,https://arxiv.org/abs/2303.03378,0,Multimodal Models,562
82+
Visual ChatGPT,08/03/2023,https://arxiv.org/abs/2303.04671,0,Multimodal Models,
83+
Alpaca,13/03/2023,https://github.com/tatsu-lab/stanford_alpaca,1,Autoregressive / Transformer,
84+
GPT-4,16/03/2023,https://cdn.openai.com/papers/gpt-4.pdf,0,Multimodal Models,
85+
Luminous,14/04/2022,https://www.aleph-alpha.com/luminous,0,Autoregressive / Transformer,

docs/model_sizes.png

165 KB
Loading

docs/timeline.png

295 KB
Loading

0 commit comments

Comments
 (0)