Skip to content

Commit 82bc391

Browse files
authored
Update README.md
1 parent d712884 commit 82bc391

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

README.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,7 @@
22

33
This is an implementation of ViT - Vision Transformer by Google Research Team through the paper [**"An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale"**](https://arxiv.org/abs/2010.11929)
44

5+
**Please install PyTorch with CUDA support following this [link](https://pytorch.org/get-started/locally/)**
56

67
## ViT Architecture
78
![Architecture of Vision Transformer](https://neurohive.io/wp-content/uploads/2020/10/rsz_cov.png)
@@ -29,8 +30,8 @@ cls #pool
2930
## Training
3031
Currently, you can only train this model on CIFAR-100 with the following commands:
3132

32-
`> pip3 install einops`\
3333
`> git clone https://github.com/quanmario0311/ViT_PyTorch.git`\
34+
`> pip3 install -r requirements.txt`\
3435
`> cd ViT_PyTorch`\
3536
`> python3 train.py`
3637

0 commit comments

Comments
 (0)