Commit 3189905

update dataset
1 parent f908bbb commit 3189905

1 file changed: +11 -0 lines changed

asset/DATASET.md

Lines changed: 11 additions & 0 deletions
@@ -34,6 +34,17 @@ We follow the exact data preparation for the image text pair data using https://
 Please filter out coco2017 validation set from the karpathy training split or delete coco trainin data from ./datasets/register_vlp_datasets.py
 ```
 
+```
+.xdecoder_data
+└── pretrain_arrows_code224/
+    ├── filtcoco2017val_caption_karpathy_train.arrow
+    ├── ...
+    ├── code224_vg.arrow
+    ├── code224_sbu_0.arrow
+    ├── ...
+    └── code224_conceptual_caption_train_0.arrow
+```
+
 
 ### Note
 <img src="https://user-images.githubusercontent.com/11957155/226159078-7f817452-76f8-44f4-af7a-9f13f3e02554.png" width="500">
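
The layout added in this diff can be sanity-checked before training. The following is a minimal sketch, not part of the repository: `missing_arrow_patterns` and `EXPECTED_PATTERNS` are hypothetical names, the `*` wildcards assume that the `_0` suffixes in the tree denote numbered shards, and the list is not exhaustive because the tree elides entries with `...`.

```python
from pathlib import Path

# Hypothetical check list built from the file names shown in the diff.
# The "*" wildcards are an assumption that "_0" marks the first of several shards.
EXPECTED_PATTERNS = [
    "filtcoco2017val_caption_karpathy_train.arrow",
    "code224_vg.arrow",
    "code224_sbu_*.arrow",
    "code224_conceptual_caption_train_*.arrow",
]


def missing_arrow_patterns(data_root):
    """Return the patterns with no matching file under pretrain_arrows_code224/."""
    arrow_dir = Path(data_root) / "pretrain_arrows_code224"
    # Path.glob yields nothing if the directory does not exist,
    # so a missing directory reports every pattern as missing.
    return [p for p in EXPECTED_PATTERNS if not any(arrow_dir.glob(p))]
```

Calling `missing_arrow_patterns(".xdecoder_data")` returns an empty list when at least one file matches each pattern; any pattern it returns points at a file that still needs to be prepared.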
