Skip to content

Commit 2fc1857

Browse files
committed
Merge remote-tracking branch 'origin/main'
2 parents 7a3323f + cdbc030 commit 2fc1857

File tree

10 files changed

+5
-354
lines changed

10 files changed

+5
-354
lines changed

.idea/.gitignore

Lines changed: 0 additions & 10 deletions
This file was deleted.

.idea/ASL-Dataset-Preprocess.iml

Lines changed: 0 additions & 8 deletions
This file was deleted.

.idea/inspectionProfiles/Project_Default.xml

Lines changed: 0 additions & 283 deletions
This file was deleted.

.idea/inspectionProfiles/profiles_settings.xml

Lines changed: 0 additions & 6 deletions
This file was deleted.

.idea/markdown.xml

Lines changed: 0 additions & 9 deletions
This file was deleted.

.idea/misc.xml

Lines changed: 0 additions & 7 deletions
This file was deleted.

.idea/modules.xml

Lines changed: 0 additions & 8 deletions
This file was deleted.

.idea/vcs.xml

Lines changed: 0 additions & 6 deletions
This file was deleted.

.idea/youtube-asl-a-large-scale.iml

Lines changed: 0 additions & 10 deletions
This file was deleted.

README.md

Lines changed: 5 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
1-
# ASL Translation Data Preprocessing<!-- omit from toc -->
1+
# ASL Preprocessing Using MediaPipe <!-- omit from toc -->
22

3-
This repository provides a comprehensive solution for preprocessing American Sign Language (ASL) datasets, designed to handle both **How2Sign** and **YouTube-ASL** datasets. Our pipeline streamlines the workflow from video acquisition to landmark extraction, preparing the data for ASL translation tasks.
3+
This repository follows the method from ["YouTube-ASL: A Large-Scale, Open-Domain American Sign Language-English Parallel Corpus" (Uthus et al., 2023)](https://arxiv.org/abs/2306.15162) which is designed to handle both **YouTube-ASL** and **How2Sign** datasets through **MediaPipe Holistic**. Our pipeline streamlines the workflow from video acquisition to landmark extraction, preparing the data for ASL translation tasks.
44

55
## Table of Contents<!-- omit from toc -->
66

@@ -9,8 +9,6 @@ This repository provides a comprehensive solution for preprocessing American Sig
99
- [YouTube-ASL](#youtube-asl)
1010
- [How2Sign](#how2sign)
1111
- [Dataset Introduction](#dataset-introduction)
12-
- [YouTube-ASL Dataset](#youtube-asl-dataset)
13-
- [How2Sign Dataset](#how2sign-dataset)
1412

1513
## Project Configuration
1614

@@ -27,7 +25,7 @@ All project settings are managed through `conf.py`, offering a single configurat
2725
- `FRAME_SKIP`: Controls frame sampling rate for efficient processing
2826
- `MAX_WORKERS`: Manages parallel processing to optimize performance
2927

30-
- `POSE_IDX`, `FACE_IDX`, `HAND_IDX`: Selected landmark indices for extracting relevant points for sign language analysis
28+
- `POSE_IDX`, `FACE_IDX`, `HAND_IDX`: Selected landmark indices for extracting relevant points for sign language analysis. Devault value is the index defined in YouTube-ASL Dataset's research paper.
3129

3230
## How to Use
3331

@@ -47,14 +45,14 @@ All project settings are managed through `conf.py`, offering a single configurat
4745
- The script processes each video segment according to its timestamp, extracting only the most relevant body keypoints for sign language analysis. It uses parallel processing to handle multiple video efficiently. Results are saved as NumPy arrays.
4846

4947
### How2Sign
50-
1. Download **Green Screen RGB videos** and **English Translation (manually re-aligned)** from the How2Sign website.
48+
1. Download **Green Screen RGB videos** and **English Translation (manually re-aligned)** from the [How2Sign Website](https://how2sign.github.io/).
5149
2. Place the directory and .csv file in the correct path or amend the path in `conf.py`.
5250
3. Run **Step 3: Feature Extraction** (`s3_mediapipe_labelling.py`) only.
5351

5452
## Dataset Introduction
5553

5654
### YouTube-ASL Dataset
57-
- **Video List**: [GitHub Repository](https://github.com/google-research/google-research/blob/master/youtube_asl/README.md)
55+
- **Video List**: [google-research/youtube_asl](https://github.com/google-research/google-research/blob/master/youtube_asl/README.md)
5856
- **Paper**: ["YouTube-ASL: A Large-Scale, Open-Domain American Sign Language-English Parallel Corpus" (Uthus et al., 2023)](https://arxiv.org/abs/2306.15162)
5957

6058
If you use YouTube-ASL, please cite their associated paper:

0 commit comments

Comments
 (0)