You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+5-7Lines changed: 5 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
-
# ASL Translation Data Preprocessing<!-- omit from toc -->
1
+
# ASL Preprocessing Using MediaPipe <!-- omit from toc -->
2
2
3
-
This repository provides a comprehensive solution for preprocessing American Sign Language (ASL) datasets, designed to handle both **How2Sign** and **YouTube-ASL** datasets. Our pipeline streamlines the workflow from video acquisition to landmark extraction, preparing the data for ASL translation tasks.
3
+
This repository follows the method from ["YouTube-ASL: A Large-Scale, Open-Domain American Sign Language-English Parallel Corpus" (Uthus et al., 2023)](https://arxiv.org/abs/2306.15162) which is designed to handle both **YouTube-ASL** and **How2Sign** datasets through **MediaPipe Holistic**. Our pipeline streamlines the workflow from video acquisition to landmark extraction, preparing the data for ASL translation tasks.
4
4
5
5
## Table of Contents<!-- omit from toc -->
6
6
@@ -9,8 +9,6 @@ This repository provides a comprehensive solution for preprocessing American Sig
9
9
-[YouTube-ASL](#youtube-asl)
10
10
-[How2Sign](#how2sign)
11
11
-[Dataset Introduction](#dataset-introduction)
12
-
-[YouTube-ASL Dataset](#youtube-asl-dataset)
13
-
-[How2Sign Dataset](#how2sign-dataset)
14
12
15
13
## Project Configuration
16
14
@@ -27,7 +25,7 @@ All project settings are managed through `conf.py`, offering a single configurat
27
25
-`FRAME_SKIP`: Controls frame sampling rate for efficient processing
28
26
-`MAX_WORKERS`: Manages parallel processing to optimize performance
29
27
30
-
-`POSE_IDX`, `FACE_IDX`, `HAND_IDX`: Selected landmark indices for extracting relevant points for sign language analysis
28
+
-`POSE_IDX`, `FACE_IDX`, `HAND_IDX`: Selected landmark indices for extracting relevant points for sign language analysis. Devault value is the index defined in YouTube-ASL Dataset's research paper.
31
29
32
30
## How to Use
33
31
@@ -47,14 +45,14 @@ All project settings are managed through `conf.py`, offering a single configurat
47
45
- The script processes each video segment according to its timestamp, extracting only the most relevant body keypoints for sign language analysis. It uses parallel processing to handle multiple video efficiently. Results are saved as NumPy arrays.
48
46
49
47
### How2Sign
50
-
1. Download **Green Screen RGB videos** and **English Translation (manually re-aligned)** from the How2Sign website.
48
+
1. Download **Green Screen RGB videos** and **English Translation (manually re-aligned)** from the [How2Sign Website](https://how2sign.github.io/).
51
49
2. Place the directory and .csv file in the correct path or amend the path in `conf.py`.
52
50
3. Run **Step 3: Feature Extraction** (`s3_mediapipe_labelling.py`) only.
-**Paper**: ["YouTube-ASL: A Large-Scale, Open-Domain American Sign Language-English Parallel Corpus" (Uthus et al., 2023)](https://arxiv.org/abs/2306.15162)
59
57
60
58
If you use YouTube-ASL, please cite their associated paper:
0 commit comments