fixing and re-organizing pipelines #1250

parmeet · 2021-03-08T19:00:59Z

FIxing pipelines according to new features in torchtext and removing pytext dependency

codecov · 2021-03-12T06:14:44Z

Codecov Report

Merging #1250 (9467f6e) into master (be3f640) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #1250   +/-   ##
=======================================
  Coverage   78.80%   78.80%           
=======================================
  Files          67       67           
  Lines        3624     3624           
=======================================
  Hits         2856     2856           
  Misses        768      768

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update be3f640...9467f6e. Read the comment docs.

zhangguanheng66 · 2021-03-19T17:07:06Z

examples/data_pipeline/README.md

-    python pipelines.py --pipeline pytext
-
-
-## Experimental PyText


why do we want to remove this pipeline?

The github repo of pytext is not maintained anymore and is not in good state as of this writing. This would mean that we have code in torchtext that is breaking going forward. I wasn't so sure, if we still want to maintain these code snippets that may sporadically break?

cc: @hudeven

Let's remove this in a separate PR, since we're not entirely clear on it and the other changes in this diff can go ahead, plus we might need it for comparison relatively soon.

cpuhrsch · 2021-03-23T19:02:43Z

examples/data_pipeline/pipelines.py

 if __name__ == "__main__":
    parser = argparse.ArgumentParser(description='Data procesing pipelines')
    parser.add_argument('--pipeline', type=str, default='sentencepiece',
                        help='The name of pipeline')
    parser.add_argument('--dataset', type=str, default='AG_NEWS',
                        help='Dataset for performance benchmark')
-    parser.add_argument('--spm-filename', type=str, default='m_user.model',
+    parser.add_argument('--spm-filename', type=str, default='text_unigram_25000',


Why this change?

This is for ease of use in default mode. If the user does not have access to spm model, user can simply run the code in default mode that will download one of the pre-trained spm model (text_unigram_25000). This change does not impact previous behavior, i.e if the use indeed specify m_user.model (with name other that the name in pre-trained spm models) it will work with the user model.

cpuhrsch

See the comment before landing

This reverts commit 04ffc11.

fixing and re-organizing pipelines

3fcfdd1

facebook-github-bot added the cla signed label Mar 8, 2021

parmeet and others added 2 commits March 8, 2021 14:08

fixing dataset

f88b6c4

Merge branch 'master' of github.com:pytorch/text into pipelines

02a674f

parmeet changed the title ~~[WIP] fixing and re-organizing pipelines~~ fixing and re-organizing pipelines Mar 15, 2021

parmeet added 2 commits March 18, 2021 21:41

Merge branch 'master' of github.com:pytorch/text into pipelines

d333d33

removing pytext code

04ffc11

parmeet requested a review from cpuhrsch March 19, 2021 04:52

zhangguanheng66 reviewed Mar 19, 2021

View reviewed changes

cpuhrsch reviewed Mar 23, 2021

View reviewed changes

cpuhrsch approved these changes Mar 24, 2021

View reviewed changes

parmeet and others added 3 commits March 24, 2021 11:14

Revert "removing pytext code"

a4e4b46

This reverts commit 04ffc11.

Merge branch 'master' of github.com:pytorch/text into pipelines

a7853c2

Merge branch 'master' into pipelines

9467f6e

parmeet merged commit eb5e39d into pytorch:master Mar 24, 2021

parmeet deleted the pipelines branch March 24, 2021 23:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fixing and re-organizing pipelines #1250

fixing and re-organizing pipelines #1250

Uh oh!

parmeet commented Mar 8, 2021 •

edited

Loading

Uh oh!

codecov bot commented Mar 12, 2021 •

edited

Loading

Uh oh!

zhangguanheng66 Mar 19, 2021

Uh oh!

parmeet Mar 19, 2021

Uh oh!

cpuhrsch Mar 24, 2021

Uh oh!

cpuhrsch Mar 23, 2021

Uh oh!

parmeet Mar 24, 2021

Uh oh!

cpuhrsch left a comment

Uh oh!

Uh oh!

fixing and re-organizing pipelines #1250

fixing and re-organizing pipelines #1250

Uh oh!

Conversation

parmeet commented Mar 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Mar 12, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

zhangguanheng66 Mar 19, 2021

Choose a reason for hiding this comment

Uh oh!

parmeet Mar 19, 2021

Choose a reason for hiding this comment

Uh oh!

cpuhrsch Mar 24, 2021

Choose a reason for hiding this comment

Uh oh!

cpuhrsch Mar 23, 2021

Choose a reason for hiding this comment

Uh oh!

parmeet Mar 24, 2021

Choose a reason for hiding this comment

Uh oh!

cpuhrsch left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

parmeet commented Mar 8, 2021 •

edited

Loading

codecov bot commented Mar 12, 2021 •

edited

Loading