remove padding mask for input embeddings #1799

parmeet · 2022-06-21T01:18:21Z

This PR removes the masking at Padded tokens for the input embeddings.

In fairseq, the masking is applied to input embedding here but not in HF implementation. This causes MisMatch in output embedding for the padded tokens.

Ideally, it should not matter the output for padded tokens. Per the investigations from @ebsmothers, it is somehow causing results to differ for MDETR model. In order for Torch MM to upstream dependency on torchtext for RoBERTa encoder, this change is necessary.

ebsmothers

Thanks for the fix! This looks good to me

remove padding mask for input embeddings

be49832

facebook-github-bot added the cla signed label Jun 21, 2022

parmeet requested review from abhinavarora and ebsmothers June 21, 2022 01:18

ebsmothers approved these changes Jun 21, 2022

View reviewed changes

parmeet merged commit a937288 into pytorch:main Jun 21, 2022

parmeet deleted the match_hf_padding branch June 21, 2022 13:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

remove padding mask for input embeddings #1799

remove padding mask for input embeddings #1799

Uh oh!

parmeet commented Jun 21, 2022 •

edited

Loading

Uh oh!

ebsmothers left a comment

Uh oh!

Uh oh!

remove padding mask for input embeddings #1799

remove padding mask for input embeddings #1799

Uh oh!

Conversation

parmeet commented Jun 21, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ebsmothers left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

parmeet commented Jun 21, 2022 •

edited

Loading