[#1782] Add max position params to speech recognition #1783

mgaido91 · 2020-03-05T17:29:56Z

Before submitting

Was this discussed/approved via a Github issue? (no need for typos, doc improvements)
Did you read the contributor guideline?
Did you make sure to update the docs?
Did you write any new necessary tests?

What does this PR do?

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

facebook-github-bot · 2020-03-05T17:30:06Z

Hi @mgaido91!

Thank you for your pull request and welcome to our community.We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file.

In order for us to review and merge your code, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

facebook-github-bot · 2020-03-05T17:52:09Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Facebook open source project. Thanks!

mgaido91 · 2020-03-05T18:21:08Z

the failure seems unrelated to this PR, but I don't know how to retrigger the tests, can anyone help me? Thanks.

erip · 2020-03-05T23:28:59Z

@mgaido91 Windows never passes.

okhonko

Thank you for your contribution @mgaido91 !

Please take a look at my minor comments.

okhonko · 2020-03-10T08:58:34Z

examples/speech_recognition/tasks/speech_recognition.py

@@ -77,6 +77,10 @@ def add_args(parser):
        parser.add_argument(
            "--silence-token", default="\u2581", help="token for silence (used by w2l)"
        )
+        parser.add_argument('--max-source-positions', default=2048, type=int, metavar='N',


The default value of 2048 may be too small since in speech recognition the source is sequence of frames, not tokens.
With this change we may start filtering out large portion of librispeech data by default for example.

Maybe we can set default as sys.maxsize for both of the source and target sequence to keep existing behavior (no data filtering by max size).

okhonko · 2020-03-10T08:59:01Z

examples/speech_recognition/tasks/speech_recognition.py

@@ -77,6 +77,10 @@ def add_args(parser):
        parser.add_argument(
            "--silence-token", default="\u2581", help="token for silence (used by w2l)"
        )
+        parser.add_argument('--max-source-positions', default=2048, type=int, metavar='N',
+                            help='max number of tokens in the source sequence')


nit: max number of frames

okhonko

Thanks for addressing my comments @mgaido91 !
Please consider changing the default for max-target-positions as well.

examples/speech_recognition/tasks/speech_recognition.py

okhonko

Looks good to me, thanks

examples/speech_recognition/tasks/speech_recognition.py

mgaido91 · 2020-03-13T11:47:35Z

Thanks for the review @okhonko . Is there anything else I can do to push this PR forward? Thanks in advance for the guidance.

facebook-github-bot

@myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2020-06-23T14:07:15Z

@myleott merged this pull request in a12c5c5.

[facebookresearch#1782] Add max position params to speech recognition

3b61eb0

facebook-github-bot added the CLA Signed label Mar 5, 2020

myleott requested a review from okhonko March 7, 2020 15:02

okhonko suggested changes Mar 10, 2020

View reviewed changes

address comments

3e2a93c

okhonko reviewed Mar 10, 2020

View reviewed changes

examples/speech_recognition/tasks/speech_recognition.py Show resolved Hide resolved

okhonko approved these changes Mar 11, 2020

View reviewed changes

examples/speech_recognition/tasks/speech_recognition.py Show resolved Hide resolved

facebook-github-bot reviewed May 20, 2020

View reviewed changes

facebook-github-bot closed this in a12c5c5 Jun 23, 2020

facebook-github-bot added the Merged label Jun 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[#1782] Add max position params to speech recognition #1783

[#1782] Add max position params to speech recognition #1783

mgaido91 commented Mar 5, 2020

facebook-github-bot commented Mar 5, 2020

facebook-github-bot commented Mar 5, 2020

mgaido91 commented Mar 5, 2020

erip commented Mar 5, 2020

okhonko left a comment

okhonko Mar 10, 2020

okhonko Mar 10, 2020

okhonko left a comment

okhonko left a comment

mgaido91 commented Mar 13, 2020

facebook-github-bot left a comment

facebook-github-bot commented Jun 23, 2020

[#1782] Add max position params to speech recognition #1783

[#1782] Add max position params to speech recognition #1783

Conversation

mgaido91 commented Mar 5, 2020

Before submitting

What does this PR do?

PR review

Did you have fun?

facebook-github-bot commented Mar 5, 2020

facebook-github-bot commented Mar 5, 2020

mgaido91 commented Mar 5, 2020

erip commented Mar 5, 2020

okhonko left a comment

Choose a reason for hiding this comment

okhonko Mar 10, 2020

Choose a reason for hiding this comment

okhonko Mar 10, 2020

Choose a reason for hiding this comment

okhonko left a comment

Choose a reason for hiding this comment

okhonko left a comment

Choose a reason for hiding this comment

mgaido91 commented Mar 13, 2020

facebook-github-bot left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Jun 23, 2020