Optionally ignore utf-8 decoding error for scripted C++ tokenizers. #2128

Closed

shuminghu
Contributor

Summary: Adds a binding and a test to make sure the 'ignore' option for UTF-8 decoding, added to PyTorch in D43970697 (pytorch/pytorch#97282), can be used.

Reviewed By: Nayef211

Differential Revision: D44315169
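
For readers unfamiliar with the 'ignore' option, the sketch below shows what it conventionally means for UTF-8 decoding, using only Python's built-in `bytes.decode`. It is an illustration of the semantics, not the torchtext binding from this PR; the helper name `decode_utf8` and its `ignore_errors` parameter are hypothetical and do not come from the diff.

```python
# Illustration only: Python's built-in decoder, NOT the torchtext C++ binding.
# The bytes 0xFF and 0xFE can never appear in valid UTF-8.
raw = b"hello \xff\xfeworld"


def decode_utf8(data: bytes, ignore_errors: bool = False) -> str:
    """Hypothetical helper: decode bytes, optionally dropping invalid sequences."""
    return data.decode("utf-8", errors="ignore" if ignore_errors else "strict")


# Strict decoding raises on the first invalid byte.
try:
    decode_utf8(raw)
    strict_failed = False
except UnicodeDecodeError:
    strict_failed = True

# With 'ignore', the invalid bytes are silently dropped.
lenient = decode_utf8(raw, ignore_errors=True)
print(strict_failed, repr(lenient))
```

Dropping bytes silently loses information, which is presumably why the PR title describes the behavior as optional.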

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D44315169

…ytorch#2128)

Summary:
Pull Request resolved: pytorch#2128

Binding and test to make sure we can use 'ignore' option for utf-8 decoding added to pytorch in D43970697( pytorch/pytorch#97282).

Reviewed By: Nayef211

Differential Revision: D44315169

fbshipit-source-id: a4a60549c8945c8e8594f4f2cc9e7c99d105128d
@Nayef211
Contributor

Don't need to export diffs to GH

@Nayef211 Nayef211 closed this Mar 28, 2023
facebook-github-bot pushed a commit that referenced this pull request Mar 28, 2023
…2128)

Summary:
Pull Request resolved: #2128

Binding and test to make sure we can use 'ignore' option for utf-8 decoding added to pytorch in D43970697( pytorch/pytorch#97282).

Reviewed By: Nayef211

Differential Revision: D44315169

fbshipit-source-id: d42fcacafd429cf586c631faf826abc172b173d3
Nayef211 pushed a commit to Nayef211/text that referenced this pull request Mar 29, 2023
Nayef211 added a commit that referenced this pull request Mar 29, 2023
#2134)

* Optionally ignore utf-8 decoding error for scripted C++ tokenizers. (#2128)

* Linter fixes

---------

Co-authored-by: Shuming Hu <smhu@meta.com>