Adding Google SentencePiece as a Tokenizer #1106
This diff adds SentencePiece as a pip requirement, and a tokenizer shell for PyText
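As a rough illustration of what a "tokenizer shell" around SentencePiece could look like, here is a minimal sketch. It is not the code from this pull request: the class name `SentencePieceTokenizerSketch`, its constructor arguments, and the `tokenize` method are all hypothetical, and PyText's real tokenizer interface may differ. The only library calls assumed are `SentencePieceProcessor()`, `Load`, and `EncodeAsPieces` from the `sentencepiece` pip package.

```python
from typing import List, Optional


class SentencePieceTokenizerSketch:
    """Hypothetical wrapper; NOT the class added by this pull request."""

    def __init__(self, model_path: Optional[str] = None, processor=None):
        # `processor` is an illustrative hook so a stub can be injected in
        # tests; it is an assumption, not part of any PyText API.
        if processor is None:
            if model_path is None:
                raise ValueError("either model_path or processor is required")
            # Imported lazily so this module loads even when the
            # sentencepiece pip package is not installed.
            import sentencepiece as spm

            processor = spm.SentencePieceProcessor()
            processor.Load(model_path)
        self.processor = processor

    def tokenize(self, text: str) -> List[str]:
        # Delegate to SentencePiece and return the subword pieces as strings.
        return list(self.processor.EncodeAsPieces(text))
```

With a trained model file on disk, usage would be along the lines of `SentencePieceTokenizerSketch(model_path=...).tokenize("some text")`; the model path is left open here because the pull request text does not name one.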
@snisarg has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@snisarg has updated the pull request.
Motivation and Context
We need SentencePiece to support modern cross-lingual models.
How Has This Been Tested
A unit test has been added.
Types of changes
Checklist