replacing tokenize by split_words_on_whitespace to keep text structure #201

Closed
wants to merge 2 commits into from


AghilesAzzoug
Contributor

Related Issue

Fixes #200

Summary

  • [ x ] I have read CONTRIBUTING.md to understand how to contribute to this repository :)

Replaced the tokenize and detokenize functions with split_words_on_whitespace and rejoin_words_and_whitespace, so that the original separators between words are preserved.
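As a rough illustration of why splitting on whitespace keeps the text structure intact, here is a minimal sketch. It is not AugLy's actual code: the helper names `split_on_whitespace`, `rejoin` and `augment_words` are hypothetical, and the upper-casing step is only a stand-in for a per-word augmentation such as character replacement.

```python
import re
from typing import List, Tuple


def split_on_whitespace(text: str) -> Tuple[List[str], List[str]]:
    """Hypothetical helper: return the words and the whitespace runs between them."""
    # The capturing group makes re.split keep the separators in its result,
    # so words land at even indices and whitespace runs at odd indices.
    parts = re.split(r"(\s+)", text)
    return parts[0::2], parts[1::2]


def rejoin(words: List[str], separators: List[str]) -> str:
    """Hypothetical helper: interleave words with their original separators."""
    pieces = []
    for i, word in enumerate(words):
        pieces.append(word)
        if i < len(separators):
            pieces.append(separators[i])
    return "".join(pieces)


def augment_words(text: str) -> str:
    """Apply a stand-in per-word transform without touching the separators."""
    words, separators = split_on_whitespace(text)
    return rejoin([word.upper() for word in words], separators)


original = "first line\n\tsecond  line"
# Newlines, tabs and double spaces survive the round trip.
augmented = augment_words(original)
# A naive split/join, by contrast, collapses every separator to one space.
collapsed = " ".join(original.split())
```

Because the separators are stored alongside the words, the round trip reproduces the input exactly when no word is modified, which is the property the linked issue says was lost.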

Unit Tests

Text

Updated the expected strings in the unit tests that use ReplaceSimilarChars.

All tests passed.

facebook-github-bot added the "CLA Signed" label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Mar 10, 2022
Contributor

zpapakipos left a comment

Hi @AghilesAzzoug, thanks for this PR and for surfacing this issue! Please remove all the formatting changes you made in this PR (probably introduced by running black or another formatter whose style doesn't match the one we use). Once that is done, we can more easily review the actual changes you made. Thanks!

augly/image/functional.py (outdated, resolved)
@facebook-github-bot
Contributor

@zpapakipos has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

zpapakipos requested a review from jbitton on March 11, 2022, 18:45
zpapakipos pushed a commit to zpapakipos/AugLy that referenced this pull request Mar 28, 2022
facebookresearch#201)

Summary:
## Related Issue
Fixes facebookresearch#200

- [ x ] I have read CONTRIBUTING.md to understand how to contribute to this repository :)

Replaced `tokenize` and `detokenize` functions by `split_words_on_whitespace` and `rejoin_words_and_whitespace`.

## Unit Tests

### Text

Replaced expected strings in unit test using `ReplaceSimilarChars`.

All tests passed.

Pull Request resolved: facebookresearch#201

Reviewed By: jbitton

Differential Revision: D34819291

Pulled By: zpapakipos

fbshipit-source-id: 3174900eb7823c1b4df535cbf90edcaf3cadafdc
Successfully merging this pull request may close these issues.

ReplaceSimilarChars text augmenter removes separators and replaces them with one blank space
3 participants