Back translation augmentation #26

Closed
martincjespersen opened this issue Aug 7, 2021 · 2 comments
Labels
additional augmenter: New augmenter(s) to add

Comments

@martincjespersen

Add an augmenter that augments a document via back-translation through various languages, e.g. using Hugging Face translation models: https://huggingface.co/models?pipeline_tag=translation.

Example blog: https://dzlab.github.io/dltips/en/pytorch/text-augmentation/

Example sentence:
Augmenty is an augmentation library based on spaCy for augmenting texts. Augmenty differs from other augmentation libraries in that it corrects (as far as possible) the token, sentence and document labels under the augmentation.

English -> Danish (Google):
Augmenty er et udvidelsesbibliotek baseret på spaCy til forstørrelse af tekster. Augmenty adskiller sig fra andre augmentationsbiblioteker ved, at den korrigerer (så vidt muligt) token-, sætnings- og dokumentetiketterne under augmentationen.

Danish -> English (Google):
Augmenty is an extension library based on spaCy for enlarging texts. Augmenty differs from other augmentation libraries in that it corrects (as far as possible) the token, sentence, and document labels during augmentation.
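
The round trip above can be sketched in Python roughly as follows. This is a minimal illustration, not Augmenty's API: `back_translate` and `make_hf_translator` are hypothetical helper names, and it assumes the `transformers` package is installed and that a translation model exists on the Hugging Face hub for each direction.

```python
from typing import Callable, List

# A translator takes a batch of texts and returns the translated batch.
Translator = Callable[[List[str]], List[str]]


def make_hf_translator(model_name: str) -> Translator:
    """Wrap a Hugging Face translation pipeline as a batch translator.

    `model_name` is whatever translation checkpoint you pick from the hub;
    nothing here is specific to Augmenty.
    """
    # Imported lazily so the rest of the sketch works without transformers.
    from transformers import pipeline

    translate = pipeline("translation", model=model_name)

    def _translate(texts: List[str]) -> List[str]:
        # The pipeline returns one dict per input text.
        return [out["translation_text"] for out in translate(texts)]

    return _translate


def back_translate(
    texts: List[str], forward: Translator, backward: Translator
) -> List[str]:
    """Translate to a pivot language and back to get paraphrased texts."""
    return backward(forward(texts))
```

Usage would look something like `back_translate(docs, make_hf_translator("Helsinki-NLP/opus-mt-en-da"), make_hf_translator("Helsinki-NLP/opus-mt-da-en"))`. Those model names are an assumption on my part, so check the model hub for the language pair you need.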

@martincjespersen added the "additional augmenter" label on Aug 7, 2021
@KennethEnevoldsen
Owner

This would require solving issue #14 first: the model currently takes in one example at a time, which would be infeasible for back-translation.
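
To illustrate the batching concern, here is a rough sketch of feeding texts to a translation function in chunks rather than one at a time. It is not how Augmenty works today and not a proposal for issue #14 specifically; `augment_in_batches`, `translate_batch`, and the default `batch_size` are all made up for the example.

```python
from typing import Callable, Iterable, Iterator, List


def augment_in_batches(
    texts: Iterable[str],
    translate_batch: Callable[[List[str]], List[str]],
    batch_size: int = 32,
) -> Iterator[str]:
    """Yield augmented texts, calling `translate_batch` once per chunk."""
    batch: List[str] = []
    for text in texts:
        batch.append(text)
        if len(batch) == batch_size:
            # One model call for the whole chunk instead of one per text.
            yield from translate_batch(batch)
            batch = []
    if batch:
        # Flush the final, possibly smaller, chunk.
        yield from translate_batch(batch)
```

With a translation model behind `translate_batch`, this makes one model call per `batch_size` texts rather than one per text, while still yielding results in input order.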

@KennethEnevoldsen
Owner

I will close this as stale, @martincjespersen. Let me know if you would still like this change.
