-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding Word-alignment module for the back-translation task #26
Comments
Hi @hosseinfani You can see early results in this commit: link Now, if this module is suitable, I will start to use modules of word alignment and back-translation for the data augmentation task and work on the aspect semantic comparing to see which of the augmented reviews will be helpful for us to be added to the dataset. |
@farinamhz |
@hosseinfani |
…anslated version. Also, we do for translated version just in case (#26)
…anslated version. Also, we do for translated version just in case (#26)
@farinamhz Line 83 in eeeb48b
Also, right after translation and backtranslation, I do the alignment on Line 66 in eeeb48b
Line 70 in eeeb48b
Please check these lines and let me know your comments. |
In this issue, we are going to provide a module that gets two datasets of the same length (each of them contains a list of texts) as the input and gives the alignments list (these alignments will be between every two texts) as the output.
Alignments of the output for the given two texts will be a list of tuples, and each tuple contains an index of a token in the first text and an index of a token in the second text that aligns with text 1.
The text was updated successfully, but these errors were encountered: