My french_conjugation_transformation #212

Louanes1 · 2021-08-19T16:37:55Z

Faced this issue when using pre-commit :

When I went to check the "pre-commit-config.yaml" file, it says that the repo for this hook is local.

Since black, flake8 and isort hooks passed, I commited the code with following command :

git commit -m "My french_conjugation_transformation" -n

AbinayaM02 · 2021-08-20T07:23:28Z

Hi @Louanes1 : Thanks for your contribution. The pre-commit hook failed because your test case didn't pass. Did you try running the pytest for your transformation locally?

Louanes1 · 2021-08-20T07:53:23Z

Hi @Louanes1 : Thanks for your contribution. The pre-commit hook failed because your test case didn't pass. Did you try running the pytest for your transformation locally?

Thanks for the reply @AbinayaM02 :) Yes when I run the pytest , the test passes :

Oh, I think it's because I added a direct link to download the "fr_core_news_lg" in my requirements file, the build in "Checks" fails because of some characters in the link. I'll try to add it differently

AbinayaM02 · 2021-08-20T08:48:57Z

Oh, I think it's because I added a direct link to download the "fr_core_news_lg" in my requirements file, the build in "Checks" fails because of some characters in the link. I'll try to add it differently

Yes, you're right. The build is failing because of that.

Since your test case passed when you ran it separately, ideally pre-commit test hook shouldn't fail. Try running the pre-commit and see if it throws any specific message.

[Edit] Are you using ubuntu or windows while committing the code?

Louanes1 · 2021-08-20T15:33:09Z

Oh, I think it's because I added a direct link to download the "fr_core_news_lg" in my requirements file, the build in "Checks" fails because of some characters in the link. I'll try to add it differently

Yes, you're right. The build is failing because of that.

Since your test case passed when you ran it separately, ideally pre-commit test hook shouldn't fail. Try running the pre-commit and see if it throws any specific message.

[Edit] Are you using ubuntu or windows while committing the code?

I am using windows, and when I run the pre-commit, the test hook still fails :/
After removing the fr_core_news_lg, the build throws another error :

It looks like there is a conflict with the nltk version I use in my requirements. I removed the version so pip will attempt to resolve the dependecy conflict.

Louanes1 · 2021-08-23T08:06:46Z

Hi @AbinayaM02,
The dependencies installed correctly ! No more conflict :)
However, I use a spacy package called "fr_core_news_lg" and this one needs to be download with following : python -m spacy download fr_core_news_lg.

The installation is supposed to be similar to the en_core_web_sm found in initialize.py. I've added a link to download it in my requirements file (its commented otherwise special characters will fail the build : https://github.com/explosion/spacy-models/releases/download/fr_core_news_lg-3.0.0/fr_core_news_lg-3.0.0-py3-none-any.whl)

I have also tried to donwload the fr_core_news_lg, add it to my folder and add a link to that file in my requirements, but obviously it's too large, git won't let me push.

Do you have any idea on how I am suppose to proceed ? Thanks

AbinayaM02 · 2021-08-23T10:50:58Z

Do you have any idea on how I am suppose to proceed ? Thanks

Try adding your model in the below format in the requirement.txt and uncomment it. (Hopefully, it should work!)
fr_core_news_lg @ https://github.com/explosion/spacy-models/releases/download/fr_core_news_lg-3.0.0/fr_core_news_lg-3.0.0-py3-none-any.whl

Louanes1 · 2021-08-23T13:47:23Z

Do you have any idea on how I am suppose to proceed ? Thanks

Try adding your model in the below format in the requirement.txt and uncomment it. (Hopefully, it should work!)
fr_core_news_lg @ https://github.com/explosion/spacy-models/releases/download/fr_core_news_lg-3.0.0/fr_core_news_lg-3.0.0-py3-none-any.whl

Yes it works thanks ! @AbinayaM02
I've faced an exit code 137, maybe due to memory the model takes, I've switched fr_core_new_lg to fr_core_news_md. I will try with fr_core_news_sm if the issue still persists

richplant · 2021-09-10T13:36:24Z

Seems inefficient to load a second Spacy model that might never get used for more than this single transformation in the initialize script. You should probably import it as a module and call the .load() function directly on it in your script.

You also need to add the appropriate language tags and keywords to your class and a robustness evaluation to the readme (check the evaluate.py script in the main dir).

Added keywords to the class. Also merged changes from main.

mille-s · 2021-09-21T12:49:52Z

Hi, nice transformation! Do we have an idea of the accuracy of the substitution? Does it fail sometimes, and if so how often?

Louanes1 · 2021-09-21T13:51:23Z

Hi, nice transformation! Do we have an idea of the accuracy of the substitution? Does it fail sometimes, and if so how often?

Hello @mille-s thank you ! The conjugation of the verbs is quite robust since it is relying on the mlconjug library and their model is trained on the different verb groups

In french, we have 3 different group of verbs

1st group where verbs end with "er" like "manger, écouter"
2nd group where they end with "ir" like "finir, courir"
3rd group where they end with "re" like "boire, croire"

The conjugation of each group is different whatever the tense.

The french model behind mlconjug is trained to predict the conjugation based on these different groups, so that even when a verb do not exist like "facebooker", it will still consider it as a verb of 1st group and conjugate it accordingly.

However in order to conjugate a verb we first need to transform it into its "indicative" form (he ate --> to eat) and send it to the mlconjug function. And we do that with lemmatization, so I guess that if we can not get the lemma of the conjugated verb from the original sentence, we won't be able to conjugate it to a different tense. The spacy lemmatizer is quite good though so I didn't encounter verbs that could trigger these issue.

The common issues I encountered are more linked to the "pronoun" it should be conjugated to (because the conjugation also differs based on the pronoun used, and it is an entry of the mlconjug function). Right now it can handle sentences where the subject is a pronoun defined in our dictionnary. But if the subject is a group of words ( the parentsinstead of they) it won't be able to detect the pronoun, therefore won't know to which person the verb is supposed to be conjugated.

mille-s · 2021-09-21T16:34:39Z

@Louanes1 ok thanks for the answer! Note that there should be some kind of filter applied on the candidate input sentences that returns only sentences with one main verb and a subject pronoun before applying your transformation. Do you have such a filter at hand?

…r/NL-Augmenter into noun_compound_paraphraser

…mation Added disability_transformation

Added use_acronyms

…moval Auxiliary negation removal

add tense transform

German gender swap

Louanes1 · 2021-10-03T14:51:47Z

Hey @tuetschek , @mille-s

Re isn't is safe to assume that if there is no pronoun a verb is in third person? Then it's a matter of finding out if the verb is singular or plural, which should be doable. This could be a future improvement to reach more verbs.

Actually, this makes sense for verbs in english. In french the spelling is different when the verb is conjugated with each plural pronoun :

I ate --> Je mangeai
You ate --> Tu mangeas
He ate --> Il mangea
We ate --> Nous mangâmes
You ate --> Vous mangeâtes
They ate --> Ils mangèrent

There is quite a different ending depending on the pronoun used, that is why we cannot assume one of them in case we didn't find any.
I am trying to build a classification model where I pass a conjugated verb and I expect the pronoun it is conjugated to, to be returned.
I've gathered around 3000 verbs; I conjugate them to past, futur and present, then I map each one of them with its pronoun. (My dataset have 2 columns : all conjugated verbs, and pronoun) I am still working on it, trying to get the prediction, I will let you know as soon as I find something that could help us solve this issue.
By the way, which classification algorithm, you guys think I should use for this kind of task ?

Meanwhile, I believe it is better to conjugate verbs to the latest detected pronoun, so that it can handle cases where a subject does more than 1 action, many verbs should be conjugated to that pronoun.

Yoda transform

AbinayaM02 · 2021-10-04T05:00:13Z

Hi @Louanes1: Please add your transformation name to the test/mapper.py in the right dictionary for the pytest to pick up your test.json. By default, we're testing only light transformations and filters.

mille-s · 2021-10-04T08:33:17Z

@Louanes1 : what I meant was that if there is no pronoun before a verb, this verb will very likely be in third person since it's almost mandatory to have a first or second person pronoun to have a verb conjugated in first or second person (except for the imperative mood, for which the verb is used with no pronouns, but this mood is overall quite unfrequent). So to get the right ending of a verb when there is no pronoun, it boils down to finding the number (third sigular or third plural), which I think can be derived from the original verb form with simple regex in many cases (this needs to be checked with more care). In any case this can be added later as an improvement, no need to do it now.

…argument in test file

AbinayaM02 · 2021-10-04T18:09:21Z

Hi @Louanes1: Please do not rebase the branch. Follow the below steps to add your changes only,

Fetch and merge the main branch of your repository which you have already done on the UI.
Checkout the main branch and pull the main branch on to your local repository.

git checkout main
git pull origin main

Switch to your branch where you're making changes and pull the main branch into that.

git checkout french_conjugation_transformation
git pull origin main

If there are any conflicts, resolve them (mostly there won't be any except for the test/mapper.py where the latest file will have your changes). Add your changes pertaining to french conjugation transformation and commit.
Push your changes to the upstream repository.

git push origin french_conjugation_transformation

Your PR will be automatically updated.

You can either try to fix your current PR or open a new clean PR with only your changes.

Louanes1 · 2021-10-05T13:21:46Z

I've ended up creating a new PR : #308 :)
@AbinayaM02 @mille-s

AbinayaM02 · 2021-10-06T08:58:13Z

Closing this PR since a PR is created with the requested changes.

AbinayaM02 added the transformation label Aug 23, 2021

csinva and others added 11 commits September 20, 2021 22:16

update evaluation file

b5b6421

Merge branch 'GEM-benchmark:main' into ocr_perturbation

dbb75cf

moved example.py to test/helper.py

4a8a6f7

add keywords

8ac1789

Update README.md

6c46672

add tests for entity_mention_replacement_ner

8ac5189

temporarily removing test cases

1251764

adding test cases for entity_mention_replacement_ner

9d97ddd

Merge remote-tracking branch 'upstream/main' into filler_words

b77d4dc

Fixed RandomDeletion pytest failure

dd9d38d

Adding keywords

997e635

Added keywords to the class. Also merged changes from main.

feat: adding keywords / max_outputs param.

4ba9ccc

gokyori and others added 6 commits September 21, 2021 11:37

added keywords

959aa02

Added keywords.

ee96e13

Merge branch 'noun_compound_paraphraser' of https://github.com/juand-…

bc7e3fe

…r/NL-Augmenter into noun_compound_paraphraser

Update transformation.py

7cf3977

Update test.json

e7ba533

Update transformation.py

a95c30d

raft001 and others added 15 commits October 1, 2021 23:06

Update transformation.py

ca90e27

Update noun_pairs.json

5cb7be3

Merge branch 'GEM-benchmark:main' into disability_transformation

7ae6820

Merge branch 'GEM-benchmark:main' into GermanGenderSwap

1d4f389

Update transformation.py

7f15f58

Merge pull request GEM-benchmark#241 from raft001/disability_transfor…

b84becc

…mation Added disability_transformation

Merge branch 'GEM-benchmark:main' into GermanGenderSwap

c477206

Added transformation to test/mapper.py

310b6c9

Merge branch 'main' into auxiliary_negation_removal

e9e32ee

Added transformation to test/mapper.py

e1bb0c2

Merge branch 'main' into use_acronyms

4e74664

Merge pull request GEM-benchmark#218 from Sotwi/use_acronyms

d850b6e

Added use_acronyms

Merge pull request GEM-benchmark#149 from Sotwi/auxiliary_negation_re…

6a550b1

…moval Auxiliary negation removal

Merge pull request GEM-benchmark#135 from MukundVarmaT/tense

736fa20

add tense transform

Merge pull request GEM-benchmark#257 from raft001/GermanGenderSwap

73e66d0

German gender swap

rteehas and others added 2 commits October 3, 2021 20:49

update readme

e4ca383

Merge pull request GEM-benchmark#243 from rteehas/yoda_transform

c2b3f77

Yoda transform

Louanes Hamla added 5 commits October 4, 2021 16:09

My french_conjugation_transformation

3c7745d

use spacy model directly in the transformation file and add tense as …

86c4805

…argument in test file

add keywords

feaf2e3

use more conjugated verbs in example

015f9b6

rebase & add transformation name to mapper.py

199363e

Louanes1 mentioned this pull request Oct 5, 2021

French Conjugation Transformation clean #308

Merged

AbinayaM02 closed this Oct 6, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

My french_conjugation_transformation #212

My french_conjugation_transformation #212

Louanes1 commented Aug 19, 2021

AbinayaM02 commented Aug 20, 2021

Louanes1 commented Aug 20, 2021 •

edited

AbinayaM02 commented Aug 20, 2021 •

edited

Louanes1 commented Aug 20, 2021

Louanes1 commented Aug 23, 2021

AbinayaM02 commented Aug 23, 2021 •

edited

Louanes1 commented Aug 23, 2021

richplant commented Sep 10, 2021

mille-s commented Sep 21, 2021

Louanes1 commented Sep 21, 2021

mille-s commented Sep 21, 2021

Louanes1 commented Oct 3, 2021 •

edited

AbinayaM02 commented Oct 4, 2021

mille-s commented Oct 4, 2021

AbinayaM02 commented Oct 4, 2021

Louanes1 commented Oct 5, 2021 •

edited

AbinayaM02 commented Oct 6, 2021

My french_conjugation_transformation #212

My french_conjugation_transformation #212

Conversation

Louanes1 commented Aug 19, 2021

AbinayaM02 commented Aug 20, 2021

Louanes1 commented Aug 20, 2021 • edited

AbinayaM02 commented Aug 20, 2021 • edited

Louanes1 commented Aug 20, 2021

Louanes1 commented Aug 23, 2021

AbinayaM02 commented Aug 23, 2021 • edited

Louanes1 commented Aug 23, 2021

richplant commented Sep 10, 2021

mille-s commented Sep 21, 2021

Louanes1 commented Sep 21, 2021

mille-s commented Sep 21, 2021

Louanes1 commented Oct 3, 2021 • edited

AbinayaM02 commented Oct 4, 2021

mille-s commented Oct 4, 2021

AbinayaM02 commented Oct 4, 2021

Louanes1 commented Oct 5, 2021 • edited

AbinayaM02 commented Oct 6, 2021

Louanes1 commented Aug 20, 2021 •

edited

AbinayaM02 commented Aug 20, 2021 •

edited

AbinayaM02 commented Aug 23, 2021 •

edited

Louanes1 commented Oct 3, 2021 •

edited

Louanes1 commented Oct 5, 2021 •

edited