
training performance issue #7

Open
mlant opened this issue May 4, 2022 · 23 comments

mlant commented May 4, 2022

Hi,
Thank you very much for your work.
I'm using your model, but I'm having trouble pretraining a model from scratch that performs as well as yours.
Here are the best results I got:

  • BLEU: 0.20907722516675964
  • ROUGE-L: 0.19650848830796055
  • METEOR: 0.3384904110878787

Could you tell me what set of parameters you used?
Thanks,
NTDXYG (Owner) commented May 4, 2022

Your result looks strange. Generally speaking, the METEOR score should be the lowest, and the BLEU and ROUGE-L scores should be relatively high. I double-checked, and the METEOR you got is already similar to the one in my paper.

mlant (Author) commented May 4, 2022

The METEOR was indeed strange.
For example, in another training run I got these results:
BLEU: 0.16899427354084465
ROUGE-L: 0.15780111135381264
METEOR: 0.10913556588489155
I set early stopping = 6 and lr = 5e-4, and training stopped at the 17th step because of early stopping.
I don't know why the model stops improving so early.
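For reference, a rough sketch of how those settings might look, assuming the training script uses simpletransformers-style Seq2SeqArgs (an assumption on my side; the actual train.py may configure this differently):

    # Hypothetical sketch only: assumes simpletransformers-style arguments.
    from simpletransformers.seq2seq import Seq2SeqArgs

    args = Seq2SeqArgs()
    args.learning_rate = 5e-4             # lr = 5e-4, as in the run above
    args.use_early_stopping = True
    args.early_stopping_patience = 6      # "early stopping = 6", as in the run above
    args.evaluate_during_training = True  # early stopping needs periodic evaluation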

NTDXYG (Owner) commented May 4, 2022

I would like to confirm: is your dataset consistent with the one used in my paper, and was it preprocessed the same way? The parameters I used are the ones in train.py.

NTDXYG (Owner) commented May 4, 2022

And make sure you use nlg-eval for evaluation.
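For reference, a minimal sketch of scoring with nlg-eval's Python API (the file names here are placeholders for the hypothesis and reference files, one sentence per line):

    # Minimal nlg-eval sketch; 'hyp.txt' and 'ref.txt' are placeholder file names
    # with one sentence per line and matching line order.
    from nlgeval import compute_metrics

    metrics = compute_metrics(hypothesis='hyp.txt', references=['ref.txt'])
    print(metrics)  # includes Bleu_1..Bleu_4, METEOR, ROUGE_L, CIDEr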

mlant (Author) commented May 4, 2022

I used the RQ1 DeepCom dataset from their Git repository.
I also used nlg-eval for evaluation.
What kind of preprocessing did you do?

NTDXYG (Owner) commented May 4, 2022

code_seq, sbt = utils.transformer(code)
input_text = ' '.join(code_seq.split()[:256]) + ' '.join(sbt.split()[:256])
and "max_length" in train.py is 512

mlant (Author) commented May 4, 2022

Do you mean I need to do that before running train_tokenizer.py and train.py?
In use.py I can see where this preprocessing is done for generating a comment; however, for training, I don't see where this kind of preprocessing happens.

NTDXYG (Owner) commented May 4, 2022 via email

youngstudent2 commented:

Hi, how did you deal with the syntax errors in the original dataset?
I used code_seq, sbt = utils.transformer(code) to preprocess Hybrid-DeepCom RQ1, but got:

Traceback (most recent call last):
  File "preprocess.py", line 19, in <module>
    code_seq, sbt = transformer(row[0])
  File "/root/ComFormer/utils.py", line 180, in transformer
    ast = get_ast(processed_code)
  File "/root/ComFormer/utils.py", line 41, in get_ast
    for path, node in tree:
UnboundLocalError: local variable 'tree' referenced before assignment

By checking the data, I found that this is caused by syntax errors in the code in the dataset.
So did you leave those entries empty, or did you fix them in the dataset?

thanks 😄
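One possible way to guard against these samples, as a sketch (skipping unparsable code is my assumption, not necessarily the author's fix):

    # Hypothetical wrapper around utils.transformer for code that fails to parse;
    # whether to skip, blank out, or repair such samples is a separate decision.
    from utils import transformer

    def safe_transform(code):
        try:
            return transformer(code)
        except Exception:
            # get_ast fails on syntactically invalid Java, which surfaces as the
            # UnboundLocalError shown above; treat the sample as unusable
            return None, None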

NTDXYG (Owner) commented May 5, 2022 via email

NTDXYG (Owner) commented May 5, 2022 via email

youngstudent2 commented:

It helps, thanks a lot! ✨✨✨

mlant (Author) commented May 6, 2022

> Yes, before training the BPE tokenizer, you need to use this method to preprocess the corpus, just like I said in my paper.

Thanks, I had forgotten to do that.
I ran another training with the preprocessed data and the results are worse than before. I got:
[attached screenshot of training metrics]
training_progress_scores.csv

I'm still in the first epoch and the loss is increasing a lot. I removed early stopping because the model was stopping after 4 steps.

Do you have an idea of what's wrong?
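Following the quoted reply above (preprocess the corpus before training the BPE tokenizer), a rough sketch of that tokenizer step, assuming a Hugging Face tokenizers-style ByteLevelBPETokenizer (the repo's train_tokenizer.py may differ):

    # Hypothetical train_tokenizer.py-style step over the preprocessed corpus;
    # tokenizer class, file name, vocab size, and special tokens are assumptions.
    from tokenizers import ByteLevelBPETokenizer

    tokenizer = ByteLevelBPETokenizer()
    tokenizer.train(
        files=['train.input.txt'],   # preprocessed code + SBT corpus
        vocab_size=50000,
        min_frequency=2,
        special_tokens=['<s>', '<pad>', '</s>', '<unk>', '<mask>'],
    )
    tokenizer.save_model('./tokenizer')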

NTDXYG (Owner) commented May 6, 2022

I don't know why yet; you can start by fine-tuning my pre-trained model on the target dataset.
I will retrain it from scratch in the next few weeks to check.
It may also have something to do with the dataset; the DeepCom dataset seems to have changed at some point.
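A minimal sketch of that fine-tuning starting point, assuming the checkpoint is published as a standard Hugging Face seq2seq model under 'NTUYG/ComFormer' (the repo's own scripts may wrap this differently):

    # Hypothetical starting point; only the checkpoint name and max_length = 512
    # come from this thread, the rest is an assumption.
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    tokenizer = AutoTokenizer.from_pretrained('NTUYG/ComFormer')
    model = AutoModelForSeq2SeqLM.from_pretrained('NTUYG/ComFormer')

    # quick smoke test on one preprocessed code + SBT string before fine-tuning
    input_text = 'public int add ( int a , int b ) { return a + b ; } ...'  # placeholder
    inputs = tokenizer(input_text, max_length=512, truncation=True, return_tensors='pt')
    summary_ids = model.generate(inputs['input_ids'], max_length=64, num_beams=4)
    print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))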

NTDXYG (Owner) commented May 6, 2022

Maybe I know what your problem is.
Two of the parameters in 'train.py' were wrong and have now been corrected.
In the meantime, I downloaded the latest version of the DeepCom dataset and will post the related preprocessing code later.
I am also working on training from scratch and on continuing training from the 'NTUYG/ComFormer' model in parallel.
Updates to the model will follow.

NTDXYG (Owner) commented May 9, 2022

The training is done. I have uploaded the results as true.csv and predict.csv.
The new version of the model is being uploaded. The results are as follows:
Bleu_1: 0.564457
Bleu_2: 0.521086
Bleu_3: 0.488375
Bleu_4: 0.461608
METEOR: 0.411969
ROUGE_L: 0.595989
CIDEr: 4.095886

aaajeong commented:

Hello.
Could you tell me the BLEU score?

  • nltk.translate.bleu

NTDXYG (Owner) commented May 12, 2022

You can compute BLEU with 'predict.csv' and 'test.token.nl' (the latter can be downloaded from the current DeepCom dataset).
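A minimal sketch of that comparison with NLTK's corpus-level BLEU (how predict.csv is parsed here is an assumption about its layout):

    # Hypothetical corpus-level BLEU between predict.csv and test.token.nl;
    # assumes one tokenized sentence per line in both files, in matching order.
    from nltk.translate.bleu_score import corpus_bleu, SmoothingFunction

    with open('predict.csv') as f:
        hypotheses = [line.strip().split() for line in f]
    with open('test.token.nl') as f:
        references = [[line.strip().split()] for line in f]  # one reference each

    smooth = SmoothingFunction().method4
    print(corpus_bleu(references, hypotheses, smoothing_function=smooth))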

aaajeong commented:

Hello :)
I ran the test code as you told me.
I wanted to know the overall BLEU score, not BLEU-1, -2, -3, and -4, so I checked test.token.nl & predict.csv and got a BLEU score of about 20.
Considering the BLEU-1/2/3/4 scores you posted, I expected the BLEU score to be around 50.
I want to compare my BLEU score with the one you got.
Could you tell me?
Thank you

NTDXYG (Owner) commented May 14, 2022

Please share the code you used to compute the BLEU score.

aaajeong commented:

Oh, sorry @NTDXYG.
The code was like this:
[attached screenshot of the BLEU evaluation code]
There was a problem with the code.
After asking the question, I ran it again and found that the BLEU score is around 56.
Thank you for your quick answer.

NTDXYG (Owner) commented May 14, 2022

Okay, that's fine. If you still have questions and difficulties with my code, please do let me know. This helps me to improve the code of ComFormer.

By the way, if you want to communicate with me more quickly, you can send an email to 744621980@qq.com or contact me directly on WeChat at 744621980.

aaajeong commented:

Thank you 😄
