training performance issue #7
Comments
It looks like you have a strange result. Generally speaking, the METEOR metric should have the lowest score and the BLEU and Rouge-L scores should be relatively high. I double-checked and you got a METEOR that was already similar to the one in my paper. |
It was indeed strange for the METEOR. |
Could you confirm that your dataset, and the way it was preprocessed, is consistent with the one used in my paper? The parameters I used are the ones in train.py. |
And make sure you use nlg-eval for evaluation. |
I used the RQ1 DeepCom dataset from their Git repository. |
code_seq, sbt = utils.transformer(code) |
Do you mean I need to do that before executing train_tokenizer.py and train.py? |
Yes. Before training the BPE tokenizer, you need to use this method to preprocess the corpus, just as I said in my paper.
…---Original---
From: ***@***.***>
Date: Wed, May 4, 2022 20:10 PM
To: ***@***.***>;
Cc: ***@***.******@***.***>;
Subject: Re: [NTDXYG/ComFormer] training performance issue (Issue #7)
In use.py, which generates the comment, I can see where this preprocessing is done. For training, however, I don't know where this kind of preprocessing is done.
|
Hi, how did you deal with the syntax errors in the original dataset?
By checking the data, I found that the failure is caused by syntax errors in the dataset's code. Thanks 😄 |
Skip the samples that raise errors.
…---Original---
From: ***@***.***>
Date: Thu, May 5, 2022 22:39 PM
To: ***@***.***>;
Cc: ***@***.******@***.***>;
Subject: Re: [NTDXYG/ComFormer] training performance issue (Issue #7)
Hi, how did you deal with the syntax errors in the original dataset?
I used code_seq, sbt = utils.transformer(code) to preprocess Hybrid-DeepCom RQ1, but got this error:
Traceback (most recent call last):
  File "preprocess.py", line 19, in <module>
    code_seq, sbt = transformer(row[0])
  File "/root/ComFormer/utils.py", line 180, in transformer
    ast = get_ast(processed_code)
  File "/root/ComFormer/utils.py", line 41, in get_ast
    for path, node in tree:
UnboundLocalError: local variable 'tree' referenced before assignment
By checking the data, I found that it is caused by syntax errors in the dataset's code.
So did you leave those samples empty, or did you fix them in the dataset?
Thanks 😄
|
You can use try and except to skip these errors.
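A minimal sketch of that advice, assuming the corpus is a list of (code, comment) pairs. `preprocess_corpus` and `toy_transform` are hypothetical names used only for illustration; in the real pipeline you would pass `utils.transformer` from the repository as `transform`.

```python
from typing import Callable, Iterable, List, Tuple


def preprocess_corpus(
    pairs: Iterable[Tuple[str, str]],
    transform: Callable[[str], Tuple[str, str]],
) -> Tuple[List[Tuple[str, str, str]], int]:
    """Apply `transform` to every code snippet and skip samples that fail."""
    kept = []
    skipped = 0
    for code, comment in pairs:
        try:
            code_seq, sbt = transform(code)
        except Exception:
            # Unparsable code (for example the UnboundLocalError in the
            # traceback above) ends up here and the sample is dropped.
            skipped += 1
            continue
        kept.append((code_seq, sbt, comment))
    return kept, skipped


def toy_transform(code: str) -> Tuple[str, str]:
    """Hypothetical stand-in for utils.transformer: rejects unbalanced braces."""
    if code.count("{") != code.count("}"):
        raise ValueError("syntax error")
    return code.lower(), "(sbt placeholder)"


samples = [
    ("int f() { return 1; }", "returns one"),
    ("int g() { return 2;", "broken sample"),
]
kept, skipped = preprocess_corpus(samples, toy_transform)
print(f"kept {len(kept)} samples, skipped {skipped}")
```

Counting the skipped samples makes it easy to check how much of the dataset was dropped before training.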
|
it helps , thanks a lot ! ✨✨✨ |
Thanks, I forgot to do so. I'm still in the first epoch and the loss is increasing a lot. I removed the early stopping because the model was stopping at 4 steps. Do you have an idea about what's wrong? |
I don't know why yet. You can start by fine-tuning my pre-trained model on the target dataset. |
Maybe I know what your problem is. |
The training is done. The results I have uploaded are true.csv and predict.csv. |
Hello.
|
You can compute BLEU with 'predict.csv' and 'test.token.nl' (the latter can be downloaded from the current DeepCom dataset). |
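For readers who want to see what such a score involves, here is a rough, self-contained sketch of corpus-level BLEU-4 with one reference per prediction. It is not the nlg-eval implementation recommended in this thread, and its numbers may differ from nlg-eval because of tokenization and smoothing. The function names and whitespace tokenization are assumptions, as is the idea that both files hold one sentence per line.

```python
import math
from collections import Counter
from typing import List, Sequence


def ngrams(tokens: Sequence[str], n: int) -> Counter:
    """Count all n-grams of length n in a token sequence."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(
    references: List[List[str]],
    hypotheses: List[List[str]],
    max_n: int = 4,
) -> float:
    """Unsmoothed corpus-level BLEU with a single reference per hypothesis."""
    matches = [0] * max_n
    totals = [0] * max_n
    ref_len = 0
    hyp_len = 0
    for ref, hyp in zip(references, hypotheses):
        ref_len += len(ref)
        hyp_len += len(hyp)
        for n in range(1, max_n + 1):
            hyp_counts = ngrams(hyp, n)
            ref_counts = ngrams(ref, n)
            # Counter intersection keeps the minimum count (clipped matches).
            matches[n - 1] += sum((hyp_counts & ref_counts).values())
            totals[n - 1] += sum(hyp_counts.values())
    if hyp_len == 0 or min(matches) == 0:
        return 0.0  # no smoothing: any zero precision gives a zero score
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    brevity_penalty = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return brevity_penalty * math.exp(log_precision)


# Toy data; in practice the lines would come from test.token.nl and predict.csv.
references = ["returns the sum of two numbers".split()]
hypotheses = ["returns the sum of two integers".split()]
print(f"BLEU-4: {corpus_bleu(references, hypotheses):.4f}")
```

For results comparable to the paper, nlg-eval remains the safer choice, since that is the tool named in this thread.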
Hello :) |
Please share the code you used to compute the BLEU score.
Oh Sorry @NTDXYG |
Okay, that's fine. If you still have questions or difficulties with my code, please do let me know. This helps me improve the ComFormer code. By the way, if you want to reach me quickly, you can send an email to 744621980@qq.com or message me directly on WeChat at 744621980. |
Thank you 😄 |
Hi,
Thank you very much for your work.
I'm using your model, and I'm having trouble training a model from scratch that is as good as your pretrained one.
Here are the best results I got:
Could you tell me which set of parameters you used?
Thanks,