
Are the training loss and validation loss recorded? #28

Closed
shatealaboxiaowang opened this issue Jan 24, 2024 · 4 comments

Comments

@shatealaboxiaowang

Hi,

Thank you very much for your code. I am reproducing your training process and would like to know what your training loss and validation loss were during training, so that I can align my runs with yours on the Magicoder-OSS-Instruct-75K and ise-uiuc/Magicoder-Evol-Instruct-110K datasets.

Thanks

@shatealaboxiaowang
Author

Thank you for open-sourcing this. I am replicating your fine-tuning process with the code on GitHub. Do my results of train_loss = 0.16 and eval_loss = 0.21 on the 75K dataset match yours? I will continue training on the 110K dataset.
I trained for 4 epochs, and the model indeed started overfitting after the second epoch.

@UniverseFly
Member

Magicoder-S-CL.json
Magicoder-CL.json
Magicoder-S-DS.json
Magicoder-DS.json

Hi, here are the trainer states. Hope they can help!
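
For anyone comparing their own runs against these files: a minimal sketch of how the attached trainer states could be plotted, assuming they follow the standard Hugging Face Trainer `trainer_state.json` layout (a `log_history` list whose entries carry `loss` for training logs and `eval_loss` for evaluation logs). matplotlib is just one convenient choice here, and the file name is one of the attachments above.

```python
import json
import matplotlib.pyplot as plt

# Load one of the attached trainer states.
with open("Magicoder-S-DS.json") as f:
    state = json.load(f)

# Training steps log "loss"; evaluation steps log "eval_loss".
train = [(e["step"], e["loss"]) for e in state["log_history"] if "loss" in e]
evals = [(e["step"], e["eval_loss"]) for e in state["log_history"] if "eval_loss" in e]

if train:
    plt.plot(*zip(*train), label="train loss")
if evals:
    plt.plot(*zip(*evals), label="eval loss", marker="o")
plt.xlabel("step")
plt.ylabel("loss")
plt.legend()
plt.show()
```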

@shatealaboxiaowang
Author

shatealaboxiaowang commented Jan 29, 2024

> Magicoder-S-CL.json Magicoder-CL.json Magicoder-S-DS.json Magicoder-DS.json
>
> Hi, here are the trainer states. Hope they can help!

Thank you very much. My training losses are basically the same as yours, and my test results on the HumanEval dataset are also largely consistent.

But I have a question: the model is fully fine-tuned on instruction data, so why does its infilling capability increase?
I look forward to hearing from you.

@UniverseFly
Member

Good to hear you can reproduce it. Yeah we did observe that the infilling capability was at least not decreasing. We believe this is because the model learned some general alignment during the instruction tuning, and infilling is a kind of alignment based on the surrounding context. Further study of this phenomenon would be interesting.
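
If anyone wants to spot-check the infilling behaviour on their side, a rough sketch is below. The fill-in-the-middle token strings `<PREFIX>`, `<SUFFIX>`, and `<MIDDLE>` are placeholders rather than real special tokens; substitute whatever FIM tokens the underlying base model documents. The checkpoint name is only an example.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ise-uiuc/Magicoder-S-DS-6.7B"  # example checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Build a fill-in-the-middle prompt: the model should complete the missing body.
# NOTE: <PREFIX>/<SUFFIX>/<MIDDLE> are placeholders for the base model's FIM tokens.
prefix = "def add(a, b):\n    "
suffix = "\n    return result\n"
prompt = f"<PREFIX>{prefix}<SUFFIX>{suffix}<MIDDLE>"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
# Print only the newly generated (infilled) part.
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```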
