I modified the model and evaluation code to train and test on the NEXT-QA dataset. However, the results fluctuate by about 1 percentage point across training runs. Are the experimental results stable during training? I suspect this variability stems from certain parameter settings of the large language model (LLM). Do you have any empirical suggestions for addressing this issue? Thank you!
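For reference, below is the kind of determinism setup I would expect to matter here. This is a minimal sketch assuming a standard PyTorch + Hugging Face pipeline; the specific flags used in this repo may differ, and `set_global_seed` is just an illustrative helper, not a function from this codebase:

```python
import os
import random

import numpy as np
import torch


def set_global_seed(seed: int = 42) -> None:
    """Pin every RNG that typically affects a PyTorch training run."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    # Force deterministic cuDNN kernels (slower, but reproducible).
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False
    # Needed for deterministic cuBLAS matmuls on recent PyTorch/CUDA versions.
    os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"


set_global_seed(42)

# At evaluation time, greedy decoding removes sampling noise from the LLM,
# e.g. with a Hugging Face model:
#   outputs = model.generate(**inputs, do_sample=False, num_beams=1)
```

Even with all of this pinned, some CUDA ops are inherently non-deterministic, so a residual ~1-point spread may simply reflect seed sensitivity rather than a bug; reporting the mean and standard deviation over several seeds is a common workaround.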