Give h2oGPT proper personality #73
Line 1252 in 97b9f2d
(env) arno@rippa:/nfs4/llm/h2ogpt(main)$ NCCL_P2P_LEVEL=LOC WORLD_SIZE=2 CUDA_VISIBLE_DEVICES="0,1" torchrun --nproc_per_node=2 finetune.py --base_model=h2oai/h2ogpt-oig-oasst1-512-6.9b --data_path=h2oai/openassistant_oasst1_h2ogpt_only --micro_batch_size=2 --batch_size=32 --cutoff_len=2048 --run_id=100 --num_epochs=1 &> run.100.text
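A rough sketch of the batch arithmetic behind the command above, assuming `--batch_size` is the global effective batch size and `--micro_batch_size` is the per-GPU batch per forward/backward pass. That convention is common in LoRA fine-tuning scripts but is not confirmed here for `finetune.py`; the helper name `grad_accum_steps` is hypothetical.

```python
def grad_accum_steps(batch_size: int, micro_batch_size: int, world_size: int) -> int:
    """Gradient accumulation steps needed to reach the global batch size.

    Assumes batch_size is global and micro_batch_size is per GPU
    (an assumption, not taken from finetune.py itself).
    """
    samples_per_step = micro_batch_size * world_size
    if batch_size % samples_per_step != 0:
        raise ValueError("batch_size must be divisible by micro_batch_size * world_size")
    return batch_size // samples_per_step


# Values from the command: --batch_size=32 --micro_batch_size=2, 2 processes
steps = grad_accum_steps(batch_size=32, micro_batch_size=2, world_size=2)
print(steps)
```

Under that assumption, each optimizer update would accumulate gradients over 8 micro-steps per GPU.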
https://slack-files.com/T0329MHH6-F054U3491DF-a909f3bb3b LoRA weights and logs
Comparing the validation score on only 1k ShareGPT before and after the personality refresh, i.e. just "hammering" in the personality updates and nothing else.
Before / After: (plots not preserved)
give h2oai/h2ogpt-oasst1-512-6.9b personality
So hammering in only the personality and nothing else obviously hurts too much; the variety of the fine-tuning data needs to be kept. For smaller models, the easiest option is to train a couple of epochs on the full personalized dataset instead. For larger models, we can try a mixin mode, with small personality data plus a large mixin, but not all 80k rows.
Note: didn't do that, just added 2 epochs on top of the existing model.
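The mixin idea could be sketched roughly as below: keep all personality rows and add a random subset of the general fine-tuning rows rather than all 80k. This was not run for this issue; the function name `build_mixin`, the row layout, and the sizes (100 personality rows, 10k mixin rows) are hypothetical placeholders.

```python
import random


def build_mixin(personality_rows, general_rows, mixin_size, seed=0):
    """Combine all personality rows with a random sample of general rows.

    mixin_size caps how many general rows are mixed in, so the
    personality data is not drowned out and training stays short.
    """
    rng = random.Random(seed)
    k = min(mixin_size, len(general_rows))
    mixin = rng.sample(general_rows, k)
    combined = list(personality_rows) + mixin
    rng.shuffle(combined)  # interleave so batches see both kinds of rows
    return combined


# Hypothetical sizes for illustration only
personality = [{"source": "personality", "id": i} for i in range(100)]
general = [{"source": "general", "id": i} for i in range(80_000)]

mixed = build_mixin(personality, general, mixin_size=10_000)
print(len(mixed))
```

The mixin size is the knob to tune: too small and the model loses variety, as seen above; too large and the run approaches a full pass over the 80k rows.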
https://slack-files.com/T0329MHH6-F054GHUJJFM-eebaba5e98 LoRA weights and logs
https://huggingface.co/h2oai/h2ogpt-oig-oasst1-512-6.9b model card created by fcaae7e
give h2oai/h2ogpt-oasst1-512-12b personality
Continue #22 (comment)
Comparing the validation score on 1k ShareGPT before and after the personality refresh.
Before / After: (plots not preserved)
give h2oai/h2ogpt-oasst1-512-20b personality
https://slack-files.com/T0329MHH6-F054HBLG2LB-7a39e222f8 LoRA weights and logs
Fix https://github.com/h2oai/h2ogpt/blob/main/FAQ.md#why-does-the-h2ogpt-say-it-was-trained-by-openai-or-open-assistant