In which Training step do you use HH-RLHF and SHP datasets? Thanks for your help.
In which Training step do you use HH-RLHF and SHP datasets?
Thanks for your help.