After running the code, I’ve got a few questions:
How do I control the number of QA pairs it generates?
No matter how much content I put in the jsonl file, it always ends up making just 5 QA pairs.
If the knowledge content changes next time, do I need to delete the cache and run it again?
I ask because it seems to still be using old data from graph.graphml.
Are there any key parameters I should pay attention to or adjust each time I run it?
My API supports the logprobs feature, but all the loss values still come out the same. Do I need to manually enable hard case mining?
Thanks!! orz