Thank you for this excellent project! I have two questions regarding the SearchQA environment configuration:
- SearchQA Data Split
I noticed in SkillOpt/configs/searchqa/default.yaml: train_size: 400
Could you clarify how the SearchQA dataset is split for training/validation/testing? Specifically:
Training set: 400 samples (randomly sampled?)
Validation set: 200 samples?
Test set: 1400 samples?
Is this split fixed or randomly sampled each run?
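To make the question concrete, here is a minimal sketch of the kind of fixed, seeded split I have in mind. The function name `split_indices`, the `seed` argument, and the 200/1400 validation/test sizes are my own assumptions for illustration, not something taken from the repository; only `train_size: 400` comes from the config.

```python
import random


def split_indices(n_total, train_size=400, val_size=200, test_size=1400, seed=0):
    """Hypothetical helper: deterministic, non-overlapping train/val/test split.

    Only train_size=400 comes from the config; the other sizes and the seed
    are guesses used to illustrate the question.
    """
    needed = train_size + val_size + test_size
    if needed > n_total:
        raise ValueError(f"need {needed} samples, dataset has {n_total}")

    # A dedicated RNG with a fixed seed yields the same shuffle on every run.
    rng = random.Random(seed)
    indices = list(range(n_total))
    rng.shuffle(indices)

    train = indices[:train_size]
    val = indices[train_size:train_size + val_size]
    test = indices[train_size + val_size:needed]
    return train, val, test
```

If the actual split works like this, the same seed would give the same three subsets on every run; if the seed is unset or changes per run, the subsets would differ, which is what I would like to confirm.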
- max_completion_tokens=512 Truncation Concern
In SkillOpt/skillopt/envs/searchqa/rollout.py, I observed: max_completion_tokens=512.
Does this token limit frequently cause output truncation during agent rollout? Have you observed significant truncation issues in practice?
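For reference, this is roughly how I would measure truncation myself. It is a sketch that assumes the rollout goes through an OpenAI-style Chat Completions API, where a response cut off by the token limit reports a finish reason of "length"; the helper name `truncation_rate` is hypothetical and I have not checked how rollout.py actually records finish reasons.

```python
def truncation_rate(finish_reasons):
    """Hypothetical helper: fraction of completions cut off by the token limit.

    Assumes each entry is the finish reason of one completion, with "length"
    meaning the output stopped because it hit max_completion_tokens.
    """
    if not finish_reasons:
        return 0.0
    truncated = sum(1 for reason in finish_reasons if reason == "length")
    return truncated / len(finish_reasons)


# Example with made-up finish reasons from four rollout steps.
example_rate = truncation_rate(["stop", "length", "stop", "stop"])
```

If you already have a number like this from your own runs, that would answer the question directly.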
Thank you for your time and clarification!