Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请教一下,为什么数据量少的数据集比如采样率设置为0.1 #47

Open
Baboom-l opened this issue May 18, 2024 · 2 comments

Comments

@Baboom-l
Copy link

RefCOCO这些小数据集设置为0.1,而o365这种大型的采样率为1,这样不会导致数据量小的数据上训练不充分吗?还是说这个采样率不是根据数据集大小来的

@shenyunhang
Copy link
Owner

主要是不希望在小数据集上过拟合,希望模型训练过的样本越多越好,所以小数据集的采样率要低。
其实虽然小数据集采样率低,但是迭代步数长,模型还是看了这些数据集好几遍。

@Baboom-l
Copy link
Author

这个采样率是每次迭代的采样概率吗?我以为是对整个数据集的采样

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants