Training-related question #43

Closed
CHAOJICHENG5 opened this issue Mar 26, 2024 · 1 comment

Comments

@CHAOJICHENG5

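Hello, I have a question: do negative samples need to be added to the training set? Table 2 of the paper lists the size of the CodeX-S-Train dataset as 32888, but when I open CoDeX-S-train.json I find that it contains not only 32,888 positive samples but also 131,522 negative samples. Should negative samples be included in the dataset during training?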

@Zhang-Each
Collaborator

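Yes. The model has to judge whether a sample is positive or negative, so the training set cannot contain only positive samples. Negative samples can be generated by randomly replacing the head or tail entity of a positive triple, which is similar to how traditional knowledge graph embedding models are trained.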
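For anyone who wants to reproduce this kind of data, here is a minimal sketch of head/tail replacement, not the repository's actual preprocessing script. The function names, the `(head, relation, tail)` tuple layout, and the default of four negatives per positive are assumptions for illustration; the ratio is only chosen because the counts quoted in the question work out to roughly four negatives per positive.

```python
import random


def corrupt_triple(triple, entities, known_triples, rng, max_tries=100):
    """Return a negative triple by swapping the head or the tail entity.

    Candidates that already appear in known_triples are rejected, so a
    true fact is never labeled as negative (hypothetical helper).
    """
    head, relation, tail = triple
    for _ in range(max_tries):
        replacement = rng.choice(entities)
        if rng.random() < 0.5:
            candidate = (replacement, relation, tail)  # corrupt the head
        else:
            candidate = (head, relation, replacement)  # corrupt the tail
        if candidate not in known_triples:
            return candidate
    raise ValueError(f"no valid negative found for {triple}")


def build_training_set(positives, entities, negatives_per_positive=4, seed=0):
    """Pair every positive triple (label 1) with sampled negatives (label 0)."""
    rng = random.Random(seed)
    known = set(positives)
    samples = []
    for triple in positives:
        samples.append((triple, 1))
        for _ in range(negatives_per_positive):
            negative = corrupt_triple(triple, entities, known, rng)
            samples.append((negative, 0))
    return samples


if __name__ == "__main__":
    # Toy data, made up for this example.
    toy_positives = [("a", "r", "b"), ("b", "r", "c")]
    toy_entities = ["a", "b", "c", "d"]
    for sample in build_training_set(toy_positives, toy_entities):
        print(sample)
```

Filtering against the known triples is a common choice in knowledge graph embedding training, but whether the released CoDeX-S-train.json was built with such a filter is not stated in this thread.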
