What needs to change to use an English dataset? #25

Open
kuroko730 opened this issue Jan 31, 2021 · 2 comments

Comments

@kuroko730

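Thanks to the author for such an excellent project!
I would now like to use the triple relation extraction in your project to process some English datasets.
As far as I can tell, I need to switch to an English pretrained BERT model. Beyond that, do I also need to modify tokenizer.py?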

@920232796
Owner

I don't think the tokenizer needs to be modified, but I haven't tried English data myself, so I'm not sure. Give it a try~ I think it should be fine.

@kuroko730
Author

Thanks for the reply!
I'll leave the tokenizer unchanged for now and try swapping the Chinese vocab for an English one.
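
For anyone trying the same vocab swap, here is a minimal sketch of what an English BERT vocab implies for tokenization. It is not code from this repository: `load_vocab` and `wordpiece` are hypothetical helper names, and the toy vocab is made up for illustration. English BERT vocab files list one token per line and mark word-internal pieces with a `##` prefix, so it may be worth checking that tokenizer.py splits English words in a comparable way before training.

```python
from typing import Dict, Iterable, List


def load_vocab(lines: Iterable[str]) -> Dict[str, int]:
    """Map each token to its line index, as in a BERT-style vocab.txt."""
    vocab = {}
    for idx, line in enumerate(lines):
        token = line.rstrip("\n")
        if token:
            vocab[token] = idx
    return vocab


def wordpiece(word: str, vocab: Dict[str, int], unk: str = "[UNK]") -> List[str]:
    """Greedy longest-match-first split of a single word into WordPiece tokens."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        piece = None
        while start < end:
            candidate = word[start:end]
            if start > 0:
                # Pieces that continue a word carry the "##" prefix.
                candidate = "##" + candidate
            if candidate in vocab:
                piece = candidate
                break
            end -= 1
        if piece is None:
            # No prefix of the remainder is in the vocab: the whole word is unknown.
            return [unk]
        pieces.append(piece)
        start = end
    return pieces


# Toy vocab (an assumption for illustration, not a real BERT vocab file).
toy_vocab = load_vocab(["[PAD]", "[UNK]", "[CLS]", "[SEP]", "play", "##ing", "the"])
tokens = wordpiece("playing", toy_vocab)
ids = [toy_vocab[t] for t in tokens]
```

With a real vocab, `load_vocab(open(path, encoding="utf-8"))` would take the place of the toy list, where `path` points at the English model's vocab.txt. Uncased English checkpoints also expect lowercased input, which this sketch leaves to the caller.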
