Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

is possible to pass muti name for one category #5

Closed
graceyangfan opened this issue Dec 5, 2020 · 2 comments
Closed

is possible to pass muti name for one category #5

graceyangfan opened this issue Dec 5, 2020 · 2 comments

Comments

@graceyangfan
Copy link

It seems that the model will first find the names in label_names.txt to mask the original text.If the category name has never or rarely been in the train txt,small train materials will be used in the first step,so does it possible to add more words that are smililar to the category name to the label_names.txt.for example instead of post {0:"sports"},just post {0:"competion","players"},asuming that sports has never occured in the train materials

@yumeng5
Copy link
Owner

yumeng5 commented Dec 6, 2020

Hi @graceyangfan,

Thanks for the question. Yes, you can use multiple words as the label name of each category. In fact, there is not a strict definition of which words should be the "label name"--you can choose to use whatever words that can represent the category well and ideally frequently appear in the corpus. As mentioned in the README file, "if multiple words are used as the label name of a category, put them in the same line and separate them with whitespace characters". As an example, you could refer to DBPedia dataset where some categories have multiple words as the label name.

Let me know if you have any further questions!

Thanks,
Yu

@graceyangfan
Copy link
Author

graceyangfan commented Dec 6, 2020

@yumeng5 thanks for your replay, it really help me to understand the ideas in your paper

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants