Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于修改标签句子长度的情况 #24

Open
lmw0320 opened this issue May 11, 2021 · 1 comment
Open

关于修改标签句子长度的情况 #24

lmw0320 opened this issue May 11, 2021 · 1 comment

Comments

@lmw0320
Copy link

lmw0320 commented May 11, 2021

我发现代码对原始数据的处理方法是用同音字替换,和单字随机替换法(这两种方法,都是没有改变原始句子的长度)。。好像没有实现增减字的情况,不知道这种情况下的label应该是怎样的。。
恳请指点~~

@currenttime
Copy link

同问,我的理解是相同的字补0,不相同的补1
但是如果增减字应该怎么处理?
我的数据例子如下,供大家参考,如果有问题欢迎指出交流:
origin_text,random_text,label
"同意113,871,899股,占出席会议所有股东所持表决权100%,反对0股,弃权0股。","同意113,871,899股,占出席会议所有股专所持表决权100%,反对0股,弃权0股。",0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants