Question about the models on Hugging Face #14

Open
SXUleiyang opened this issue Jan 5, 2024 · 2 comments

Comments

@SXUleiyang

Hello, I downloaded your pretrained model from nianlong/memsum-word-embedding. My question is: how can I train vocabulary_200dim.pkl and unigram_embeddings_200dim.pkl on my own Chinese dataset? I hope you can reply.

@nianlonggu
Owner

Hello, you can train word embeddings on your Chinese dataset with word2vec, or use pretrained word vectors such as https://github.com/Embedding/Chinese-Word-Vectors
Alternatively, look on Hugging Face for a Chinese BERT and use it to replace the local sentence encoder in MemSum.
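
For the pretrained-vectors option, a minimal sketch of the conversion step might look like the following. It assumes the vectors come in the word2vec text format (a `count dim` header line, then one `word v1 ... v_dim` row per word), and it assumes the two pickle files hold a list of words and a parallel list of vectors. That layout is a guess and should be checked against the files downloaded from nianlong/memsum-word-embedding. The function names `parse_word2vec_text` and `save_pickles` are hypothetical helpers, not part of MemSum.

```python
import io
import pickle


def parse_word2vec_text(lines):
    """Parse word2vec text format: a 'count dim' header, then 'word v1 ... v_dim' rows."""
    it = iter(lines)
    header = next(it).split()
    dim = int(header[1])
    words, vectors = [], []
    for line in it:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip rows that do not have exactly one word plus dim values
        words.append(parts[0])
        vectors.append([float(x) for x in parts[1:]])
    return words, vectors


def save_pickles(words, vectors, vocab_path, emb_path):
    """Write the word list and the vector list to two pickle files.

    ASSUMPTION: MemSum may expect a different layout (for example a NumPy
    array for the embeddings), so inspect the original files first.
    """
    with open(vocab_path, "wb") as f:
        pickle.dump(words, f)
    with open(emb_path, "wb") as f:
        pickle.dump(vectors, f)


# Tiny in-memory sample standing in for a real vector file.
sample = io.StringIO("2 3\n中文 0.1 0.2 0.3\n摘要 0.4 0.5 0.6\n")
words, vectors = parse_word2vec_text(sample)

# With a real file (hypothetical path), the call would be:
# with open("my_chinese_vectors.txt", encoding="utf-8") as f:
#     words, vectors = parse_word2vec_text(f)
# save_pickles(words, vectors, "vocabulary_200dim.pkl", "unigram_embeddings_200dim.pkl")
```

The dimension of the vectors you load should match what your MemSum configuration expects (the file names here suggest 200). Vectors trained on your own corpus with word2vec can go through the same conversion if they are exported in the text format.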

@SXUleiyang
Author


Thank you very much for your reply! I will keep following your work.
