-
Notifications
You must be signed in to change notification settings - Fork 759
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
use Multilingual pretrain model Bert #40
Comments
you can have a try. |
i want to use model with vietnamese language. The important of change is share parameters, i know. how i can train with my language. Thanks for support =) |
1、you can change vocab.txt in ./albert_config, then set non_chinese to True when create pretrain data using create_pretraining_data.py |
okay. Thanks for support. Best repo =) |
I have tried to pretrain with my dataset, but i see the loss is very small but accuracy is not improve. How i can improve result |
@brightmart Can we have a multilingual model for just Chinese and English? Cause in practical scenerios we may meet many english words in APP names, music names, Apple's all products's name and so on, and Google's multilingual model has too many languages. Our daliy life cannot leave English, you can see that Apple try to use purely Chinese in its products, such as replace Finder with 访达 which I think is totally a mess. |
please tell me, can i use Multilingual pretrain model from Bert to train custom data with albert code ???
The text was updated successfully, but these errors were encountered: