use Multilingual pretrain model Bert #40

Open
kewin1807 opened this issue Oct 27, 2019 · 6 comments

Comments

@kewin1807

Please tell me: can I use the multilingual pretrained model from BERT to train on custom data with the ALBERT code?

@brightmart
Owner

You can give it a try.
Be aware that there are some differences between BERT and ALBERT in modeling.py.
Why do you want to train a multilingual model?

@kewin1807
Author

I want to use the model with the Vietnamese language. I know the important change is parameter sharing. How can I train it with my language? Thanks for the support =)
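For context, the parameter sharing mentioned above is ALBERT's key structural change relative to BERT: one set of transformer-layer weights is reused at every layer, so the parameter count does not grow with depth. A minimal sketch (a hypothetical simplified "encoder", not the repo's actual code) to illustrate the idea:

```python
import numpy as np

class SharedEncoder:
    """Toy encoder that applies ONE weight matrix at every "layer",
    mimicking ALBERT-style cross-layer parameter sharing."""

    def __init__(self, hidden_size, num_layers, seed=0):
        rng = np.random.default_rng(seed)
        # A single parameter set, reused by every layer.
        self.W = rng.normal(scale=0.02, size=(hidden_size, hidden_size))
        self.num_layers = num_layers

    def forward(self, x):
        for _ in range(self.num_layers):
            x = np.tanh(x @ self.W)  # same W at every depth
        return x

enc = SharedEncoder(hidden_size=8, num_layers=12)
out = enc.forward(np.ones((2, 8)))
print(enc.W.size)   # 64 -- one 8x8 matrix regardless of num_layers
print(out.shape)    # (2, 8)
```

Because the shared weights are language-agnostic structure, retraining for Vietnamese mainly means supplying a Vietnamese vocabulary and corpus, as described in the next comment.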

@brightmart
Owner

1. You can change vocab.txt in ./albert_config, then set non_chinese to True when creating the pretraining data with create_pretraining_data.py.
2. Then do the pre-training with run_pretraining.py.
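The two steps above might look like the following commands. This is only a sketch: the input/output paths and hyperparameter values are placeholders, and the flag names follow the BERT-style scripts this repo is based on (apart from non_chinese, which is named in the comment above), so check the flag definitions in the repo's scripts before running.

```shell
# Step 1: build pretraining examples from your corpus.
# Assumes vocab.txt in ./albert_config was replaced with a Vietnamese vocab.
python create_pretraining_data.py \
  --input_file=./data/corpus_vi.txt \
  --output_file=./data/tf_examples.tfrecord \
  --vocab_file=./albert_config/vocab.txt \
  --non_chinese=True \
  --max_seq_length=512

# Step 2: pre-train on the generated examples.
python run_pretraining.py \
  --input_file=./data/tf_examples.tfrecord \
  --output_dir=./albert_vi_output \
  --do_train=True \
  --bert_config_file=./albert_config/albert_config_base.json
```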

@kewin1807
Author

Okay, thanks for the support. Best repo =)

@kewin1807
Author

I have tried pretraining with my dataset, but the loss is very small while the accuracy does not improve. How can I improve the result?

@geekboood

geekboood commented Nov 24, 2019

@brightmart Can we have a multilingual model for just Chinese and English? In practical scenarios we encounter many English words in app names, song titles, all of Apple's product names, and so on, and Google's multilingual model covers too many languages. Our daily life can't avoid English; you can see Apple trying to use purely Chinese in its products, such as replacing Finder with 访达, which I think is a total mess.
A language model for just Chinese and English could have a huge impact on both research and industry, and many multilingual tasks could benefit from it.
