You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have trained the bpe model with google sentencepiece spm_train tool. when I am trying to build vocabulary with onmt_build_vocab tool, the error is raised:
# zh-ru-translator.yaml## Where the samples will be writtensave_data: run/opennmt_data## Where the vocab(s) will be writtensrc_vocab: /data/translator/code/zh.vocabtgt_vocab: /data/translator/code/ru.vocab# Should match the vocab size for SentencePiecesrc_vocab_size: 30000tgt_vocab_size: 30000share_vocab: False# Corpus opts:data:
corpus_1:
path_src: /data/translator/parallel/zh_train.txtpath_tgt: /data/translator/parallel/ru_train.txtweight: 1transforms: [bpe, filtertoolong]valid:
path_src: /data/translator/parallel/zh_valid.txtpath_tgt: /data/translator/parallel/ru_valid.txttransforms: [bpe, filtertoolong]### Transform related opts:#### Subwordsrc_subword_model: /data/translator/code/zh.modeltgt_subword_model: /data/translator/code/ru.model#### Filtersrc_seq_length: 150tgt_seq_length: 150
does anybody faced this issue before?
PS: Previously I have tried the open net bpe version, it was too slow for me, it run about 2 days without any result.
The text was updated successfully, but these errors were encountered:
I have trained the bpe model with google sentencepiece
spm_train
tool. when I am trying to build vocabulary withonmt_build_vocab
tool, the error is raised:The same with the ru model and then:
This is configuration:
does anybody faced this issue before?
PS: Previously I have tried the open net bpe version, it was too slow for me, it run about 2 days without any result.
The text was updated successfully, but these errors were encountered: