When the training model is successfully restarted, the start-up service fails to find the model. #63

Spring12111 · 2019-07-09T08:53:26Z

Training the custom model succeeded, but an error occurred when I ran the python bin/cakechat_server.py command to start the service.

The operating system is CentOS 7.

nicolas-ivanov · 2019-07-09T09:06:56Z

@Spring12111, the vocabulary you are trying to use has only 26 tokens, which is far too small. Apparently you overwrote the default vocabulary file by running prepare_index_files.py. See the corresponding section of our Readme for how to fix this: https://github.com/lukalabs/cakechat#training-the-model-from-scratch
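
A quick way to confirm that an index file was overwritten is to count its entries. The snippet below is a minimal sketch, assuming the index file is stored as JSON (a list or a mapping of tokens); the function names, the path you pass in, and the min_tokens threshold are illustrative and not part of CakeChat.

```python
import json


def count_index_entries(index_path):
    """Return the number of entries in a JSON index file (list or dict).

    Assumption: the index file is plain JSON. Pass the path to your own file.
    """
    with open(index_path, encoding="utf-8") as f:
        index = json.load(f)
    return len(index)


def looks_too_small(index_path, min_tokens=1000):
    # min_tokens is an arbitrary illustrative threshold, not a CakeChat setting.
    return count_index_entries(index_path) < min_tokens
```

A result as low as the 26 tokens mentioned above would suggest the file was regenerated from a very small corpus rather than being the default index.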

Spring12111 · 2019-07-09T09:15:34Z

Is this due to too little training data?

Spring12111 · 2019-07-09T09:16:14Z

Can adding training data avoid this problem?

nicolas-ivanov · 2019-07-09T09:32:29Z

> Is this due to too little training data?

You got different index files because you apparently ran prepare_index_files.py, which overwrote the original index files. Even if you had training data in abundance, running this script would still overwrite the index files, so you would not be able to use our pretrained model.

If you just want to use our pretrained model without fine-tuning it on your own data, you don't need to run the prepare_index_files.py script. Restore the default index files by following the link I sent you in the previous message, then run the server with our pretrained model.

If you do want to fine-tune our model or train your own from scratch, please read the corresponding sections of our Readme carefully: https://github.com/lukalabs/cakechat#training-the-model

Spring12111 · 2019-07-09T10:18:04Z

I followed every step in the Readme section on training the model from scratch, including adding the training data to the file.

Spring12111 · 2019-07-09T10:19:46Z

Is this still a pretrained model?

nicolas-ivanov · 2019-07-10T08:07:16Z

It seems I found the problem: the server is trying to use a reverse model for the sampling_reranking mode of generating answers, and I assume you didn't train one, since it's not mentioned in this section of our Readme.

Option 1: change sampling_reranking to sampling here: https://github.com/lukalabs/cakechat/blob/master/cakechat/api/config.py#L4

Option 2: run python tools/train.py -r to train the reverse model.

In either case, please note that you'll need a larger training set with a larger vocabulary to get any meaningful results after training.
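
The choice between the two options can be sketched as a small check: use sampling_reranking only when a reverse model file is present, and fall back to sampling otherwise. This is a hypothetical helper for illustration, not CakeChat's actual code; the function name, the mode strings as plain strings, and the reverse_model_path argument are all assumptions.

```python
import os


def choose_prediction_mode(requested_mode, reverse_model_path):
    """Fall back to plain sampling when no reverse model file exists.

    Hypothetical helper: the mode names come from the discussion above,
    and reverse_model_path is whatever path your reverse model is saved to.
    """
    needs_reverse_model = requested_mode == "sampling_reranking"
    if needs_reverse_model and not os.path.isfile(reverse_model_path):
        return "sampling"
    return requested_mode
```

With a check like this, a server started before the reverse model has been trained would still answer in sampling mode instead of failing to find the model.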

stale · 2019-07-17T08:19:55Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale bot added the staled label Jul 17, 2019

stale bot closed this as completed Jul 27, 2019
