Retraining on floydhub.com is not available #20
Comments
@cnlinxi sorry for the inconvenience. I thought using floydhub would be sustainable, but it turns out to be very costly in the long run. So I've decided to cancel my subscription, which means the datasets there are lost. I'll get back to you regarding the corpus. Would you mind sharing a bit about what you plan to do with the code?
@heytitle Sorry for the late reply. I hope to use this model to segment Thai words, and to improve it. I also hope to provide a good Thai text regularization method.
@cnlinxi sorry again for my response. You can find the data at […]. Please unzip it and make sure the root directory is at […]. Only the first two are relevant for training. Before running the training command below, make sure that you have the […]
@heytitle thank you very much. I have trained this model on BEST 2010. Great work :)
Are words split by "~" in the "best-syllable-tokenized" dataset?
Can you provide the corpus? We cannot retrain this model on floydhub.com as described in the README. Thanks a lot.