-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tweet-lm.gz #1
Comments
The language models depend on the type of data that you'll be working with, and should be trained using SRI-LM. It is really easy to train a new language model using SRI-LM, and also very fast. If you want to try out the tool, you can use these two language models that I trained on LA Times English and a sample of tweets that I was working with: i) https://dl.dropboxusercontent.com/u/2424861/latimes-lm.gz |
I would like to try out your system, so do you mind if i ask you to share your two language models again. A heap of thanks |
I uploaded these two files. latimes-lm.gz is split into two files to come in below the 50Mb limit. Simply join them using cat latimes-lm.gz.part* > latimes-lm.gz. |
thanks |
I cannot find the tweet-lm file in the repo, could you add it? or explain how it can be generated? thanks
The text was updated successfully, but these errors were encountered: