GitHub - colinmorris/lm1b-notebook: Various scripts used while playing around with Google Brain's billion word language model

Hacky scripts used while hacking around with the Google Brain billion word language model.

The lm_1b directory is copied straight from the tensorflow models repo, plus some tweaks to their python scripts.

The ineptly named output directory contains a bunch of visualization scripts. To run any of them, you'll need to run dump.sh first (and in order to do that, you'll need to download the data files and install the prerequisites listed in tensorflow/models/lm_1b.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
lm_1b		lm_1b
output		output
README.md		README.md
dump.sh		dump.sh
dump_brown.py		dump_brown.py
dump_words.sh		dump_words.sh
nextwords.sh		nextwords.sh
plot_ppx_per_corpus.py		plot_ppx_per_corpus.py
ppx_to_js.py		ppx_to_js.py
ppx_to_json.py		ppx_to_json.py
prefix.sh		prefix.sh
sentence_ppx.sh		sentence_ppx.sh
zip_sentence_ppx.py		zip_sentence_ppx.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

colinmorris/lm1b-notebook

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages