Various scripts used while playing around with Google Brain's billion word language model
lm_1b
output
README.md
dump.sh
dump_brown.py
dump_words.sh
nextwords.sh
plot_ppx_per_corpus.py
ppx_to_js.py
ppx_to_json.py
prefix.sh
sentence_ppx.sh
zip_sentence_ppx.py

README.md

Hacky scripts written while experimenting with the Google Brain billion word language model.

The lm_1b directory is copied straight from the TensorFlow models repo, plus some tweaks to their Python scripts.

The ineptly named output directory contains a bunch of visualization scripts. To run any of them, you'll need to run dump.sh first (and in order to do that, you'll need to download the data files and install the prerequisites listed in tensorflow/models/lm_1b).
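
Since the visualization scripts depend on dump.sh, which in turn depends on the downloaded data, a quick pre-flight check can save a failed run. The following is only a rough sketch, not part of this repo: `missing_paths` is a hypothetical helper, and the `data` directory name is an assumption about where you put the downloaded model files. Only `dump.sh` and `lm_1b` are names taken from the repo itself.

```python
from pathlib import Path


def missing_paths(repo_root, required):
    """Return the entries of `required` that do not exist under `repo_root`."""
    root = Path(repo_root)
    return [rel for rel in required if not (root / rel).exists()]


if __name__ == "__main__":
    # dump.sh and lm_1b come from this repo; "data" is a hypothetical
    # location for the downloaded model files, so adjust it to your setup.
    required = ["dump.sh", "lm_1b", "data"]
    missing = missing_paths(".", required)
    if missing:
        print("Missing before dump.sh can run:", ", ".join(missing))
    else:
        print("Everything found; run dump.sh, then the scripts in output.")
```

The check only looks at the file system. It does not verify that the TensorFlow prerequisites are installed, which still has to be done by following the lm_1b instructions.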