GitHub - benkrause/human-level-text-prediction

Code used for blog post Achieving Human-level text prediction.

Code is a mess, but this at least should make it possible to replicate what I did.

Requirements: python 3, chainer, cupy

Instructions for use:

Run the following unix command to recover the original model file

cat model_part.aa model_part.ab > model

download the text8 dataset from http://mattmahoney.net/dc/text8.zip, put the unzipped text8 file in the project directory
Run the main file simply with:

python main.py

Takes an hour or so to run, spits out the entropy of the 75 characters of text from the book "Jefferson the Virginian" to allow for direct comparison to human prediction from this classic paper. Default settings should give a cross-entropy of 1.31 bits/character.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commits
LICENSE		LICENSE
README.md		README.md
WN.py		WN.py
jefferson.txt		jefferson.txt
mLSTMWN_ch3.py		mLSTMWN_ch3.py
main.py		main.py
model_part.aa		model_part.aa
model_part.ab		model_part.ab

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENSE

LICENSE

README.md

README.md

WN.py

WN.py

jefferson.txt

jefferson.txt

mLSTMWN_ch3.py

mLSTMWN_ch3.py

main.py

main.py

model_part.aa

model_part.aa

model_part.ab

model_part.ab

Repository files navigation

Code used for blog post Achieving Human-level text prediction.

Requirements: python 3, chainer, cupy

Instructions for use:

About

Releases

Packages

Languages

License

benkrause/human-level-text-prediction

Folders and files

Latest commit

History

Repository files navigation

Code used for blog post Achieving Human-level text prediction.

Requirements: python 3, chainer, cupy

Instructions for use:

About

Resources

License

Stars

Watchers

Forks

Languages