Skip to content

ggerganov/ggwords

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ggwords

Generate n-gram statistics by processing the contents of English books/texts.

Usage

git clone https://github.com/ggerganov/ggwords
cd ggwords
mkdir build
cd build
cmake ..
make -j4

./bin/analyze /path/to/metadata/books.txt /path/to/books/text

Sample data

The data in ./data was generated using https://github.com/pgcorpus/gutenberg

About

Generate language n-gram statistics

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published