
BERT Tokenizer & Word2Vec Assisted Markov n-gram Language Model using Node.js


raikutro/kusari



A Trigram Markov Language Model Assisted by a BERT WordPiece Tokenizer by 🤗 and Word2Vec by Google.
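To make the trigram Markov idea concrete, here is a minimal sketch in Node.js. It is not Kusari's actual code: the function names (`tokenize`, `trainTrigrams`, `sampleNext`, `generate`) are hypothetical, and plain whitespace splitting stands in for the WordPiece tokenizer. Word2Vec is left out entirely. The model counts which token follows each pair of tokens, then generates text by repeatedly sampling the next token in proportion to those counts.

```javascript
// Minimal trigram Markov sketch (hypothetical; not Kusari's implementation).
// Assumption: whitespace tokenization replaces the BERT WordPiece tokenizer.

function tokenize(text) {
  return text.toLowerCase().split(/\s+/).filter(Boolean);
}

// Build a table mapping "w1 w2" -> Map(nextToken -> count).
function trainTrigrams(corpus) {
  const table = new Map();
  for (const sentence of corpus) {
    const tokens = tokenize(sentence);
    for (let i = 0; i + 2 < tokens.length; i++) {
      const key = `${tokens[i]} ${tokens[i + 1]}`;
      if (!table.has(key)) table.set(key, new Map());
      const followers = table.get(key);
      const next = tokens[i + 2];
      followers.set(next, (followers.get(next) || 0) + 1);
    }
  }
  return table;
}

// Pick a follower with probability proportional to its count.
// `rng` is injectable so that runs can be made reproducible.
function sampleNext(followers, rng = Math.random) {
  let total = 0;
  for (const count of followers.values()) total += count;
  let remaining = rng() * total;
  let last = null;
  for (const [token, count] of followers) {
    last = token;
    remaining -= count;
    if (remaining < 0) return token;
  }
  return last; // fallback for floating-point edge cases
}

// Extend a two-token seed until no continuation exists or maxTokens is reached.
function generate(table, first, second, maxTokens = 20, rng = Math.random) {
  const output = [first, second];
  while (output.length < maxTokens) {
    const key = `${output[output.length - 2]} ${output[output.length - 1]}`;
    const followers = table.get(key);
    if (!followers) break;
    output.push(sampleNext(followers, rng));
  }
  return output.join(" ");
}
```

For example, `generate(trainTrigrams(["the cat sat on the mat"]), "the", "cat")` walks the table from the seed pair until it reaches a pair with no recorded follower. Keying on the pair of preceding tokens is what makes this a trigram (second-order) model.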


Start by running npm run train. Look at the comments inside train_models.js for details on training.

Run npm run test to test your model. Look inside test/sandbox.js for details on running.

Its output is underwhelming, to say the least.
