Skip to content

Distributing wordfreq-rs models (v1)

Compare
Choose a tag to compare
@kampersanda kampersanda released this 04 Jun 11:14
· 13 commits to main since this release
daebd0d

Overview

This is a release point to distribute wordfreq-rs models via Assets. (Please note that the code at this tag is still under construction.)

The model files {large,small}_xx.txt (where xx is language code) describe words and their frequencies in the text format:

<word1> <freq1>
<word2> <freq2>
<word3> <freq3>
...

Credits

Copyright 2022 Robyn Speer
Copyright 2023 Shunsuke Kanda

They are obtained by extracting the contents from the original model files {large,small}_xx.msgpack.gz distributed at wordfreq v3.0.2 (https://doi.org/10.5281/zenodo.7199437). Our files are compressed in zstandard.

The model files are licensed under CC BY-SA 4.0. Also, the original sources are listed, following the NOTICE:

If you redistribute the models, please specify these credits as well.