A mechanically generated sorted list of Finnish words and given names.
For most uses, you should probably only look at:
- words/words.txt – a list of Finnish words
- names/*.tsv – Finnish names in frequency order
- names/*.txt – Finnish names in alphabetical order
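As a rough illustration of how the files listed above might be consumed, here is a minimal Python 3 sketch. It assumes the files are UTF-8 text with one entry per line, and that the .tsv files are tab-separated; the column layout of the .tsv files is not described here, so the sketch returns raw fields without interpreting them. The helper names load_words and load_tsv_rows are hypothetical and not part of this repository.

```python
from pathlib import Path


def load_words(path):
    """Return the non-empty lines of a text file as a list, in file order.

    Assumption: the file is UTF-8 with one entry per line.
    """
    text = Path(path).read_text(encoding="utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]


def load_tsv_rows(path):
    """Return each non-empty line split on tab characters.

    Assumption: the file is UTF-8 and tab-separated. The meaning of each
    column is not assumed; callers get the raw string fields.
    """
    text = Path(path).read_text(encoding="utf-8")
    return [line.split("\t") for line in text.splitlines() if line.strip()]
```

For example, load_words("words/words.txt") would give the word list as a Python list, and set(load_words("words/words.txt")) gives fast membership tests.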
To regenerate the data, run make on a reasonably UNIXy system.
To regenerate the names data, you will need the openpyxl module for Python 3.
- The KOTUS word list is licensed under GNU LGPL / EUPL 1.1 / CC-BY 3.0.
- The Joukahainen corpus is licensed under GNU GPL 2.0 or newer.
- The Finnish names corpus is licensed under CC-BY 4.0.
- The glue code is licensed MIT.