

SemFi is an online tool for browsing semantic data for Finnish. Try SemFi online now!

The Database

The SemFi data has been extracted from The Finnish Internet Parsebank. The data has been used in tools such as Poem Machine, which generates Finnish poetry automatically.

⬇️ Download the database (DOI) to your own computer. It is released under CC BY-SA 4.0. © 2015-2017 Mika Hämäläinen

The Contents of the Database

The database contains a noun table and a verb table, each listing syntactically related words together with their frequencies. A separate frequencies table contains the frequencies of all word forms in the corpus.

Finally, a verse structure table contains the syntactic structures of Finnish poem verses. The roughly 5,000 poems analyzed are the ones released on Wikisource.
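To illustrate how tables of this kind might be queried, here is a minimal sketch in Python. It does not use the real SemFi file: the file format and schema are not described here, so the table names (`nouns`, `frequencies`), the column names, the relation labels, and all the numbers below are hypothetical placeholders built in an in-memory SQLite database.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a HYPOTHETICAL, simplified schema.

    The real SemFi tables and columns may look quite different.
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE nouns (
            word TEXT,
            related_word TEXT,
            relation TEXT,
            frequency INTEGER
        );
        CREATE TABLE frequencies (
            word_form TEXT PRIMARY KEY,
            frequency INTEGER
        );
        """
    )
    # Invented toy rows, for illustration only (not taken from the data).
    conn.executemany(
        "INSERT INTO nouns VALUES (?, ?, ?, ?)",
        [
            ("koira", "haukkua", "subject_of", 120),
            ("koira", "iso", "modified_by", 85),
            ("kissa", "naukua", "subject_of", 40),
        ],
    )
    conn.executemany(
        "INSERT INTO frequencies VALUES (?, ?)",
        [("koira", 5000), ("kissa", 4200)],
    )
    conn.commit()
    return conn


def related_words(conn, noun, limit=10):
    """Return (related_word, relation, frequency) rows, most frequent first."""
    cursor = conn.execute(
        "SELECT related_word, relation, frequency FROM nouns "
        "WHERE word = ? ORDER BY frequency DESC LIMIT ?",
        (noun, limit),
    )
    return cursor.fetchall()


def word_form_frequency(conn, word_form):
    """Return the corpus frequency of a word form, or 0 if it is not listed."""
    row = conn.execute(
        "SELECT frequency FROM frequencies WHERE word_form = ?",
        (word_form,),
    ).fetchone()
    return row[0] if row else 0


if __name__ == "__main__":
    conn = build_demo_db()
    for related, relation, freq in related_words(conn, "koira"):
        print(related, relation, freq)
    print(word_form_frequency(conn, "koira"))
```

With the real data, you would open the downloaded file instead of calling `build_demo_db()` and adjust the queries to the actual table and column names.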

✉️ In case of questions, contact me.


In case you use the data in a scientific project, please consider citing it as follows:

Hämäläinen, Mika (2018). Extracting a Semantic Database with Syntactic Relations for Finnish to Boost Resources for Endangered Uralic Languages. In The Proceedings of Logic and Engineering of Natural Language Semantics 15 (LENLS15).

Need NLP solutions for your business?


My company, Rootroo, offers consulting related to multilingual NLP tasks. We have a strong academic background in state-of-the-art AI solutions for every NLP need. Just contact us, we won't bite.





