# Wordnet

The Estonian WordNet API provides a way to query the Estonian WordNet. WordNet is a network of synsets: each synset is a set of synonymous words, and synsets are connected to other synsets via semantic relations.

First, let's import the module and create a WordNet object:

In [1]:
from estnltk.wordnet import Wordnet

In [2]:
wn = Wordnet()

## Synsets

The most common use of the API is querying synsets, which can be done in several ways. The first is to look up synsets by name:

In [3]:
wn['laulma']

["Synset('laulma.v.1')", "Synset('laulma.v.2')"]

Synsets can also be queried by specifying a part of speech (pos) in addition to the name. The two forms below are equivalent:

In [4]:
wn['laulma', 'v']

["Synset('laulma.v.1')", "Synset('laulma.v.2')"]

In [5]:
wn[('laulma', 'v')]

["Synset('laulma.v.1')", "Synset('laulma.v.2')"]

The queries above return a list of synsets. A single synset can also be selected by its position in that list, counted from 1. For example, to get only the second synset named 'laulma', pass the position after the name; this returns a synset object rather than a list:

In [6]:
wn['laulma', 2]

"Synset('laulma.v.2')"
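To make the three key forms concrete, here is a minimal sketch of how such indexing could be dispatched. `ToyWordnet` and its tuple-based synsets are hypothetical stand-ins written for this illustration, not EstNLTK code, and the toy data is invented. The one point that holds for any Python class is that `wn['laulma', 'v']` and `wn[('laulma', 'v')]` are the same call, because both pass a single tuple as the key.

```python
class ToyWordnet:
    """Hypothetical stand-in that mimics the key forms shown above."""

    def __init__(self, synsets):
        # synsets: list of (name, pos, sense) tuples in a fixed order
        self._synsets = synsets

    def __getitem__(self, key):
        # A bare string selects every synset with that name.
        if isinstance(key, str):
            return [s for s in self._synsets if s[0] == key]
        # Otherwise the key is a 2-tuple: (name, pos) or (name, position).
        name, second = key
        matches = [s for s in self._synsets if s[0] == name]
        if isinstance(second, int):
            # Assumption taken from the example: positions are 1-based.
            return matches[second - 1]
        return [s for s in matches if s[1] == second]


# Invented toy data, only for demonstration.
wn_toy = ToyWordnet([("laulma", "v", 1), ("laulma", "v", 2), ("laul", "n", 1)])

print(wn_toy["laulma"])        # list of both 'laulma' entries
print(wn_toy["laulma", "v"])   # same list, filtered by pos
print(wn_toy["laulma", 2])     # a single entry, not a list
```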

A synset's details, such as its name and part of speech, are available as attributes:

In [8]:
synset = wn['laulma'][0]
print(synset.name)
print(synset.pos)

laulma.v.1
v
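The printed name appears to follow the pattern `word.pos.sense`. Assuming that pattern holds (it is inferred from the outputs in this document, not from a documented guarantee), a small helper can split a name into its parts. `split_synset_name` is a hypothetical helper, not part of the API.

```python
def split_synset_name(name):
    """Split 'word.pos.sense' from the right, so dots inside the word survive."""
    word, pos, sense = name.rsplit(".", 2)
    return word, pos, int(sense)


print(split_synset_name("laulma.v.1"))  # ('laulma', 'v', 1)
```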


## Relations

We can also query related synsets. The most common relations have dedicated methods:

In [9]:
synset.hypernyms()

["Synset('häälitsema.v.1')"]

In [10]:
synset.hyponyms()

["Synset('ümisema.v.2')",
 "Synset('üles_laulma.v.1')",
 "Synset('helletama.v.1')",
 "Synset('joiguma.v.1')",
 "Synset('kaasitama.v.1')",
 "Synset('kõõrutama.v.2')",
 "Synset('leelotama.v.1')",
 "Synset('joodeldama.v.1')",
 "Synset('trallitama.v.2')"]

In [11]:
synset.holonyms()

[]

In [12]:
synset.meronyms()

[]

In [13]:
synset.member_holonyms()

[]

Other, more specific relations can be queried with a generic method that takes the relation name:

In [14]:
synset.get_related_synset("involved_agent")

["Synset('laulja.n.1')"]
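Relation methods can be chained to walk the network. The sketch below follows the first hypernym repeatedly to collect a path towards the top of the hierarchy. `hypernym_chain` is a hypothetical helper that relies only on a `hypernyms()` method returning a list, as shown above. `StubSynset` is an invented stand-in so the example runs without the library; its three entries mirror the 'laulma' outputs in this section.

```python
def hypernym_chain(synset, max_steps=50):
    """Follow the first hypernym up to max_steps times; return visited synsets."""
    chain = [synset]
    for _ in range(max_steps):  # step limit guards against cyclic data
        parents = chain[-1].hypernyms()
        if not parents:
            break  # reached a synset with no hypernym
        chain.append(parents[0])
    return chain


class StubSynset:
    """Hypothetical stand-in exposing only .name and .hypernyms()."""

    def __init__(self, name, parents=()):
        self.name = name
        self._parents = list(parents)

    def hypernyms(self):
        return self._parents


top = StubSynset("häälitsema.v.1")
middle = StubSynset("laulma.v.1", [top])
leaf = StubSynset("ümisema.v.2", [middle])

print([s.name for s in hypernym_chain(leaf)])
# ['ümisema.v.2', 'laulma.v.1', 'häälitsema.v.1']
```

Taking only the first hypernym is a simplification: a synset with several hypernyms has several possible paths, and this sketch reports just one of them.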

## Similarities

We can measure the distance or similarity between two synsets in several ways. Three similarity measures are provided: path, Leacock-Chodorow and Wu-Palmer similarity:

In [17]:
synset = wn['aprill'][0]
target_synset = wn['mai'][0]

In [18]:
synset.path_similarity(target_synset)

0.3333333333333333

In [19]:
synset.lch_similarity(target_synset)

2.159484249353372

In [20]:
synset.wup_similarity(target_synset)

0.8

We can also find the closest common ancestor of two synsets in the hypernym hierarchy:

In [21]:
synset.lowest_common_hypernyms(target_synset)

["Synset('kuu.n.1')"]
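The values above can be reproduced by hand with the textbook definitions of these measures. The sketch below applies them to a toy hypernym tree and is not the library's implementation. Only 'kuu', 'aprill' and 'mai' come from the example; the upper levels of the tree are invented. The maximum depth of 13 passed to the Leacock-Chodorow function is an assumption inferred from the value 2.159..., not a documented constant.

```python
import math

# Toy hypernym tree as child -> parent. Levels above 'kuu' are invented.
PARENT = {
    "aprill": "kuu",
    "mai": "kuu",
    "kuu": "calendar_unit",
    "calendar_unit": "time_period",
    "time_period": "entity",
    "entity": None,
}


def ancestors(node):
    """Return the path from node up to the root, node first."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def depth(node):
    """Number of nodes on the path to the root; the root has depth 1."""
    return len(ancestors(node))


def lowest_common_hypernym(a, b):
    """First node on b's path to the root that also lies on a's path."""
    seen = set(ancestors(a))
    for node in ancestors(b):
        if node in seen:
            return node
    return None


def distance(a, b):
    """Number of edges on the shortest path through the common ancestor."""
    lch = lowest_common_hypernym(a, b)
    return (depth(a) - depth(lch)) + (depth(b) - depth(lch))


def path_similarity(a, b):
    return 1 / (distance(a, b) + 1)


def lch_similarity(a, b, max_depth):
    # Leacock-Chodorow: -log((distance + 1) / (2 * maximum taxonomy depth))
    return -math.log((distance(a, b) + 1) / (2 * max_depth))


def wup_similarity(a, b):
    # Wu-Palmer: 2 * depth(common ancestor) / (depth(a) + depth(b))
    lch = lowest_common_hypernym(a, b)
    return 2 * depth(lch) / (depth(a) + depth(b))


print(lowest_common_hypernym("aprill", "mai"))        # kuu
print(path_similarity("aprill", "mai"))               # 1 / (2 + 1)
print(lch_similarity("aprill", "mai", max_depth=13))  # -log(3 / 26)
print(wup_similarity("aprill", "mai"))                # 2 * 4 / (5 + 5)
```

In this toy tree 'aprill' and 'mai' are siblings under 'kuu', so the distance between them is 2. That is consistent with the path similarity of 1/3 and the Wu-Palmer value of 0.8 reported above.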