OpenWordnet-PT: an open Brazilian Wordnet

How to use it?

  • You can browse or search the data in our web interface.

  • You can download the RDF files and load them with any RDF library available for your preferred programming language.

  • You can query the data using our SPARQL Endpoint.
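
As an illustration of the download-and-load option, the following is a minimal sketch that streams a gzip-compressed N-Triples file using only the Python standard library. The function names `iter_triples` and `predicate_counts` are hypothetical, and the line pattern is a simplification that does not unescape literals or validate terms; a full RDF library is the better choice for real work.

```python
import gzip
import io
import re
from collections import Counter
from typing import BinaryIO, Iterator, Tuple, Union

Triple = Tuple[str, str, str]

# Simplifying assumption: subject and predicate contain no whitespace, and the
# object is everything up to the terminating " ." of the line.
_TRIPLE_RE = re.compile(r"^(\S+)\s+(\S+)\s+(.+?)\s*\.\s*$")


def iter_triples(source: Union[str, BinaryIO]) -> Iterator[Triple]:
    """Yield (subject, predicate, object) strings from a gzipped N-Triples source.

    `source` may be a file name such as "own-pt.nt.gz" or a binary file object.
    """
    with gzip.open(source, mode="rt", encoding="utf-8") as lines:
        for line in lines:
            line = line.strip()
            if not line or line.startswith("#"):
                continue  # skip blank lines and comments
            match = _TRIPLE_RE.match(line)
            if match:
                yield match.group(1), match.group(2), match.group(3)


def predicate_counts(source: Union[str, BinaryIO]) -> Counter:
    """Count how many triples use each predicate."""
    return Counter(predicate for _, predicate, _ in iter_triples(source))
```

With the repository files downloaded, `predicate_counts("own-pt.nt.gz")` would give a quick overview of which properties the Portuguese file actually uses.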

About the RDF

  • The RDF model is based on http://www.w3.org/TR/wordnet-rdf/ and http://semanticweb.cs.vu.nl/lod/wn30/, with some modifications: in addition to the database files, other files from the Princeton distribution were used, and more properties and classes are included.

  • Lisp code used to create the RDF files is available.

  • Since we re-use the Princeton WordNet (PWN) synset identifiers, we do not need to repeat all the relationships already listed in wordnet-en.nt.gz. The own-pt.nt.gz file lists only the new relations that we have added for Portuguese.
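
Because the Portuguese file only adds to what the English file already states, a complete view of a synset comes from taking the union of both files. The sketch below illustrates that idea on in-memory triples. It assumes that both files use the same subject IRI for a given synset; the IRIs and property names are placeholders under example.org, not the project's actual vocabulary, and `merge_by_subject` is a hypothetical helper.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple

Triple = Tuple[str, str, str]


def merge_by_subject(*sources: Iterable[Triple]) -> Dict[str, Set[Tuple[str, str]]]:
    """Union triples from several sources, grouped by subject."""
    merged: Dict[str, Set[Tuple[str, str]]] = defaultdict(set)
    for source in sources:
        for subject, predicate, obj in source:
            merged[subject].add((predicate, obj))
    return dict(merged)


# Placeholder data standing in for triples read from wordnet-en.nt.gz ...
english = [
    ("<http://example.org/synset-1>", "<http://example.org/relatedTo>", "<http://example.org/synset-2>"),
    ("<http://example.org/synset-1>", "<http://example.org/gloss>", '"a dwelling"@en'),
]
# ... and from own-pt.nt.gz, which only adds the Portuguese information.
portuguese = [
    ("<http://example.org/synset-1>", "<http://example.org/word>", '"casa"@pt'),
]

merged = merge_by_subject(english, portuguese)
```

Here `merged["<http://example.org/synset-1>"]` holds the English relation and gloss together with the Portuguese word, without the relation having been repeated in the Portuguese data.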

Team

Contributors

Related projects

How to cite?

See http://arademaker.github.io/bibliography/coling2012.html

How to contribute?

Please use the GitHub issue tracker (https://github.com/arademaker/wordnet-br/issues) or use the web interface to make suggestions about the data.

History of the Project

The initial version was generated by combining the following data:

  • Princeton WordNet 3.0 was used to obtain English glosses and English terms for synset IDs.

  • The unreleased 2010-12 version of UWN (http://www.mpi-inf.mpg.de/yago-naga/uwn/) and MENTA provided candidate terms in Portuguese, candidate glosses in Portuguese (from Wikipedia), and candidate terms in Spanish.

  • The EuroWordNet base concept list (5000_bc.xml) provided the base concept numbers. The original file was mapped from WordNet 2.0 to 3.0 using the mappings from WN-Map. When multiple mappings for a WordNet 2.0 synset existed, all possible WordNet 3.0 synsets were kept. Hence, there may be multiple entries with the same base concept number.
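
The one-to-many remapping of base concepts described in the last item can be sketched as follows. This is only an illustration of the rule "keep every WordNet 3.0 target": the function name, the data layout, and the synset identifiers are made up for the example and do not reflect the actual 5000_bc.xml or WN-Map file formats.

```python
from typing import Dict, Iterable, List, Tuple


def remap_base_concepts(
    base_concepts: Iterable[Tuple[int, str]],
    wn20_to_wn30: Dict[str, List[str]],
) -> List[Tuple[int, str]]:
    """Map (base concept number, WN 2.0 synset) pairs to WN 3.0 synsets.

    Every WN 3.0 target of a WN 2.0 synset is kept, so one base concept
    number can appear in several output entries. Synsets without any
    mapping produce no output.
    """
    remapped: List[Tuple[int, str]] = []
    for number, wn20_id in base_concepts:
        for wn30_id in wn20_to_wn30.get(wn20_id, []):
            remapped.append((number, wn30_id))
    return remapped


# Made-up identifiers, for illustration only.
base_concepts = [(1, "wn20-a"), (2, "wn20-b")]
wn20_to_wn30 = {
    "wn20-a": ["wn30-x"],            # one-to-one mapping
    "wn20-b": ["wn30-y", "wn30-z"],  # one-to-many mapping: both targets kept
}

entries = remap_base_concepts(base_concepts, wn20_to_wn30)
```

In this example base concept number 2 ends up in two entries, which is the duplication the list item describes.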

License

openWordnet-PT by Escola de Matemática Aplicada, Fundação Getulio Vargas is licensed under a Creative Commons Attribution 4.0 International License.
Based on a work at http://github.com/own-pt/openWordnet-PT.
Permissions beyond the scope of this license may be available at http://github.com/own-pt/openWordnet-PT.

Also please consult the LICENSE file.

Note that the wordnet-en.rdf file is simply a conversion of Princeton WordNet 3.0 to the RDF format. Princeton WordNet 3.0 itself is distributed under the license at http://wordnet.princeton.edu/wordnet/license/.