OpenWordnet-PT: an open Brazilian Wordnet
Awk
Latest commit 8e8aed3 Mar 22, 2017 @fcbr fcbr added temporary file.

README.md

OpenWordnet-PT: An Open Brazilian Wordnet

How to use it?

  • You can browse or search the data in our web interface.

  • You can download the RDF files and load it with any RDF library available for your preferable programming language.

  • You can query the data using our SPARQL Endpoint.

About the RDF

  • Based on http://www.w3.org/TR/wordnet-rdf/ and http://semanticweb.cs.vu.nl/lod/wn30/ with some modifications. More files from the Princeton distribution were considered and not only the database files. More properties and classes are included.

  • Lisp code used to create the RDF files is available.

  • Since we re-use the Princeton WordNet (PWN) synset identifiers, we do not need to repeat all the relationships already listed in wordnet-en.nt.gz. In the own-pt.nt.gz we list simply the new relations that we have added for Portuguese.

Team

Contributors

Related projects

How to cite?

See http://arademaker.github.io/bibliography/coling2012.html

How to contribute?

Please use the GitHub issue tracker (https://github.com/arademaker/wordnet-br/issues) or use the web interface to make suggestions about the data.

History of the Project

The initial version was generated by combining the following data:

  • Princeton WordNet 3.0 was used to obtain English glosses and English terms for synset IDs.

  • The unreleased 2010-12 version of [UWN] (http://www.mpi-inf.mpg.de/yago-naga/uwn/) and MENTA provided candidate terms in Portuguese, candidate glosses in Portuguese (from Wikipedia), and candidate terms in Spanish.

  • The EuroWordNet base concept list (5000_bc.xml) provides the base concept numbers. The original file was mapped from WordNet 2.0 to 3.0 using the mappings from WN-Map. When multiple mappings for a WordNet 2.0 synset existed, all possible WordNet 3.0 synsets were kept. Hence, there may be multiple entries with the same base concept number.

License

Creative Commons License
openWordnet-PT by Escola de Matemática Aplicada, Fundação Getulio Vargas is licensed under a Creative Commons Attribution 4.0 International License.
Based on a work at http://github.com/own-pt/openWordnet-PT.
Permissions beyond the scope of this license may be available at http://github.com/own-pt/openWordnet-PT.

Also please consult the LICENSE file.

Note that the wordnet-en.rdf file is based on Princeton WordNet 3.0, being simply its conversion to the RDF format. The Princeton WordNet 3.0 is distributed under the license http://wordnet.princeton.edu/wordnet/license/.