OpenWordnet-PT: An Open Access Wordnet for Portuguese
How to use it?
You can browse or search the data in our web interface.
You can download the RDF files and load it with any RDF library available for your preferable programming language.
You can query the data using our SPARQL Endpoint.
About the RDF
Based on http://www.w3.org/TR/wordnet-rdf/ and http://semanticweb.cs.vu.nl/lod/wn30/ with some modifications. More files from the Princeton distribution were considered and not only the database files. More properties and classes are included.
Lisp code used to create the RDF files is available.
Since we re-use the Princeton WordNet (PWN) synset identifiers, we do not need to repeat all the relationships already listed in wordnet-en.nt.gz. In the own-pt.nt.gz we list simply the new relations that we have added for Portuguese.
http://github.com/own-pt/cl-wnbrowser/ a browser and search interface for our wordnet powered by Common Lisp and Apache Solr.
http://compling.hss.ntu.edu.sg/omw/ a browser and search interface for all open wordnets. Our OpenWordnet-PT is the Portuguese Wordnet available on this site.
http://nlp.lsi.upc.edu/freeling/ See the freeling directory for information and the OpenWordnet-PT version in the format used by FreeLing.
http://ontopt.dei.uc.pt Another wordnet-like ontology in Portuguese. It has incorporated OpenWordnet-PT.
How to cite?
How to contribute?
History of the Project
The initial version was generated by combining the following data:
Princeton WordNet 3.0 was used to obtain English glosses and English terms for synset IDs.
The unreleased 2010-12 version of [UWN] (http://www.mpi-inf.mpg.de/yago-naga/uwn/) and MENTA provided candidate terms in Portuguese, candidate glosses in Portuguese (from Wikipedia), and candidate terms in Spanish.
The EuroWordNet base concept list (
5000_bc.xml) provides the base concept numbers. The original file was mapped from WordNet 2.0 to 3.0 using the mappings from WN-Map. When multiple mappings for a WordNet 2.0 synset existed, all possible WordNet 3.0 synsets were kept. Hence, there may be multiple entries with the same base concept number.
openWordnet-PT by Escola de Matemática Aplicada, Fundação Getulio Vargas is licensed under a Creative Commons Attribution 4.0 International License.
Based on a work at http://github.com/own-pt/openWordnet-PT.
Permissions beyond the scope of this license may be available at http://github.com/own-pt/openWordnet-PT.
Also please consult the LICENSE file.
Note that the wordnet-en.rdf file is based on Princeton WordNet 3.0, being simply its conversion to the RDF format. The Princeton WordNet 3.0 is distributed under the license http://wordnet.princeton.edu/wordnet/license/.