Copying and sharing ConceptNet

Rob Speer edited this page Aug 26, 2014 · 6 revisions

ConceptNet 5 comes largely from the hard work of hundreds of thousands of people who gave their time and knowledge for free. So ConceptNet is free as well.

Data license

The complete data in ConceptNet is available under the Creative Commons Attribution-ShareAlike 4.0 license.

Additionally, because we track the provenance of the data, you may extract and use a subset of its data under the Creative Commons Attribution 4.0 license.

To give credit to ConceptNet, we suggest this text:

This work includes data from ConceptNet 5, which was compiled by the
Commonsense Computing Initiative. ConceptNet 5 is freely available under
the Creative Commons Attribution-ShareAlike license (CC BY SA 3.0) from
http://conceptnet5.media.mit.edu.

The included data was created by contributors to Commonsense Computing
projects, contributors to Wikimedia projects, Games with a Purpose,
Princeton University's WordNet, DBPedia, OpenCyc, and Umbel.

Credits and acknowledgements

ConceptNet has been developed by:

  • The MIT Media Lab, through various groups at different times:

    • Commonsense Computing
    • Software Agents
    • Digital Intuition
  • The Commonsense Computing Initiative, a worldwide collaboration with contributions from:

    • National Taiwan University
    • Universidade Federal de São Carlos
    • Hokkaido University
    • Tilburg University
    • Nihon Unisys Labs
    • Dentsu Inc.
  • Luminoso Technologies, Inc.

Significant amounts of data were imported from:

  • WordNet, a project of Princeton University
  • Wikipedia and Wiktionary, collaborative projects of the Wikimedia Foundation
  • Luis von Ahn's "Games with a Purpose"
  • DBPedia
  • Umbel, a project of Structured Dynamics LLC

Here is a short, incomplete list of people who have made significant contributions to the development of ConceptNet as a data resource, roughly in order of appearance:

  • Push Singh
  • Catherine Havasi
  • Hugo Liu
  • Hyemin Chung
  • Rob Speer
  • Ken Arnold
  • Yen-Ling Kuo

Licenses for included resources

Commonsense Computing

The Commonsense Computing project originated at the MIT Media Lab and expanded worldwide. Tens of thousands of contributors have taken some time to teach facts to computers. Their pseudonyms can be found in the "sources" list found in ConceptNet's raw data and in its API.

Games with a Purpose

Data collected from Verbosity, one of the CMU "Games with a Purpose", is used and released under ConceptNet's license, by permission from Luis von Ahn and Harshit Surana.

Verbosity players are anonymous, so in the "sources" list, data from Verbosity is simply credited to the pseudonym "verbosity".

UMBEL

UMBEL is available under a Creative Commons Attribution license. Here are UMBEL's license terms, adapted from Umbel's documentation:

UMBEL and its documentation are the joint creative works of Structured Dynamics LLC and Ontotext AD, which grant free use rights thereto, only limited by the attribution terms described in the Creative Commons 3.0 Attribution License. The copyrights to UMBEL and its documentation remain the sole rights of Structured Dynamics LLC and Ontotext AD.

The UMBEL Reference Concept Ontology is based on a faithful but reduced subset extraction of concepts and relationships from the OpenCyc version of the Cyc knowledge base. As such, UMBEL is a lightweight reflection of these sources, but not nearly as capable nor complete.

Use and relations based on UMBEL may therefore not be an accurate representation of what might be obtained in working directly with the source Cyc or OpenCyc knowledge bases.

Wikimedia projects

ConceptNet uses data directly from Wiktionary, the free dictionary. It also uses data from Wikipedia, the free encyclopedia via DBPedia.

Wiktionary and Wikipedia are collaborative projects, authored by their respective online communities. They are currently released under the Creative Commons Attribution-ShareAlike license.

Wikimedia encourages giving attribution by providing links to the hosted pages that the data came from, and DBPedia asks for the same thing in turn. In the raw data and the Web API, the sources of Wikimedia contributions can be found as URLs following the token /s/web.

For example, an assertion attributed to /s/web/de.wiktionary.org/wiki/Sprache/ uses information extracted from the page that can be seen on the Web at http://de.wiktionary.org/wiki/Sprache. Its list of individual contributors can be seen at: http://de.wiktionary.org/wiki/Sprache?action=history

Information from DBPedia is credited in a way that is designed to encourage interoperability with DBPedia. ConceptNet nodes that use information from DBPedia are linked to their DBPedia nodes in RDF N-Triples format. These links can be found in the data/sw_map directory.

WordNet

WordNet is available under an unencumbered license: see http://wordnet.princeton.edu/wordnet/license/. Its text is reproduced below:

WordNet Release 3.0

This software and database is being provided to you, the LICENSEE, by Princeton University under the following license. By obtaining, using and/or copying this software and database, you agree that you have read, understood, and will comply with these terms and conditions.:

Permission to use, copy, modify and distribute this software and database and its documentation for any purpose and without fee or royalty is hereby granted, provided that you agree to comply with the following copyright notice and statements, including the disclaimer, and that the same appear on ALL copies of the software, database and documentation, including modifications that you make for internal use or for distribution.

WordNet 3.0 Copyright 2006 by Princeton University. All rights reserved.

THIS SOFTWARE AND DATABASE IS PROVIDED "AS IS" AND PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES OF MERCHANT- ABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF THE LICENSED SOFTWARE, DATABASE OR DOCUMENTATION WILL NOT INFRINGE ANY THIRD PARTY PATENTS, COPYRIGHTS, TRADEMARKS OR OTHER RIGHTS.

The name of Princeton University or Princeton may not be used in advertising or publicity pertaining to distribution of the software and/or database. Title to copyright in this software, database and any associated documentation shall at all times remain with Princeton University and LICENSEE agrees to preserve same.