Knowledge sources

rspeer edited this page Oct 10, 2014 · 13 revisions

Here's a checklist of sources we include or may someday include when building ConceptNet 5.

Current sources

  • OMCS English, Portuguese, Japanese, Dutch, Korean, French (ConceptNet 4)
  • OMCS Chinese
  • GlobalMind
  • Verbosity
  • Wiktionary translations
  • WordNet (word senses should possibly be revised)
  • DBpedia's type and location relationships
  • JMDict
  • OpenCyc via Umbel

Potential future sources

  • More kinds of relations from DBpedia
  • Wikipedia links
  • Rule-based extractions from ConceptNet 4
  • XKCD color survey
  • VerbNet / FrameNet / PropBank (I get these confused with each other. Probably one or two of them can tell us very useful things about verb structure that aren't in any Linked Data project yet.)
  • Freebase (we have some code for it already, and it has lots of overlap with DBpedia; we might want to be selective, as it has hundreds of millions of assertions)
  • Collocations from Google Books N-grams 2012
  • Bahasa WordNet

Sources we probably can't use

  • EuroWordNet. It seems to be under a very restrictive license.