Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine.
Java JavaScript HTML CSS Shell Batchfile Other
Pull request Compare This branch is 162 commits ahead, 270 commits behind OpenRefine:master.
Latest commit ae83f4f Oct 19, 2015 @DavidLeoni DavidLeoni Update README.md
Permalink
Failed to load latest commit information.
.settings Exclude build directory from Javascript checks Jun 22, 2013
CKAN-Java-Client moved ckanclientj and rdf-extension to opendatarise.deps groupId Sep 6, 2013
IDEs/eclipse Closes #13 frame/navigation Jul 11, 2013
broker Closes #13 frame/navigation Jul 11, 2013
conf Issue 630: Change branding from Google Refine to OpenRefine Oct 18, 2012
extensions finished implementing the ckan graph. Closes #34 Sep 12, 2013
graphics 120 pixel icon for Google OAuth2 registration Feb 10, 2013
licenses Normalize line endings Mar 23, 2013
main update ISearch. Oct 10, 2013
server Fixes #57 Sep 16, 2013
src/test/resources/backup-data-dir restored from master, added ODR.getLocale Aug 26, 2013
.classpath Update to patched version of Butterfly - fixes #652 Aug 18, 2013
.gitattributes Normalize line endings Mar 23, 2013
.gitignore fixed gitignore for .svn files Sep 16, 2013
.project changed eclipse project name to OpenDataRise Jun 3, 2013
.travis.yml Fix YAML so it parses May 26, 2013
CHANGES.txt Closes #13 frame/navigation Jul 11, 2013
LICENSE.txt Fixed Issue 488: ISO 8601 dates not supported in cell editing - cell-… Nov 27, 2011
README.md Update README.md Oct 19, 2015
build.properties Closes #13 frame/navigation Jul 11, 2013
build.xml Merge remote-tracking branch 'upstream/master' Aug 26, 2013
dev_start.bat can build everything, still fails to run (dev_start can't find maven … Aug 9, 2013
pom.xml (re)added ckanclient as module Sep 6, 2013
refine Fix revision calculation Aug 15, 2013
refine.bat Closes #13 frame/navigation Jul 11, 2013
refine.ini Rename data directory from Google/Refine to OpenRefine - closed #777 Aug 15, 2013
start.bat Closes #13 frame/navigation Jul 11, 2013
unsign Add script to unsign Mac executables Jul 29, 2013

README.md

opendatarise

Data integration tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine.

Project status: Developing - we are testing using reconciliation sevices of DISI, University of Trento. Currently code is kept in a private repository, when project reaches a sufficient level of stability we will merge changes into the public repo. We keep public wiki updated, though, so you can get an idea of how the project will look like. You can also watch a demo video with a run of ODR on a dataset about certified products.

Additions to OpenRefine:

  • a workflow subdivided in steps
  • an interface to import datasets from ckan repositories with Jackan client and also to visualize resources stats taken with Ckanalyze
  • provenance tracking with TraceProv
  • schema guessing with Open Data Schema Matcher and Column Recognizers
  • suggestions on operations to do based on schema
  • enhanced data validation with column types
  • multivalued cells support
  • semantic tagging of natural language text, using SemText datamodel
  • Abstraction of knowledge base via OpenEntity API
  • online help system
  • maven as dependency management system instead of Ant
  • WAR deployable on Tomcat as build output instead of Refine custom old Jetty server
  • interactive debugging support with a recent version of Jetty
  • enhanced event system for plugins
  • possibility for plugins to attach data to columns, cells and rows

Roadmap: see project issues

Documentation: see the wiki

Platform

Credits

OpenDataRise adds a semantic layer upon the OpenRefine platform, so it owes a great deal of gratitude to OpenRefine authors.

OpenDataRise contributors:
OpenRefine contributors:

Refine was created by Metaweb Technologies, Inc. and originally written and conceived by David Huynh dfhuynh@google.com. Metaweb Technologies, Inc. was acquired by Google, Inc. in July 2010 and the product was renamed Google Refine. In October 2012, it was renamed OpenRefine as it transitioned to a community-supported product.

This is the full list of Open Refine contributors (in chronological order):