Documentation For Users
Documentation hub for users
Where to start?
- How to install and run (see UpgradingTo2p0 for upgrading instructions)
- How to use
- A Google Chrome extension for GitHub-Wiki-Search will help you search our wiki.
Love it so much?
- Join in with the development or help spread the meme about OpenRefine.
- Get the development version so you can play with all the latest and greatest features, but only if you are not afraid of bugs.
Essential: These are the features you'd use 80% of the time when you use Refine. They help you clean up your data, extend it, and export it for other tools to consume.
- Filtering / faceting
- History (undo/redo)
Reconciliation: You can use the "reconciliation" support in Refine to link text names in your data to database identifiers (also known as database keys or IDs). By connecting your data with other databases, you get more value out of your data.
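To make the idea concrete, here is a minimal Python sketch of linking text names to identifiers. It is only an illustration and not how Refine's reconciliation works internally: the `REFERENCE_IDS` table, its names and IDs, and the `reconcile` helper are all hypothetical, and fuzzy matching with the standard library's `difflib` is an assumed stand-in for a real reconciliation service.

```python
from difflib import get_close_matches

# Hypothetical reference "database": canonical name -> identifier.
# These names and IDs are made up for illustration only.
REFERENCE_IDS = {
    "Acme Corporation": "org-001",
    "Globex Inc": "org-002",
    "Initech LLC": "org-003",
}


def reconcile(name, reference=REFERENCE_IDS, cutoff=0.8):
    """Return the ID of the closest reference name, or None if nothing is close enough."""
    # Compare case-insensitively so "acme corporation" matches "Acme Corporation".
    by_normalized = {key.lower(): ident for key, ident in reference.items()}
    matches = get_close_matches(
        name.strip().lower(), list(by_normalized), n=1, cutoff=cutoff
    )
    return by_normalized[matches[0]] if matches else None


if __name__ == "__main__":
    for raw in ["Acme Corporation", " acme corporaton ", "Unknown Co"]:
        print(repr(raw), "->", reconcile(raw))
```

Names that are misspelled or differ only in case or surrounding whitespace can still resolve to an ID, while names with no close candidate come back as `None` so they can be reviewed by hand.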
Extensions: Add even more features to Refine by installing extensions.
Recipes and worked examples
For complicated tasks that are commonly done, just follow recipes that other folks have discovered and written up. A recipe is just a sequence of steps, along with tips and tricks.
- Recipes (like removing duplicate row values, etc.)
There are also full recipes:
- Geocoding - translating street addresses to lat/lng coordinates
- Fetching URLs From Web Services - grabbing from the Web more data related to the data you already have
- Web Scraping & Extracting items from HTML pages
- Using Python as your expression language
- Parsing and Formatting Phone Numbers - Using Java with Python
- Extending Jython with pypi modules
Many tutorials and worked examples that illustrate project flow and the logic of using OpenRefine are collected under External Resources.
- General Refine Expression Language (GREL)
While there's a lot of information in this wiki, there's plenty more online too. Here are some notable sites:
- RefinePro Knowledge Base for OpenRefine
- Tony Hirst's blog
- Martin Magdinier's blog
- Rod Whiteley's GREL Quick Reference (PDF)
- Yours? Let us know!
And, of course, you can always search online.