OpenRefine-ecology

Lesson on OpenRefine for ecology

Data set notes.

  • This data set is derived from data collected by the Portal Project, a long-term desert ecology study. The data file was downloaded and then modified specifically for use with OpenRefine.
    • Taxon names were put back into the file.
    • Globally Unique Identifiers (in the form of UUIDs) were added.
  • These modifications were made to illustrate some features of OpenRefine.
    • Errors were added to the taxon names (scientificName field) to demonstrate OpenRefine's ability to find data that was likely entered incorrectly.
    • These errors can be found by running clustering algorithms on the scientificName column, which surface discrepancies quickly and make it simple to fix every issue found.
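
To give a feel for what clustering on the scientificName column does, here is a minimal Python sketch of fingerprint-style key-collision clustering, similar in spirit to one of the methods OpenRefine offers. It is an approximation written for illustration, not OpenRefine's actual implementation. The function names `fingerprint` and `cluster` and the sample values are assumptions made up for this example.

```python
import string
import unicodedata
from collections import defaultdict


def fingerprint(value: str) -> str:
    """Reduce a string to a normalized key (hypothetical helper)."""
    # Trim surrounding whitespace and lowercase.
    s = value.strip().lower()
    # Fold accented characters to their closest ASCII form.
    s = unicodedata.normalize("NFKD", s).encode("ascii", "ignore").decode("ascii")
    # Drop ASCII punctuation.
    s = s.translate(str.maketrans("", "", string.punctuation))
    # Split on whitespace, remove duplicate tokens, and sort them.
    return " ".join(sorted(set(s.split())))


def cluster(values):
    """Group distinct values that share the same fingerprint key."""
    groups = defaultdict(set)
    for v in values:
        groups[fingerprint(v)].add(v)
    # Only keys with more than one distinct spelling are worth reviewing.
    return [sorted(g) for g in groups.values() if len(g) > 1]


# Illustrative values only; these are not rows taken from the lesson's data file.
names = [
    "Dipodomys merriami",
    "dipodomys merriami ",
    "Dipodomys  merriami.",
    "Chaetodipus baileyi",
    "Chaetodipus bailey",
]
clusters = cluster(names)
print(clusters)
```

Variants that differ only in case, spacing, or punctuation land in the same cluster. The misspelling "Chaetodipus bailey" does not, because its key differs from the correct name; catching that kind of error needs a distance-based (nearest neighbor) method instead.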

Options.

  • Anyone already familiar with OpenRefine can easily substitute a different data set, as desired.