BD2K Module 12
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.
Audio File

BD2K Open Educational Resources

BD2K OER Materials Blueprint

Module Number: BDK12

Module Title: Data annotation and curation

Module Description:

Data preparation, developing standardized quality assurance processes and pipelines

Team Lead(s): Nicole Vasilevsky Team Members: Nicole Vasilevsky

Module Objectives:

At the completion of this component, the learner will be able to:

  1. Apply data preparation and planning best practices
  2. Describe data annotation and biocuration
  3. Apply data standards to research data sets using manual methods

Module Prerequisites: None

Module Units

Unit 1: Data preparation and planning

Description: This unit describes best practices for data preparation and planning including deciding the best formats to store data, directory and file naming conventions, basic metadata considerations, and data sharing considerations.

Unit 1 Slides: BDK12-1.pptx

Unit 1 Audio: BDK12-1.mp3 - Full lecture, Audio File - Individual Slides

Example: online presentation

Unit 2: File and Directory Naming

Description: This unit describes best practices for digital file and directory naming.

Unit 2 Slides: BDK12-2.pptx

Unit 2 Audio: BDK12-2.mp3 - Full lecture, Audio File - Individual Slides

Unit 1 & 2 Exercise: BDK12_Exercise01.docx

Example: online presentation

Unit 3: Annotating and Curating Data

Description: This unit describes professional biocuration and how researchers can better annotate their data to become biocurators themselves.

Unit 3 Slides: BDK12-3.pptx

Unit 3 Audio: BDK12-3.mp3 - Full lecture, Audio File - Individual Slides

Unit 3 Exercise: BDK12_Exercise02.docx (in

Unit 3 Exercise: Read the blog post: Ontological Annotation of Data and complete BDK12_Exercise03.docx

Example: online presentation

Module Supplemental Materials

Exercises: Glossary: BDK12_GlossaryTerms.pdf

References & Resources: BDK12_Ref.pdf

Recommended readings:

References cited:

References cited in lecture:

A note on Figures and Images

Nothing makes a learning session more engaging than fabulous visuals. While many in the education realm are accustomed to using a variety of rich images under the educational use exception, materials presented in an online educational resource (OER) format that are freely available and allow for users to remix, tweak and build upon the OERs present a unique problem. Images used in these circumstances must carry stringent CC BY-NC-SA (Creative Commons: Attribution – Non-Commercial – Share Alike) copyright.

As a result, the materials provided here have limited imagery as we intend for the users to remix, tweak and make these modules their own. At points in this module I have suggested inserting images of your choosing, not only to help create visual interest, but also to help tailor the educational experience to your audience. For examples, images that are being produced by researchers on your campus or in your department will drive a point home more effectively than generic or stock photos.

How does all of this copyright stuff work? For more information on copyright and fair use, I recommend a couple of resources.

When should you look to add additional images? When you see the clipboard icon, please consider identifying relevant images to the presentation. Suggested images may be hyperlinked, but not embedded in the presentation. Use your creativity when identifying images!

Where do I find images? There are several sources that might be available to you. Depending on how you plan on using the BD2K modules, you may have more flexibility to locate images. Once you have identify the license that you wish to use, you can search with those restrictions in mind.

  • Google Images: Head to Google Advanced Image Search and under the “usage rights” filter, select the filter that matches your requirements.
  • Flickr Creative Commons: Many users of Flickr have elected to allow their photographs to be reused. To browse or search for CC licensed images, head to
  • Institutional licenses: depending on your home institution, your library may subscribe to an image database that may be useful. Please consult with your librarian to see if such assets are available to you.
This material was developed by Oregon Health & Science University, supported by the National Institute of General Medical Sciences, funded by the NIH Big Data to Knowledge Initiative, under Award Number 1R25GM114820.