Skip to content

randomeffect/avoid-data-disaster

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

94 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Avoiding data disasters

Description

  • How much data would you lose if your laptop was stolen?
  • Have you ever emailed your colleague a file named 'final_final_versionEDITED'?
  • Have you ever struggled to import your spreadsheets into R?

As a researcher, you will encounter research data in many forms, ranging from measurements, numbers and images to documents and publications. Whether you create, receive or collect data, you will certainly need to organise it at some stage of your project. This workshop will provide an overview of some basic principles on how we can work with data more effectively. We will discuss the best practices for research data management and organisation so that our research is auditable and reproducible by ourselves, and others, in the future.

Aims: During this course you will learn about:

  • Options for backing up your computer
  • Ideas for naming and organising your files
  • Strategies for exchanging files with collaborators
  • Tips and tricks to make sure that your spreadsheets are readable by programming languages such as R
  • Learn how to use the OpenRefine software for data cleaning
  • Preparing high-throughput biological data for submission to a public repository such as Gene Expression Omnibus (GEO) or ArrayExpress

Objectives: After this course you should be able to:

  • Select an appropriate backup strategy for your data
  • Organise your files in a more structured and consistent manner
  • Avoid common pitfalls in spreadsheet manipulation
  • Known what resources are available at The University of Cambridge for Research Data Management

Materials

References