Skip to content

WolframResearch/Data-Curation-Training

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Curation for the Wolfram Data Repository

This repository contains notebooks and other resources corresponding to a series of Twitch live coding sessions designed to teach the fundamentals of curating data in the Wolfram Language and preparing it for submission and publication in the Wolfram Data Repository.

Contents

Becoming a Certified Data Curator, Part 1

  • Watch and interact with a Wolfram|Alpha data scientist as he demonstrates how to curate several datasets using the Wolfram Language.
  • Screencast - twitch.tv
  • Notebooks and other resources - Part1

Becoming a Certified Data Curator, Part 2

  • Topics covered: scraping HTML, clustering data, dealing with (somewhat) messy data
  • Screencast - twitch.tv
  • Notebooks and other resources - Part2

Becoming a Certified Data Curator, Part 3

  • Topics covered: scraping XML & JSON, avoiding common errors
  • Screencast - twitch.tv
  • Notebooks and other resources - Part3

More information: