diff --git a/.DS_Store b/.DS_Store
new file mode 100644
index 0000000..5008ddf
Binary files /dev/null and b/.DS_Store differ
diff --git a/README.md b/README.md
index c990e08..596827d 100644
--- a/README.md
+++ b/README.md
@@ -3,14 +3,14 @@
In this presentation, Tom Johnson of the Institute for Analytic Journalism and I will demonstrate various ways of harvesting data from the Internet without programming. While we heartily recommend that reporters explore the power of programming languages such as Python, Ruby and R, we believe these software tools are a valuable means to getting information that is otherwise unobtainable.
Open Refine is a vital tool for cleaning dirty data. A typical example is when a dataset contains names of people or companies but with inconsistent spelling that needs to be standardized. At NICAR, Nils Mulvad and I will walk through a tutorial he created. The exercise is here, the practice data here and here.
+Open Refine is a vital tool for cleaning dirty data. A typical example is when a dataset contains names of people or companies but with inconsistent spelling that needs to be standardized. At NICAR, Nils Mulvad and I will walk through a tutorial he created. The exercise is here, the practice data here and here.
diff --git a/scraping.pptx b/scraping.pptx
deleted file mode 100644
index 900053e..0000000
Binary files a/scraping.pptx and /dev/null differ