Talk at Workshop on Visualization for Big Data: Strategies and Principles, Fields Institute
Want to see the slides?
Want to see a video of the talk?
The Big Data Brain Drain: Why Science is in Trouble: http://jakevdp.github.io/blog/2013/10/26/big-data-brain-drain/
RStudio IDE: http://www.rstudio.com/products/rstudio/
The R Project: http://www.r-project.org
STAT 545: http://stat545-ubc.github.io
2014-08-17 NYT article For Big-Data Scientists:, 'Janitor Work' Is Key Hurdle to Insights by Steve Lohr
Data carpentry blog post by David Mimno (response to NYT article re: janitor work)
Daring Fireball's markdown page. Kind of where it all begins, but other references that are more recent and specific to our context are more relevant.
Carson Sievert's talk Reproducible web documents with R, knitr & Markdown
MathJax + RStudio specifics:
- RStudio documentation page about Equations in R Markdown
Pandoc: If you need to convert files from one markup format into another, pandoc is your swiss-army knife.
R Markdown: http://rmarkdown.rstudio.com
Simple Markdown and R Markdown example files are here: https://github.com/jennybc/2013-11_sfu
Baumer, B., Cetinkaya-Rundel, M., Bray, A., Loi, L., & Horton, N. J. (2014). R Markdown: Integrating A Reproducible Analysis Tool into Introductory Statistics. Technology Innovations in Statistics Education, 8(1). link
RStudio + R Markdown specifics:
"FINAL".doc from PHDCOMICS: http://www.phdcomics.com/comics/archive.php?comicid=1531
Jeromy Anglim's blog post Getting Started with R Markdown, knitr, and Rstudio 0.96 and a Gist containing the source
Yihui Xie blog post about when to use Markdown and when to use LaTeX
Ram, K 2013. Git can facilitate greater reproducibility and increased transparency in science. Source Code for Biology and Medicine 2013 8:7. Go to the associated github repo to get the PDF (link at bottom of README) and to see a great example of how someone managed the process of writing a manuscript with git(hub).
Karl Broman's tutorial, aimed at stats / data science types: An introduction to Git/Github.
SourceTree Git client, with a GUI. Available for Windows or Mac.
RStudio + Git(Hub)
- Blog post from The Molecular Ecologist on using GitHub with R and RStudio
xkcd on Git commit messages: http://xkcd.com/1296/
A few R packages on GitHub:
GitHub repos from government, media, and more
- http://ncip.github.io (NCI’s informatics program)
- https://github.com/LSST (Large Synoptic Survey Telescope)
- https://github.com/ged-lab/ (Titus Brown lab)
- https://github.com/lh3 (Heng Li lab)
Rough notes on GitHub usage in STAT 545: http://stat545-ubc.github.io/bit004_stat545-use-of-github.html
R Graph Catalog: http://shinyapps.stat.ubc.ca/r-graph-catalog/
A few Shiny apps by STAT 545 students: