Skip to content
Go to file
Cannot retrieve contributors at this time
123 lines (107 sloc) 5.01 KB
layout: post
title: "Turning Researcher Fatigue into Puppy Fatigue"
date: 2017-06-19 14:40:00
jumbo_title: Turning Researcher Fatigue into Puppy Fatigue
On June 16th I gave the following short talk at Harvard's Institute for Quantitate
Social Sciences before I was on a panel with
[Victoria Stodden](,
[Thomas Leeper](,
and [Margo Seltzer]( as part of the 2017
Dataverse Community Meeting. The discussion centered around the "researcher
fatigue" that some experience when they're unwilling to use computational tools
for reproducible research. You can find the slides for this talk
[here]( The source code for this blog post is
a little different than usual, if you're interested you can find it [here](
```{r, eval=FALSE, echo=FALSE}
paste0("", 1:14) %>%
walk2(1:14, ~ webshot(.x, file = paste0(file.path("img",
"2017-06-19-Turning-Researcher-Fatigue-into-Puppy-Fatigue", .y), ".png"),
delay = 0.5, vwidth = 400, vheight = 300, zoom = 2))
```{r, echo=FALSE, results='asis'}
row_template <- '<div class="row">
<div class="col-md-6 pull-left">
<!-- <iframe src="" style="width:400px;height:300px" frameborder="0" scrolling="no"></iframe> -->
<img style="width:100%%;max-width:400px" src="/img/2017-06-19-Turning-Researcher-Fatigue-into-Puppy-Fatigue/%s.png">
<div class="col-md-1 hidden-xs">
<div class="col-md-5 pull-right">
# slides <- paste0("", 1:14)
slides <- 1:14
text_ <- c(
# 1
"I was thinking about the topic for this panel, and I started imagining some
of the potential uses for a data repository like Dataverse beyond the features
it already offers.",
# 2
"In the beginning of the reproducible research movement sharing data, code,
and other research artifacts was a hard problem.",
# 3
"Thankfully the right investments in infrastructure were made and now we have
resources like Dataverse!",
# 4
"However at the same time many other file storage services proliferated
making it easy to just throw files and code online without the well thought
out structure of an academic data repository.",
# 5
'The "throwing files online" approach is currently the least fatiguing path!
My boss [Jeff Leek]( (pictured here) just wrote a
[blog post](
championing just posting plain text files online. If you just put your data
on your website then nobody has to log in to a platform, theoretically they
can have the data in their computing environment with a URL and one line of
# 6
"My question is - what kind of value can a data repository provide beyond
being a place to make data available?",
# 7
"Well thankfully we have the tools of open science available to us in a data
repository! We have ORCiD IDs, DOIs, and all of the metadata that
our datasets are labelled with. What if we used these resources to make a data
repository a destination for discovering new research, datasets, and potential
# 8
"Here's a screenshot from my recent front page of of books I might
be interested in. Clearly this is the result of a clustering algorithm, based
on the books I've looked at or purchased in the past. These kinds of recommender
systems have become off-the-shelf technology in the last few years, and you
could imagine how incorporating a recommender system into a data repository
might help users discover research. We're collecting so much metadata about
the datasets going into these repositories, why not use that metadata in order
to suggest other datasets that a researcher might be interested in?",
# 9
"What if we thought about a data repository as a kind of meta-data product?",
# 10
"We've recently been working on a project that's similar in spirit in the Johns
Hopkins Data Science Lab. We've been working with Lucy and Nick, both PhD
students in the Biostatistics Department at Vanderbilt University.",
# 11
"They've developed a web application called Papr, which shows you abstracts
bioRxiv pre-prints. You can swipe left on an abstract if you don't like the
paper, or swipe right if you do like the paper.",
# 12
"Based on metadata for each pre-print we're able to construct a *paper-space*
where we can match you with researchers who have similar interests.",
# 13
"My hope and my proposal is that we can take ideas from other technology
products to make reproducible research tools more user-focused, so that users
will making their research reproducible. Perhaps one day the idea of researcher
fatigue will seem as foreign as puppy fatigue!",
"Thank you!"
text_ <- text_ %>% map_chr(~ renderMarkdown(text = .x))
walk2(slides, text_, ~ cat(sprintf(row_template, .x, .y)))
You can’t perform that action at this time.