/
REGION_resource.tex
165 lines (150 loc) · 7.78 KB
/
REGION_resource.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
% Created 2015-06-24 Wed 16:14
\documentclass[11pt, a4paper]{article}
%\documentclass[scrartcl,a4paper,utopia,10pt]{article}
\usepackage[latin1]{inputenc}
\usepackage[T1]{fontenc}
\usepackage{fixltx2e}
\usepackage{natbib}
\usepackage{graphicx}
\usepackage{longtable}
\usepackage{float}
\usepackage{wrapfig}
\usepackage{soul}
\usepackage{textcomp}
\usepackage{marvosym}
\usepackage{wasysym}
\usepackage{latexsym}
\usepackage{amssymb}
\usepackage{hyperref}
\tolerance=1000
\usepackage{color,hyperref}
\definecolor{darkblue}{rgb}{0.0,0.0,0.3}
\hypersetup{colorlinks,breaklinks,
linkcolor=darkblue,urlcolor=darkblue,
anchorcolor=darkblue,citecolor=darkblue}
\usepackage{fullpage}
%\usepackage{microtype, fullpage}
%\providecommand{\alert}[1]{\textbf{#1}}
\title{WooW-II: Workshop on open Workflows\footnote{The creation of this workshop is generously sponsored by the European
Union's Seventh Framework Programme ``Foster'' (see as well \url{https://www.fosteropenscience.eu/event/workshop-open-workflows}).
}}
\author{Daniel
Arribas-Bel\footnote{\href{mailto:D.Arribas-Bel@bham.ac.uk}{D.Arribas-Bel@bham.ac.uk}
School of Geography, Earth and Environmental Sciences, University of
Birmingham (UK).}
\and
Thomas de Graaff
\footnote{\href{mailto:t.de.graaff@vu.nl}{t.de.graaff@vu.nl}
Department of Spatial Economics, VU University, Amsterdam (Netherlands).}}
\date{\today}
\hypersetup{
pdfkeywords={},
pdfsubject={},
pdfcreator={Emacs Org-mode version 7.9.3f}}
\begin{document}
\maketitle
\begin{abstract}
This resource describes WooW-II, a two-day workshop on open workflows for
quantitative social scientists. The workshop is broken down in five main
parts, where each of them typically consists of an introductionary tutorial and a
hands-on assignment. The specific tools discussed in this workshop are
\texttt{Markdown}, \texttt{Pandoc}, \texttt{Git}, \texttt{Github}, \texttt{R},
and \texttt{Rstudio}, but the theoretical approach applies to a wider range of
tools (e.g., \LaTeX \, and \texttt{Python}). By the end of the workshop,
participants should be able to reproduce a paper of their own and make it
available in an open form applying the concepts and tools introduced.
\end{abstract}
\section{Background}
\label{sec-1}
As in most social sciences, virtually no training is provided in regional
science on workflow design and choice of appropriate tools, especially not
from the viewpoint of \emph{open science} \citep{healy2011choosing,
arribas2014}. Students and young researchers
typically receive no guidance as to why or how they should adopt habits that
favor the open science principles in their research activity. This is
unfortunate, because learning and adopting new tools and workflows require a
large time investment, which will only pay-off in the long run. The best time
to do this is early in the career when one still has (some) time available.
Therefore, this workshop is specifically aimed at young researchers and covers
the main ideas behind a well-designed workflow with openness, transparency and
reproducibility in mind. At the same time, the content provides an
introductory, hands-on, overview of a
set of free tools that have been designed with such values in
mind.
We do not get into every detail of each tool. Instead, we aim to give a gentle
introduction, to provide further material and to place them in the appropriate
context. Specific emphasis is set on how certain tools contribute to
building a coherent open workflow and how they relate to each other. The main
areas reviewed are: mark-up languages such as \texttt{Markdown};
reference managers, particularly those open and free such as \texttt{Bibtex}
which are compatible with \LaTeX; conversion tools such as \texttt{Pandoc};
open environments for statistical computing such as \texttt{R} or
\texttt{Python}; version control systems such as \texttt{Git}; and online
hosting on open repositories such as \texttt{GitHub}. At the end of the
workshop, participants should be able to reproduce a paper of their own and
make it available in an open form applying the concepts and tools introduced.
Materials are organized in a website that is openly hosted on GitHub and
licensed using Creative Commons so access, remix and redistribution are
permitted.
\section{Description of the resource}
\label{sec-2}
The structure of the workshop is organized in two main blocks. The first
session introduces basic concepts such as open science, transparency and
reproducibility. Here, we stress the relevance of paying attention to the way
science is carried out and connect it to the choice of tools that allow such
values to be seamlessly embraced in the day-to-day practice of quantitative
research in social science. The second, longer, part of the workshop includes
four sessions with hands-on overviews of specific tools that have been
designed with open science principles in mind and that hence provide the
ingredients of a well-thought open workflow. This is delivered alternating
presentation time with hands-on practice, allowing participants to get a real taste of
what using the tools implies and experience their advantages.
%
The five sessions are presented as follows:
\begin{enumerate}
\item In this 3h. session we introduce the concepts of workflow, openness and
reproducibility. In the first part, We argue why they are important and
what as social scientists we can learn from data scientists. Our main
argument is that, even though in the social sciences complete
reproducibility is often infeasible, we should strive for research to
become \emph{as reproducible} as possible.
\item In this 2h. session we introduce the concepts of version control and
automation of tasks. The first relates to keeping track of changes as they
occur throughout the proccess, while the second one allows us to break up
the different components of an analysis and have them automatically run,
when needed, in the correct sequence. The two tools with which we will
play to explore these ideas practically are \texttt{git} and
\texttt{make}.
\item In this 2h. session we introduce the concept of markup languages and
working with the terminal. In particular we focus on \texttt{Markdown}: a
very lightweight markup language (and probably the fastest way to create
slides). In particular we deal with \texttt{RStudio} and
\texttt{Markdown}. This enables writing a (part of a) paper in
\texttt{Markdown} in \texttt{RStudio} including headers, links, formula's,
tables and references. Using \texttt{RStudio} allows as well for
exporting to more well-known formats , such as \texttt{docx},
\texttt{html} and \texttt{pdf}.
\item In this 3h. session we take an overview of the main ideas behind making
data analysis reproducible and transparent. We use the \texttt{R}
statistical platform in combination with \texttt{RStudio} for two main
reasons: (\emph{i}) it works the best \textbf{out of the box} for
\emph{our} purposes and (\emph{ii}) at the moment most researchers
probably work with this combination for reproducibility.
\item In this final 1.5h session we introduce how one could make your
reproducible research open. This basically means making use of
repositories such as \texttt{Github}, which not only serves as a backup
repository, but as well as a way of collaboration with known and unknown
authors. Further, we show that making slides in \texttt{RStudio} is a
breeze and why you actually might want to publish a document in
\texttt{HTML} instead on paper.
\end{enumerate}
\section{Resource links}
\label{sec-3}
\begin{itemize}
\item Website: \href{http://darribas.org/WooWii/}{http://darribas.org/WooWii/}
\item Materials: \href{https://github.com/darribas/WooWii}{https://github.com/darribas/WooWii}
\end{itemize}
%\bibliographystyle{kluwer}
\bibliographystyle{apalike}
\bibliography{refs}
\end{document}