Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

2nd workshop 2018 - proposal #8

Closed
nuest opened this issue Oct 2, 2017 · 18 comments
Closed

2nd workshop 2018 - proposal #8

nuest opened this issue Oct 2, 2017 · 18 comments
Assignees

Comments

@nuest
Copy link
Member

nuest commented Oct 2, 2017

The call for workshops is out. Deadline is 30 November 2017.

2017 proposal

Ideas for topics

  • re-do http://o2r.info/agile-2017/ as it happened (introduce the topic to newcomers, share experiences)
  • present current paper (in writing), discuss results
  • focus on suggestions for AGILE (conference, member labs)
  • re-do the original workshop as intended (call for papers, public review, present reproducible works)
  • provide hands-on instructions for RR

Please feed free to add comments and more ideas here, I'll integrate them in this comment asap.

@foost
Copy link
Collaborator

foost commented Oct 9, 2017

from the Skype call:

  • present current paper and results, discuss
  • suggestions and best practice guide
  • hands-on practical work
  • think about how or what participants can submit, and whether and how there will be a review

@nuest
Copy link
Member Author

nuest commented Oct 9, 2017

  • submit code and data sample beforehand for organizers to review/prepare
  • not just become a pre-conference conference

@nuest
Copy link
Member Author

nuest commented Oct 9, 2017

A draft for the proposal is ready for contributions at https://github.com/o2r-project/agile-2017/blob/master/public/files/agile2018_proposal.md

@nuest
Copy link
Member Author

nuest commented Nov 1, 2017

more ideas: let's have a "hands-on" morning, and a more theoretical afternoon?

But there is a worth in having a more conceptual/theoretical discussion, might attract different people. Can we move that to some evening event?

Ideas for hands-on sessions

  • How to write a reproducible paper with R markdown
  • How to write a reproducible lab notebook with Jupyter
  • scripting GIS workflows in QGIS and ArcGIS

@nuest
Copy link
Member Author

nuest commented Nov 8, 2017

TUD thinks about RDM workshop, get back to them.

@foost
Copy link
Collaborator

foost commented Nov 21, 2017

150 minutes seem little time to teach enough for participants to be able to do anything meaningful.
do you mean to split up the group as per expressed interests?

@cgranell
Copy link
Collaborator

the practical proposal is ok, but I'd like to have content more catchy in the sense of "actually" reproducing two published papers (one in R, the other in python). Using these running examples, participants will try to reproduce them, face actual problems in doing so, and at the same time will learn practical aspects of reproducibility in R (markdown, etc) and python (jupyter, ArcGIS scripting, etc)..

@nuest
Copy link
Member Author

nuest commented Nov 22, 2017

@cgranell So you suggest to prepare two papers and then go through the process of reproducing them? @MarkusKonk Has a lot of experience in that :-)

I like the idea because it is much better to prepare than a "bring your own data" approach.

Yes, I suggest to split up the groups. I would also strictly focus on the workflow/reproducibility aspects. So for R, we would assume people know R already, but introduce them to R Markdown (e.g. by creating an R Markdown from a PDF-paper with a script as @cgranell suggests) and even the rticles package so you have the publication ready for submission, and upload it to Zenodo Sandbox at the end.
The same for Python+?-GIS: We're not teaching GIS, but show how you can construct a reproducible workflow instead of point-and-click.

@foost Would you prefer a full-day workshop? We can also cut down on the time for "Reproducibility at AGILE", just make that 15 mins. without discussion, would bring up the time for hands-on to 3 hrs.

@nuest nuest self-assigned this Nov 22, 2017
@nuest
Copy link
Member Author

nuest commented Nov 22, 2017

I've updated the proposal a little bit to mention "publications", because we're not introducing workflow tools etc, but do have a focus on reproducible papers / reproducible publications.

This would shift the content for the GIS-stuff a bit: R Markdown also supports chunks in other programming languages, so it should be no problem to create a reproducible paper using QGIS/any Python library with it :-).

@MarkusKonk
Copy link

Right, in the last months we reproduced several papers that were published by Copernicus. It was pretty interesting not only because of the different types of errors but also regarding the output of the code, i.e. the figures which were rarely equal to those reported in the paper and oftentimes quite far away from that. I am currently discussing that in another paper but we can discuss it in the workshop, too.

@cgranell
Copy link
Collaborator

Yes, that's the idea for the hands-on session

Let's start simple. Imagine we identify a paper that 's elegible to be reproduced in R. We can split the participants into groups so that all groups try to reproduce the same paper, following the workflow you specified before. The session might combine brief explanations (how-to use R markdown, how-to...) mixed with group work. Spontaneous discussion are encouraged to discuss problems that groups face while getting data, finding scripts, downloading scripts, solving execution errors, etc... Each group also write down what they are doing (based on a form we provide). For example, if finding data was easy, difficult , etc; how long does it take (in minutes), which score (0-3) was given etc. During the last half an hour, we can reflect on the difficulties to reproduce a paper, supporting tools, and best practices. As a result, we would get an experimental assessment of our proposed criteria for reproducible research, which can be input for going on out work.

Additionally, some of us might take the role of "observer" to monitor and take notes about participants' behaviour while trying to reproduce papers ...I mean, reproducibility is not only technology; it requires mind shift, a different way of doing research, create a habit, which can be studied from a social perspective too, complemented the previous form .

Other configurations might be possible...
Case 2: choose 2 papers and half of participants reproduce one , the other half the second paper.

If the above R experiment worked fine , we might extend it to python too, perhaps as a reproducibility course as part of the AGILE PhD School

Too ambitious??

@nuest
Copy link
Member Author

nuest commented Nov 22, 2017

I would not make this into a user study (how easy they find things) but tend to a more controlled version. But that are details we can figure out later. In the workshop description we could put something along the lines of

Participants explore the practical principles of reproducible papers by reproducing a provided real-world publication. Together with the instructors they create a reproducible document from public text/code/data and publish it in a data repository (and in a code repository/GitHub?).

We (@nuest @MarkusKonk) could do that for R. I do fear that if we use only R we might loose participants, so it would be great if we could offer the same (either with a Jupyter Notebook, or with Markdown + Python) for Python users. The Python example might include ArcGIS/QGIS, but we would for now not mention that.

@cgranell Totally agree on the "mind shift" aspect. We should try to teach a different way of working rather than teaching many features of tool X.

@foost What do you think? We might not appeal "regular GIS users", because you should have some experience in either R or Python, but having that as a prerequisite could solve your concerns wrt time.

We should eat our own dogfood and make the teaching material part of a GitHub repo und CC-BY, so of course AGILE PhD school is a reasonable future step to teach this.

@MarkusKonk
Copy link

To me, it sounds interesting.

@hoferb
Copy link

hoferb commented Nov 23, 2017

I agree completely with changing to reproducing existing work - or looking into how to make a 'research package' from existing material. Seems to me that both sides of the coin can be discussed.

(I just tried to reproduce the R script Daniel prepared but failed early on, when installing a package did not work. Very annoying. I would enjoy to learn more about reproducition using R Markdown ;)

@nuest
Copy link
Member Author

nuest commented Nov 23, 2017

Description updated, feedback welcome (especially on the newly added "Objectives" section).

@MarkusKonk
Copy link

Hi,
The proposal reads well.
Just a few comments on the description:

  • "newcomers" is probably not the right word
  • "...a short description of experience in R/Python" - do we have to say that they cannot attent if they are not familiar with R/Python? How about explicitly mentioning expected skills?
  • "...a summary of previously conducted or currently running computational work"
  • preferred hands-on session (R or Python)

@nuest
Copy link
Member Author

nuest commented Nov 29, 2017

I have updated the workshop proposal because we were a lot above the 1000 character limit for the workshop abstract, which I forgot to put in my template... anyways, we do have a longer text ready for the website now :-)

I also uploaded the PDF for submission, just in case somebody wants to do a final check: https://github.com/o2r-project/agile-2017/blob/master/public/files/agile2018_workshop_proposal_reproducible-research-publications-at-AGILE.pdf

@nuest nuest changed the title 2nd workshop 2018 2nd workshop 2018 - proposal Nov 30, 2017
@nuest
Copy link
Member Author

nuest commented May 18, 2018

Workshop was submitted and accepted, see https://agile-online.org/programme-2018/agile-workshops-2018

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants