# Towards Open Science in Acoustics: Foundations and Best Practices

### Sascha Spors, Matthias Geier, Hagen Wierstorf

* Institute of Communications Engineering, University of Rostock
* Filmuniversität Babelsberg *KONRAD WOLF*

Sascha.Spors@uni-rostock.de

7.3.2017

# Who should benefit from my research? 

* myself
* my future self
* my boss
* my colleagues
* other researchers
* all people in the world
* science itself

# Reproducibility of a Listening Experiment

1. Idea
2. Design of the Listening Experiment
    * hypothesis
    * design
3. Implementation & Computation
    * mathematical derivations
    * implementation of signal processing
    * implementation of graphical user interface & control logic
    * numerical simulations
    * generation of stimuli
4. Experiment
5. Analysis
    * anonymization of data
    * outlier removal
    * statistical analysis
6. Manuscript
    * text, references
    * visualization of results (plots)
7. Peer Review
    * ratings, comments
    * revision of manuscript
8. Publication
9. Aftermath
    * reproduction by third parties
    * post publication review
    * errata, code revision
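
The analysis step (anonymization of data, outlier removal, statistical analysis) is easier for third parties to reproduce when it is scripted end to end. The following is a minimal sketch under stated assumptions, not the procedure used in any particular experiment: the salted-hash pseudonyms, the 1.5 × IQR outlier rule, and all function names (`pseudonymize`, `remove_outliers`, `analyze`) are hypothetical choices made for illustration.

```python
import hashlib
import statistics

# Hypothetical salt; in practice it is kept out of the public repository.
SALT = "replace-with-secret-salt"


def pseudonymize(participant_id, salt=SALT):
    """Replace a participant ID by a short salted SHA-256 pseudonym."""
    digest = hashlib.sha256((salt + participant_id).encode("utf-8")).hexdigest()
    return digest[:8]


def remove_outliers(ratings, k=1.5):
    """Drop ratings outside [Q1 - k*IQR, Q3 + k*IQR] (assumed rule)."""
    q1, _, q3 = statistics.quantiles(ratings, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [r for r in ratings if low <= r <= high]


def analyze(raw):
    """Map each pseudonym to the mean of its outlier-free ratings.

    raw: dict mapping participant ID -> list of numeric ratings.
    """
    return {
        pseudonymize(pid): statistics.mean(remove_outliers(ratings))
        for pid, ratings in raw.items()
    }
```

Because every step is a plain function of its input, rerunning `analyze` on the published raw data yields the same numbers that appear in the manuscript, provided the same salt and outlier rule are used.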

# The Scientific Method

**Branches of the scientific method** [Donoho, 2009]
1. Deductive $\rightarrow$ mathematics, formal logic
2. Empirical $\rightarrow$ statistical analysis of controlled experiments
3. Computational
    1. large-scale simulations
    2. data-driven computational science



**Classification of Reproducibility** [Stodden, 2014]
* empirical reproducibility
* computational reproducibility
* statistical reproducibility

# Open Science

* Open Methodology
* Open Data
* Open Source
* Open Access
* Open Educational Resources
* Open Peer Review

according to [http://openscienceasap.org/open-science/]

# Reproducibility of a Listening Experiment - Revisited

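1. Idea
2. Design of the Listening Experiment $\rightarrow$ Open Methodology
3. Implementation & Computation $\rightarrow$ Open Data, Open Source
4. Experiment $\rightarrow$ Open Data
5. Analysis $\rightarrow$ Open Data, Open Source, Open Methodology
6. Manuscript $\rightarrow$ Open Data, Open Access
7. Peer Review $\rightarrow$ Open Peer Review
8. Publication $\rightarrow$ Open Access
9. Aftermath $\rightarrow$ Open Access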

# Incentives and Barriers

### Selected Results from a Survey of the Machine Learning Community

**Barriers (Data / Code)**
* time to document and clean up (54% / 77%)
* dealing with questions from users (34% / 52%)
* not receiving attribution (44% / 42%)
* possibility of patents (-- / 40%)
* legal barriers (e.g. copyright) (34% / 41%)

**Incentives (Data / Code)**
* encourage scientific advancement (81% / 91%)
* encourage sharing in others (90% / 79%)
* be a good community member (86% / 79%)
* set a standard in the field (82% / 76%)
* improve the calibre of research (85% / 74%)

Results from [Stodden, 2010], N=134

# Management of Research Data

* systematic management of research data is a prerequisite for open and reproducible science
* data management is becoming mandatory in funding schemes (DFG, H2020, NSF, ...)

**Best Practices** [DFG, HRK, Stodden, H2020]
* develop a comprehensive data management plan
* use workflow tracking in the research process
* make data findable, accessible, interoperable and reusable (FAIR)
* apply open licensing models
* offer training and qualification
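
One small step towards findable and reusable data is to ship a machine-readable manifest with every dataset. The sketch below is an illustration under stated assumptions, not a prescribed metadata standard: the field names (`title`, `license`, `creators`, `files`) and the helper names are hypothetical, and only the Python standard library is used.

```python
import hashlib
import json
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def build_manifest(data_dir, title, license_id, creators):
    """Collect descriptive metadata and per-file checksums for a dataset."""
    data_dir = Path(data_dir)
    files = sorted(p for p in data_dir.rglob("*") if p.is_file())
    return {
        "title": title,            # hypothetical field names, not a standard
        "license": license_id,
        "creators": creators,
        "files": [
            {"path": p.relative_to(data_dir).as_posix(), "sha256": sha256_of(p)}
            for p in files
        ],
    }


def write_manifest(manifest, out_path):
    """Write the manifest as sorted, indented JSON for stable diffs."""
    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(manifest, f, indent=2, sort_keys=True)
```

Writing the manifest outside `data_dir` keeps it from being listed in its own file table on the next run. The checksums let a third party verify that a downloaded copy matches the published data.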

# Services for Open Science

Generic repositories including version tracking
* GitHub
* Bitbucket

Repositories for Research Data
* Zenodo
* Research Compendia
* Reproducible Research
* ResearchGate

Virtual Research Environments
* Open Science Framework

Journals
* Journal of Open Research Software

# Copyright and Licenses

* the legal situation is unclear when content is published without an explicit license
* the license should be as open as possible in order to promote reuse
* legal implications are complex and hard to keep track of

**Available Licensing Frameworks**
* Software: GNU General Public License (GPL), BSD, MIT, ...
* Content: Creative Commons, ...

**Recommendations**
* Reproducible Research Standard (RRS) [Stodden, 2009]
* ...

# Personal Experience

* public release of the SoundScape Renderer (SSR) in 2010
* various toolboxes, datasets, open educational resources ($\rightarrow$ https://github.com/spatialaudio)
* Open Science in Two!Ears
* internal data management: Redmine, SVN, Git
* public releases: GitHub, Zenodo, WordPress

**Benefits**
* preparing a public release prompts documentation, clean-up and discussion
* bug reports
* positive community feedback
* potentially more citations

**Challenges**
* initial effort (training, ...)
* missing versioning tool/platform for databases

# Conclusions

* reproducibility of results is essential for the scientific method
* Open Science does not by itself make results easy to reproduce
* scientific innovation vs. evaluation measures
* global trend towards sharing
* data repository for acoustics/audio $\rightarrow$ talk by Stefan Weinzierl

# References

* [Stodden, 2009] Victoria Stodden, The Legal Framework for Reproducible Scientific Research, Computing in Science & Engineering, January/February 2009.
* [Stodden, 2014] Victoria Stodden, Resolving Reproducibility in Computational Science: Tools, Policy, and Culture, 2014.
* [Donoho, 2009] David Donoho, Arian Maleki, Inam Rahman, Morteza Shahram, Victoria Stodden, 15 Years of Reproducible Research in Computational Harmonic Analysis, Computing in Science & Engineering, 11(1), 2009.