
## Best practices



### Reproducibility and Replicability

Like any other scientific study, a numerical modeling study should be reproducible and, most importantly, replicable.
- Reproducible means that the same model, run on the same machine with the same code, should produce the same results at the stated precision.
- Replicable means that researchers can obtain sufficiently similar results through independent development, i.e., reach the same conclusions using a different research methodology.
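
The reproducibility criterion above can be checked mechanically. The following is a minimal sketch, assuming that the results of two runs are available as equally ordered lists of floats and that the "stated precision" is expressed as a relative tolerance. The function name `is_reproduced`, the tolerance, and the sample values are hypothetical and only serve as illustration.

```python
import math


def is_reproduced(reference, rerun, rel_tol=1e-6, abs_tol=0.0):
    """Return True if two result sequences agree at the stated precision."""
    if len(reference) != len(rerun):
        return False
    return all(
        math.isclose(a, b, rel_tol=rel_tol, abs_tol=abs_tol)
        for a, b in zip(reference, rerun)
    )


# Hypothetical diagnostics from an original run and a rerun of the same setup
reference = [1350.0, 0.125, 3.3e-9]
rerun = [1350.0000001, 0.125, 3.3e-9]
print(is_reproduced(reference, rerun))
```

A relative tolerance is used here because model diagnostics often span many orders of magnitude; if values close to zero matter, a nonzero `abs_tol` would likely be needed as well.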

Reproducibility and replicability imply that the software used to conduct the study, as well as the specific model setups, installation environment specifications, and post-processing workflow, should be available to the rest of the scientific community. Increasingly, scientific journals are requesting or even requiring the publication of data and software along with the manuscript. Although the requirements vary by journal, it is good practice to adhere to these principles for every publication.
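
One way to record an installation environment specification for a Python-based post-processing workflow is sketched below. It uses only the standard library. The function name `capture_environment` and the layout of the resulting record are assumptions made for illustration, not a prescribed format; compiled modeling codes would additionally need compiler and library versions, which this sketch does not capture.

```python
import json
import platform
from importlib import metadata


def capture_environment():
    """Collect interpreter, operating system, and installed package versions."""
    packages = {}
    for dist in metadata.distributions():
        name = dist.metadata.get("Name")
        if name:  # skip distributions whose metadata lacks a name
            packages[name] = dist.version
    return {
        "python": platform.python_version(),
        "platform": platform.platform(),
        "packages": dict(sorted(packages.items())),
    }


env = capture_environment()
# The JSON text could be saved alongside the model setup and archived with it
print(json.dumps(env, indent=2)[:300])
```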

### Software and data management

Both code and data should be archived. Here, data refers to both input and output data. Observational data can also be made available to support the chosen model design and the initial and boundary conditions.

Archiving code and data enables others to find, reuse, and extend the work.

Proper research data management should address the following points:
- description of data (metadata), their collection and production, and the reuse of existing datasets;
- storage, backup, and security during the project;
- legal and ethical requirements, as well as codes of conduct;
- what, how, and where data will be stored, accessed, and identified; and
- who will deposit and maintain the data.
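
As an illustration of the first and fourth points, a small machine-readable metadata record can accompany each archived file. The sketch below is a hypothetical example, not a standard metadata schema: the field names, the helper functions `sha256_of` and `describe`, and the sample file are all assumptions. The checksum allows others to verify that a downloaded file is identical to the deposited one.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def describe(path, description, creator, license_name):
    """Build a small metadata record for one data file (hypothetical fields)."""
    path = Path(path)
    return {
        "file": path.name,
        "size_bytes": path.stat().st_size,
        "sha256": sha256_of(path),
        "description": description,
        "creator": creator,
        "license": license_name,
    }


# Demonstration on a throwaway file standing in for model output
content = b"depth_km,temperature_K\n0,273\n100,1600\n"
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "model_output.csv"
    sample.write_bytes(content)
    record = describe(sample, "Hypothetical 1D temperature profile",
                      "A. Modeler", "CC-BY-4.0")

print(json.dumps(record, indent=2))
```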

Many repositories can be used to archive data and code:
- Institutional repositories.
- Domain-specific repositories - e.g., [EarthChem](https://www.earthchem.org/), [EarthScope](https://www.earthscope.org/data/), [PANGAEA](https://www.pangaea.de/), and [HydroShare](https://www.hydroshare.org/).
- General repositories - e.g., [figshare](https://figshare.com/), [Dryad](https://datadryad.org/stash), and [Zenodo](https://zenodo.org/).

AGU provides a listing and description of [domain-discipline repositories](https://data.agu.org/resources/useful-domain-repositories). See also the [Generalist Repository Comparison Chart](https://doi.org/10.5281/zenodo.3946719) and the [re3data](https://www.re3data.org/) registry of research data repositories.


All these repositories provide a persistent identifier (PID), such as a digital object identifier (DOI), which ensures that modelers can receive credit for their work through citation of the code and/or data when used by others.

<div style="text-align: right">  &rarr; <b>NEXT:</b> <a href="11_publishing_geodynamic_modeling_L0.ipynb">Publishing</a> &nbsp;&nbsp;
<img src="../assets/education-gem-notebooks_icon.png" alt="icon"  style="width:4%">
</div>