This repository includes DIME's commitments to high-quality research. In the spirit of transparency, we are developing and version-controlling DIME's standards on GitHub. This repository is maintained by the DIME Analytics team.
There are two components to this repository:
- DIME researchers must secure ethics approval from an institutional review board (IRB) and, if applicable, any relevant authority in the study location, for studies directly involving human subjects or using personally-identifiable information.
- All DIME team members who handle personally-identifiable information must have up-to-date Human Subjects Research Certification.
- DIME staff must ensure the confidentiality, privacy, and anonymity of study participants; study participants must have the opportunity to provide informed consent, and to revoke that consent at any time.
For details and implementation resources, see the Research Ethics Guidelines
- All DIME projects must be registered prior to receiving implementation funding.
For implementation resources, see the Research Transparency Guidelines
- All DIME projects will use GitHub to document data work.
- DIME research assistants will regularly participate in peer code review sessions.
- All DIME projects will have a master script that runs all other scripts needed for the project, in order.
- Computational reproducibility must be verified by DIME Analytics prior to publication for all DIME Working Papers and academic publications.
For implementation resources, see the Research Reproducibility Guidelines
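As a rough illustration of the master-script requirement above, the following Python sketch runs a fixed list of scripts in order and stops at the first failure. The script names and the `run_pipeline` helper are hypothetical and are not taken from the DIME guidelines; a Stata or R project would use a master do-file or R script structured in the same way.

```python
import subprocess
import sys
from pathlib import Path

# Hypothetical script names, listed in the order they must run.
PIPELINE = [
    "01_clean.py",
    "02_construct.py",
    "03_analyze.py",
]


def run_pipeline(scripts, code_dir=".", runner=None):
    """Run each script in order, stopping at the first failure.

    Returns the list of scripts that completed successfully.
    """
    if runner is None:
        # Default: run each script with the current Python interpreter
        # and report its exit code (0 means success).
        def runner(path):
            return subprocess.run([sys.executable, str(path)]).returncode

    completed = []
    for name in scripts:
        path = Path(code_dir) / name
        if runner(path) != 0:
            raise RuntimeError(f"{name} failed; later scripts were not run")
        completed.append(name)
    return completed
```

Calling `run_pipeline(PIPELINE)` from the project's code folder would then reproduce every step from a single entry point, which is what makes it practical for a reviewer to verify the results.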
- All personally-identifiable data must be stored and transferred securely, including in communication with field staff.
- All servers for data collection must be encrypted both in transit and at rest.
- All identified data must be stored only in securely encrypted locations, and must always be encrypted when shared, even if shared only within the project team.
- All DIME projects must follow the DIME Data Back-up Protocols.
For implementation resources, see the Data Security Guidelines
- Data must be de-identified prior to publication; DIME Analytics can assist with assessing the risk of statistical disclosure.
- All DIME publications must include a replication package, including a citation or reference to the data in the World Bank Microdata Catalog and a GitHub repository containing the code required for replication of the paper.
- Data must be deposited into the World Bank's Development Data Hub no later than six months after it is acquired, with provision to deposit any later revisions or updates, as per the World Bank Procedure on Development Dataset Acquisition, Archiving and Dissemination.
For implementation resources, see the Data Publication Guidelines
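One possible first step toward the de-identification requirement above is sketched below in Python: it drops direct identifiers and replaces them with a random study ID. The column names and the `deidentify` helper are hypothetical. Removing direct identifiers does not by itself eliminate statistical disclosure risk, since combinations of the remaining variables may still identify a person, so a sketch like this would complement rather than replace a disclosure risk assessment.

```python
import secrets

# Hypothetical direct identifiers; a real project would list its own.
DIRECT_IDENTIFIERS = {"name", "phone", "national_id"}


def deidentify(records, id_field="national_id"):
    """Drop direct identifiers and assign a random study ID per person.

    Returns (cleaned_records, key), where key maps each original ID to
    its study ID. The key is itself identifying, so it must be stored
    encrypted and kept separate from the de-identified data.
    """
    key = {}
    cleaned = []
    for record in records:
        original_id = record[id_field]
        if original_id not in key:
            # Random IDs cannot be reversed without the key, unlike
            # IDs derived from the original value.
            key[original_id] = secrets.token_hex(8)
        row = {k: v for k, v in record.items() if k not in DIRECT_IDENTIFIERS}
        row["study_id"] = key[original_id]
        cleaned.append(row)
    return cleaned, key
```

The same person keeps the same study ID across rows, so repeated observations can still be linked in the published data without exposing the original identifier.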
The DIME data map provides a high-quality, professional framework for planning data work.
- For a general discussion on the usages and benefits, see the Data Map article on the DIME Wiki.
- To download our Data Linkage Table template, click here.
- For an example of a Data Map with a Data Linkage Table, Master Datasets, and Data Flow Charts, see here.
DIME code should adopt the following guides:
- Python: PEP 8
- R: Tidyverse style guide
- Stata: DIME Analytics style guide (see the Appendix in the DIME Data Handbook)