Skip to content
Sam Levin edited this page Feb 5, 2018 · 2 revisions

Welcome to the Padrino wiki!

Overarching goal for the data base -

  1. Allow a user to download an "IPM representation" from the data base and reconstruct it for subsequent analyses

    • Important for reproducibility
  2. Supply all relevant metadata so that they can be re-analyzed with confidence in results

    • Important for comparative demography and "big data" analyses
  3. Integration (hehe) with Com(p)adre. This is much more long term goal, but it doesn't hurt to start thinking about it now.

With these goals in mind, I've created a version of an SQL data base on my local machine (too poor to buy my own server at the moment, but hopefully will have a shareable version soon) that follows the Abbreviated schema. SQL seems like a good choice to keep the data base itself language-agnostic (at least from the end user's perspective). There are well maintained packages that enable one to read from and write to SQL data bases from R, Python, Julia, and probably a bunch of other languages too.

Nearly all of the data in this data base is stored as strings, with the notable exception of the actual parameter values and a couple fields in the metadata. I suggest reading about the data types before trying to enter data as it will show you which fields are required and which are optional. You can see how I've set them here.

My next goal for this Wiki is to put together a detailed accounting of how each column of the Excel data entry template should be filled in. However, I'd also like to wait until after the first iteration of the data entry process to make sure that page includes some of the lessons learned from the exercise.

Clone this wiki locally