Identify datasets for potential inclusion in the ODL #28

Daniel-Mietchen · 2018-10-24T00:39:19Z

One way to start looking into this would be to check open resources like

https://github.com/awesomedata/awesome-public-datasets
and see how sustainable/ usable the data are there.

On that basis, we could then decide (see also the inclusion criteria in ODL, as per #18 ) as to whether we'd like to go for datasets scoring high and/or low / average on those scales.

Daniel-Mietchen · 2018-10-25T23:59:00Z

Another potential candidate: http://retractiondatabase.org/ — described by some as "antediluvian".

Daniel-Mietchen · 2018-10-29T13:22:46Z

Another one: https://orcid.org/blog/2018/10/24/2018-public-data-file .

Daniel-Mietchen · 2018-10-31T02:58:22Z

Datasets and code involved in projects for which there is a bug bounty, e.g. https://rubenarslan.github.io/posts/2018-10-26-on-making-mistakes-and-my-bug-bounty-program/ .

Daniel-Mietchen · 2018-11-02T08:00:59Z

allofplos, as per https://github.com/PLOS/allofplos

Daniel-Mietchen · 2018-11-18T07:36:29Z

https://doi.org/10.5061%2Fdryad.n5g39d7 - & mdash; probably the most comprehensive public dataset about Hemimastigophora to date

Daniel-Mietchen · 2018-11-29T15:48:41Z

"Teaching data science with real world datasets"
https://twitter.com/emcandre/status/1068139908836012032

Daniel-Mietchen · 2018-12-15T23:52:49Z

Gaia star catalog data, as per http://sci.esa.int/gaia/60192-gaia-creates-richest-star-map-of-our-galaxy-and-beyond/

Daniel-Mietchen · 2019-01-03T15:39:19Z

Here is some inspiration from the kinds of data and related services hosted at IDigInfo's data portal:

https://idiginfo.org/?q=projects

Daniel-Mietchen added documentation How things (are supposed to) work infrastructure That which will only be noticed if it isn't working discoverability How to discover data or their existence labels Oct 24, 2018

Daniel-Mietchen added this to Needs triage in Reference datasets via automation Oct 24, 2018

Daniel-Mietchen changed the title ~~Identify datasets that may be worth including in the ODL~~ Identify datasets for potential inclusion in the ODL Oct 24, 2018

Daniel-Mietchen mentioned this issue Oct 24, 2018

Identify software, tools and workflows for potential inclusion in the ODL #29

Open

Daniel-Mietchen added this to To do in Public beta via automation Oct 24, 2018

Daniel-Mietchen added this to Needs triage in Private beta via automation Oct 24, 2018

Daniel-Mietchen moved this from Needs triage to High priority in Private beta Oct 24, 2018

Daniel-Mietchen moved this from Needs triage to High priority in Reference datasets Oct 24, 2018

Daniel-Mietchen added the policy Basic rules and guidelines on how the Open Data Lab operates label Oct 24, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Identify datasets for potential inclusion in the ODL #28

Identify datasets for potential inclusion in the ODL #28

Daniel-Mietchen commented Oct 24, 2018

Daniel-Mietchen commented Oct 25, 2018

Daniel-Mietchen commented Oct 29, 2018

Daniel-Mietchen commented Oct 31, 2018

Daniel-Mietchen commented Nov 2, 2018

Daniel-Mietchen commented Nov 18, 2018

Daniel-Mietchen commented Nov 29, 2018

Daniel-Mietchen commented Dec 15, 2018

Daniel-Mietchen commented Jan 3, 2019

Identify datasets for potential inclusion in the ODL #28

Identify datasets for potential inclusion in the ODL #28

Comments

Daniel-Mietchen commented Oct 24, 2018

Daniel-Mietchen commented Oct 25, 2018

Daniel-Mietchen commented Oct 29, 2018

Daniel-Mietchen commented Oct 31, 2018

Daniel-Mietchen commented Nov 2, 2018

Daniel-Mietchen commented Nov 18, 2018

Daniel-Mietchen commented Nov 29, 2018

Daniel-Mietchen commented Dec 15, 2018

Daniel-Mietchen commented Jan 3, 2019