Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data Responsibility & Sharing Guidelines #6

Open
Nolski opened this issue Jul 29, 2019 · 1 comment

Comments

@Nolski
Copy link
Collaborator

commented Jul 29, 2019

Summary

Many of our AI focused cohorts are unsure as to how/whether they should release their training datasets along-side their machine learning code.

Details

The UN OCHA has provided great guidance on releasing data, determining sensitivity, etc. The main guidance document can be found here:

https://centre.humdata.org/wp-content/uploads/2019/03/OCHA-DR-Guidelines-working-draft-032019.pdf

I have contacts at HDX which might serve as a the perfect platform to release their data on.

Outcome

An initial document or perhaps roadmap template for assessing & releasing a data set online.

@Nolski

This comment has been minimized.

Copy link
Collaborator Author

commented Jul 29, 2019

UNICEF has shown interest in their mentors assigning a number of "homework" assignments. Below is a few ideas of some assignments which can be done.

Homework Assignments

Phase 1

Data Ecosystem Map: This is to document where their data is coming from, what's using it, stakeholders, etc. The benefit here is not just helping organize how data collection will work in production but also projecting partnerships which may need to be formed in the future.

Information Sharing Protocol: This document is all about conducting an initial assessment of the sensitivity of information you are collecting, who you want to share it with, and how. It's fairly well thought out but is worth a review and potentially some edits. If you are in the EU, it might just be better to do a Data Protection Impact Assessment

Phase 2

Determine Dataset Structure: As you create new data sets, update them, and remove old datasets, you should follow a easy to follow protocol for keeping track of your various data sets as they're updated and deprecated.

:

@Nolski Nolski moved this from To do to In progress in LibreCorps Master Tracker Aug 4, 2019

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
1 participant
You can’t perform that action at this time.