Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Public release of underlying survey data & analysis #20

Open
3 of 6 tasks
Tracked by #9
harisood opened this issue May 17, 2023 · 12 comments
Open
3 of 6 tasks
Tracked by #9

Public release of underlying survey data & analysis #20

harisood opened this issue May 17, 2023 · 12 comments
Assignees
Labels
WP1 Work package 1 work

Comments

@harisood
Copy link
Member

Summary of issue

At some point soon we need to release a public version of the survey data, accompanied by reproducible code with the analysis we have run on the responses.

This needs to be done carefully to ensure any public release is as safe as possible

What needs to be done?

  • Determine what of the results we can/can't publicly release
  • Prepare a public facing version of the results for release
  • Prepare a public facing notebook with our analytical methods

Who can help?

Issue checklist

  • I have assigned the appropriate work package label to this issue, and added any other relevant labels
  • I have added this to the SATRE backlog (public) project board
  • I have assigned this issue to at least one person on the SATRE team
@manics
Copy link
Member

manics commented Jun 1, 2023

How does this sound for the initial release?

  • for each categorical question give counts of the responses
  • roles: one person organises these into the high-level stakeholder groups (users, operators, builders, strategy, other), provide summary counts for each of these categories
  • institutions: one person organises these into a few classes (e.g. universities, government, other public sector, private sector, whatever the person doing the work thinks is appropriate), provide summary counts for each of these categories
  • other free text answers: either omit from the first version, or one person reads through and writes a summary paragraph.

In future we can consider releasing anonymised individual responses so you can look for correlations, but this is higher risk and will take longer to assess or anonymise, so should be done in a future issue.

@JimMadge
Copy link
Member

JimMadge commented Jun 2, 2023

@manics's plan sounds good to me.

The only thing I would add is to do this for the free text questions where there are clear categories produce counts like with the likert questions. I would hope that the function in that notebook should do most of the heavy lifting.

@harisood
Copy link
Member Author

harisood commented Jun 5, 2023

Agreed, do we want to set a date for when to have this ready by? And maybe assigning responsibility?

@manics
Copy link
Member

manics commented Jun 6, 2023

I can work on:

  • categorical questions
  • roles
  • institutions

next week

I'm happy to look at the free text answers but that'll take longer, or someone else could look at them?

@JimMadge
Copy link
Member

JimMadge commented Jun 6, 2023

I wrote the free text -> categories function, so I'm happy to do that.

I would like to have someone else pass their eyes over that though. To make sure the code is doing what we think it should, and to make sure that the choices we make (e.g. which responses to drop, which words to count or not count, which synonyms to use) are reasonable.

@harisood
Copy link
Member Author

harisood commented Jun 6, 2023

@manics if you have time to do free text as well that'd be huge, otherwise I can try and find some time!

@manics
Copy link
Member

manics commented Jun 12, 2023

I've created a private spreadsheet for splitting roles and institutions into categories which can be made public

@harisood
Copy link
Member Author

Have you had a chance to look at free text?

@manics
Copy link
Member

manics commented Jun 13, 2023

I haven't

@manics
Copy link
Member

manics commented Jun 14, 2023

Foillowing the last WP1 meeting the plan is to create a new spreadsheet containing the raw (row-by-row) survey results, with the following changes:

  • role column is replaced by a role categories from the private spreadsheet
  • institution column is replaced by an institution category from the private spreadsheet
  • categorical (Likert scale) columns are unchanged
  • Simple free text columns which can be automatically parsed into a list of categories (e.g. programing languages) will be replaced by those categories if possible. @JimMadge is checking- do you think we could make the single replacement column contain semi-colon separated alphabetically ordered values e.g. java;python;r so that this can be easily exported as CSV if we want?
  • Complex free text columns which can't be easily summarised row-by-row will be removed, and will be summarised in future

The resulting spreadsheet will be checked for sensitive data, and if there are no problems it will be made public. The JISC online surveys JSON schema for the questionnaire will also be made available.

@JimMadge
Copy link
Member

@manics Yes that looks possible, an example is in the last cell here.

@harisood
Copy link
Member Author

LGTM!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
WP1 Work package 1 work
Projects
Status: 🏗 In progress
Development

No branches or pull requests

4 participants