Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VincentC_2016 Antibiotics Metadata do not match published paper #68

Closed
cmirzayi opened this issue May 27, 2022 · 3 comments
Closed

VincentC_2016 Antibiotics Metadata do not match published paper #68

cmirzayi opened this issue May 27, 2022 · 3 comments
Assignees

Comments

@cmirzayi
Copy link
Contributor

antibiotics_family and antibiotics_current_use appear to be wrong for many of the samples for the VincentC_2016 study.

All 229 samples are coded as yes for antibiotics_current_use but that is not true according to the original paper. A handful of participants never received antibiotics over the course of the study and many did not begin the study on antibiotics.

Additionally there are issues with the classes of antibiotics. For instance, the original paper states that 81 samples in the control group alone were exposed to cephalosporins and yet the data in cMD seem to only have 5 samples with cephalosporins exposure:

                                                                                  study_name
antibiotics_family                                                                 VincentC_2016
  aminoglycosides;beta_lactamase_inhibitors;laxatives;penicillin                               1
  beta_lactamase_inhibitors;carbapenems;cephalosporins;fluoroquinolones;penicillin             2
  beta_lactamase_inhibitors;carbapenems;cephalosporins;penicillin                              1
  beta_lactamase_inhibitors;carbapenems;fluoroquinolones;penicillin                            2
  beta_lactamase_inhibitors;macrolides;penicillin                                              2
  beta_lactamase_inhibitors;penicillin                                                         6
  carbapenems                                                                                  4
  carbapenems;fluoroquinolones                                                                 7
  carbapenems;fluoroquinolones;laxatives                                                       1
  carbapenems;laxatives                                                                        1
  cephalosporins                                                                               2
  fluoroquinolones                                                                             2
  macrolides                                                                                   2

The culprit here seems to be the incomplete metadata available on SRA, but the numbers on SRA also don't seem to quite match what's in cMD either--the SRA data for instance list 4 unique samples with cephalosporins exposure.

Where did the data for antibiotics_family come from?

I've already emailed the corresponding author of the original paper to see if she can provide some clarifications/a more comprehensive metadata table but haven't heard anything back yet. However, it would be helpful to have someone more familiar with the curation of these data look into this as well.

@cmirzayi
Copy link
Contributor Author

I have heard back from the corresponding author and the corrected metadata and accompanying data dictionary are attached.
data_dictionary_09062022.xlsx
jgh_09062022.csv

@paolinomanghi
Copy link
Collaborator

paolinomanghi commented Oct 24, 2022 via email

@azenuser
Copy link
Collaborator

This is solved, I'm closing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants