Develop Pacbio for NMDC submission #413

mslarae13 · 2023-09-06T18:45:12Z

Deliverable this task is associated with

See Deliverables tab here: https://docs.google.com/spreadsheets/d/1z_b6WbuTk4pI0Q-Z-rfCgC-8R3m3F2_JDevYuK8CjYE/edit?usp=sharing

3

RACI

Tag people in their roles

Responsible: Montana
Accountable:
Consulted: @aclum , @emileyfadrosh
Informed:

Describe the the task?

Criteria for completion

Users can submit metadata via NMDC for JGI Pacbio analysis

Estimate people time

8

Completion Date (Goal)

~~Oct 20~~
Rescheduled, Feb 23rd

Target Sprint Start & End Dates

Start: Sept 11
End: ~~Oct 20~~ Feb 23

Tag Blocker/Contingent upon isues

[Tagg issues]

mslarae13 · 2023-12-09T00:16:29Z

See above issue, the slots that are different for metagenome - long reads have different requirememnts for metagenome - short reads.

To accomplish this check for JGI sample submission, the long and short reads will be split apart, in the "multi omics" data selection, if metaG is selected, additional check boxes will appear for long or short reads.

In the metadata file, long and short reads will be added to the analysis type option

when selected, those assigned long will appear in a "JGI Metagenomics - Long Reads" template tab... and those assigned to short will appear in a "JGI Metagenomics - Short Reads" tab.

@pkalita-lbl when do you think we can work on this? I think I or @bmeluch can make the updates to add the interface and the requirements updates?

pkalita-lbl · 2023-12-11T17:13:04Z

@mslarae13 can I turn the question around and ask when do we need to have this done?

ssarrafan · 2023-12-15T21:48:28Z

At least questions are in progress. Moving to next sprint. @mslarae13 let me know if this should be in the backlog instead.

mslarae13 · 2023-12-27T19:17:20Z

We should do this as part of the expansion / updates to the submission portal interface.
See #433

I think this rolls into the the updating tabs task

mslarae13 · 2024-01-04T21:56:27Z

In schema, pacbio instrument will capture that it's long reads.

mslarae13 · 2024-01-18T18:43:51Z

Decided to separate out long and short reads for metaGs at step 4, Multi-Omics data (for JGI), and on the analysis slot. When a user selects metaG they can choose long or short read.

mslarae13 · 2024-04-19T23:45:01Z

@pkalita-lbl

Functionality on the submission portal is great and works with no issues
I did have a realization / question about a potential problem

It was previously asked by Mark if the dna_slot and rna_slot (s) could beconsolidated to just a single slot.
You concluded here that no, because the data goes into mongo associated with a single biosample.

If sample 1 has long and short read data, don't we have the same issue?

pkalita-lbl · 2024-04-22T16:09:56Z

🤦🏻 🤦🏻 🤦🏻

Yes, you're absolutely right. That's my bad for not thinking of that. I'll make a new issue to deal with that.

In the meantime, it doesn't really hurt anything to collect data like this in the submission portal. But if we get any submissions with data like that, we'll just need to hold off on bringing that data into Mongo until the issue is resolved.

EDIT: Here's the new issue microbiomedata/nmdc-schema#1937

mslarae13 · 2024-04-24T17:23:47Z

Thanks @pkalita-lbl !
Let's check with @aclum

Alicia, of these DNA vs RNA slots that are JGI specific... do we need to store any of them in NMDC/mongo? Or can they be considered "submission portal & UF specific"?

See the slots in MGInterface in submission schema : https://github.com/microbiomedata/submission-schema/blob/0b9413915f63bd7fa9be70f32061db49dc422009/src/nmdc_submission_schema/schema/nmdc_submission_schema.yaml#L34798

ssarrafan · 2024-05-06T18:03:01Z

Thanks @pkalita-lbl ! Let's check with @aclum

Alicia, of these DNA vs RNA slots that are JGI specific... do we need to store any of them in NMDC/mongo? Or can they be considered "submission portal & UF specific"?

See the slots in MGInterface in submission schema : https://github.com/microbiomedata/submission-schema/blob/0b9413915f63bd7fa9be70f32061db49dc422009/src/nmdc_submission_schema/schema/nmdc_submission_schema.yaml#L34798

@aclum can you respond to this when you get a chance?
I'll remove this from the sprint and add backlog label since it hasn't been updated for 2 weeks.

aclum · 2024-05-06T18:21:39Z

I would like to keep dna_isolate_meth and map it to a slot on NMDC's Extraction class. However in looking at those slots we've conflated extraction target and how the extraction was done into one permissible value. If we were to store the values JGI has we'd need to just allow this to be a string b/c JGI doesn't place any CV on this so this needs further discussion with @turbomam

mslarae13 · 2024-06-12T20:58:49Z

we've conflated extraction target and how the extraction was done into one permissible value.
Fixed in nmdc-schema and merged into berk-schema.

Make short and long read, deal with mapping the 1 field we care about back later.

mslarae13 · 2024-06-17T16:33:03Z

Schema change, post berk.

mslarae13 self-assigned this Sep 6, 2023

bmeluch mentioned this issue Jan 2, 2024

Add "JGI - metagenomics - long read" class to submission schema microbiomedata/submission-schema#168

Closed

mslarae13 assigned pkalita-lbl and unassigned mslarae13 Feb 1, 2024

mslarae13 self-assigned this Apr 19, 2024

pkalita-lbl mentioned this issue Apr 22, 2024

Add slots to Biosample class to disambiguate standard MG metadata vs long-read MG metadata microbiomedata/nmdc-schema#1937

Open

ssarrafan added the backlog label May 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Develop Pacbio for NMDC submission #413

Develop Pacbio for NMDC submission #413

mslarae13 commented Sep 6, 2023 •

edited

Loading

mslarae13 commented Dec 9, 2023

pkalita-lbl commented Dec 11, 2023

ssarrafan commented Dec 15, 2023

mslarae13 commented Dec 27, 2023

mslarae13 commented Jan 4, 2024

mslarae13 commented Jan 18, 2024

mslarae13 commented Apr 19, 2024

pkalita-lbl commented Apr 22, 2024 •

edited

Loading

mslarae13 commented Apr 24, 2024

ssarrafan commented May 6, 2024

aclum commented May 6, 2024

mslarae13 commented Jun 12, 2024

mslarae13 commented Jun 17, 2024

Develop Pacbio for NMDC submission #413

Develop Pacbio for NMDC submission #413

Comments

mslarae13 commented Sep 6, 2023 • edited Loading

mslarae13 commented Dec 9, 2023

pkalita-lbl commented Dec 11, 2023

ssarrafan commented Dec 15, 2023

mslarae13 commented Dec 27, 2023

mslarae13 commented Jan 4, 2024

mslarae13 commented Jan 18, 2024

mslarae13 commented Apr 19, 2024

pkalita-lbl commented Apr 22, 2024 • edited Loading

mslarae13 commented Apr 24, 2024

ssarrafan commented May 6, 2024

aclum commented May 6, 2024

mslarae13 commented Jun 12, 2024

mslarae13 commented Jun 17, 2024

mslarae13 commented Sep 6, 2023 •

edited

Loading

pkalita-lbl commented Apr 22, 2024 •

edited

Loading