-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Develop Pacbio for NMDC submission #413
Comments
See above issue, the slots that are different for metagenome - long reads have different requirememnts for metagenome - short reads. To accomplish this check for JGI sample submission, the long and short reads will be split apart, in the "multi omics" data selection, if metaG is selected, additional check boxes will appear for long or short reads. In the metadata file, long and short reads will be added to the analysis type option when selected, those assigned long will appear in a "JGI Metagenomics - Long Reads" template tab... and those assigned to short will appear in a "JGI Metagenomics - Short Reads" tab. @pkalita-lbl when do you think we can work on this? I think I or @bmeluch can make the updates to add the interface and the requirements updates? |
@mslarae13 can I turn the question around and ask when do we need to have this done? |
At least questions are in progress. Moving to next sprint. @mslarae13 let me know if this should be in the backlog instead. |
We should do this as part of the expansion / updates to the submission portal interface. I think this rolls into the the updating tabs task |
In schema, pacbio instrument will capture that it's long reads. |
Decided to separate out long and short reads for metaGs at step 4, Multi-Omics data (for JGI), and on the analysis slot. When a user selects metaG they can choose long or short read. |
Functionality on the submission portal is great and works with no issues It was previously asked by Mark if the dna_slot and rna_slot (s) could beconsolidated to just a single slot. If sample 1 has long and short read data, don't we have the same issue? |
🤦🏻 🤦🏻 🤦🏻 Yes, you're absolutely right. That's my bad for not thinking of that. I'll make a new issue to deal with that. In the meantime, it doesn't really hurt anything to collect data like this in the submission portal. But if we get any submissions with data like that, we'll just need to hold off on bringing that data into Mongo until the issue is resolved. EDIT: Here's the new issue microbiomedata/nmdc-schema#1937 |
Thanks @pkalita-lbl ! Alicia, of these DNA vs RNA slots that are JGI specific... do we need to store any of them in NMDC/mongo? Or can they be considered "submission portal & UF specific"? See the slots in MGInterface in submission schema : https://github.com/microbiomedata/submission-schema/blob/0b9413915f63bd7fa9be70f32061db49dc422009/src/nmdc_submission_schema/schema/nmdc_submission_schema.yaml#L34798 |
@aclum can you respond to this when you get a chance? |
I would like to keep dna_isolate_meth and map it to a slot on NMDC's Extraction class. However in looking at those slots we've conflated extraction target and how the extraction was done into one permissible value. If we were to store the values JGI has we'd need to just allow this to be a string b/c JGI doesn't place any CV on this so this needs further discussion with @turbomam |
Make short and long read, deal with mapping the 1 field we care about back later. |
Schema change, post berk. |
Deliverable this task is associated with
See Deliverables tab here: https://docs.google.com/spreadsheets/d/1z_b6WbuTk4pI0Q-Z-rfCgC-8R3m3F2_JDevYuK8CjYE/edit?usp=sharing
RACI
Tag people in their roles
Describe the the task?
Criteria for completion
Estimate people time
Completion Date (Goal)
Oct 20Target Sprint Start & End Dates
Oct 20Feb 23Tag Blocker/Contingent upon isues
The text was updated successfully, but these errors were encountered: