Support FASTA files which contain all reads per sample (i.e. not just representative seqs) #197

cduvallet · 2018-11-05T02:09:16Z

Improvement Description
There needs to be a way to import non-representative quality-filtered FASTA files which are somehow associated with their sample IDs, as brought up in our discussion on revamping the import tutorial (PR#358 in docs).

I imagine the trickiest part of this will be figuring out how to de-multiplex samples (if they aren't already). It will probably require a specific format for each sequence header, with the sample ID in a certain spot (probably with a delimiter of some sort separating it from other info). This will be annoying/hard because I imagine there isn't really a standard way for these files to be formatted (especially if they were acquired from previously published studies).

Proposed Behavior
A potentially easier to place to start would be to allow for importing of demultiplexed FASTA files (i.e. one FASTA file per sample). This is probably sufficient for most people's needs, actually -- users would likely need to do some file manipulations and wrangling to make the non-demultiplexed file fit QIIME 2 format specifications, so if they're doing that they might as well just split them into separate files. Not sure, up for discussion!

References
discussion on revamping the import tutorial (PR#358 in docs)

cduvallet mentioned this issue Nov 5, 2018

Reorganize and clarify import tutorial qiime2/docs#358

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support FASTA files which contain all reads per sample (i.e. not just representative seqs) #197

Support FASTA files which contain all reads per sample (i.e. not just representative seqs) #197

cduvallet commented Nov 5, 2018 •

edited by Mestabrook3

Loading

Support FASTA files which contain all reads per sample (i.e. not just representative seqs) #197

Support FASTA files which contain all reads per sample (i.e. not just representative seqs) #197

Comments

cduvallet commented Nov 5, 2018 • edited by Mestabrook3 Loading

cduvallet commented Nov 5, 2018 •

edited by Mestabrook3

Loading