Improvement Description
There needs to be a way to import non-representative quality-filtered FASTA files which are somehow associated with their sample IDs, as brought up in our discussion on revamping the import tutorial (PR#358 in docs).
I imagine the trickiest part of this will be figuring out how to demultiplex samples (if they aren't already demultiplexed). It will probably require a specific format for each sequence header, with the sample ID in a fixed position (probably with a delimiter of some sort separating it from other info). This will be annoying/hard because I imagine there isn't really a standard way for these files to be formatted (especially if they were acquired from previously published studies).
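To make the header idea concrete, here is a rough sketch of what header-based demultiplexing could look like. It assumes a hypothetical header convention of `<sample-id><delimiter><read-id>`, optionally followed by whitespace and other info, with `_` as the default delimiter. The function names and the convention itself are illustrative assumptions, not an existing QIIME 2 format or API.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)  # multi-line sequences are joined
    if header is not None:
        yield header, "".join(chunks)


def sample_id_from_header(header: str, delimiter: str = "_") -> str:
    """Return the sample ID from a header.

    Assumed (hypothetical) layout: '<sample-id><delimiter><read-id> [other info]'.
    The split happens at the LAST delimiter, so sample IDs may themselves
    contain the delimiter.
    """
    fields = header.split()
    if not fields:
        raise ValueError("empty FASTA header")
    sample_id, sep, _read_id = fields[0].rpartition(delimiter)
    if not sep or not sample_id:
        raise ValueError(f"no sample ID found in header: {header!r}")
    return sample_id


def group_by_sample(
    lines: Iterable[str], delimiter: str = "_"
) -> Dict[str, List[Tuple[str, str]]]:
    """Group (header, sequence) records by the sample ID in each header."""
    groups: Dict[str, List[Tuple[str, str]]] = defaultdict(list)
    for header, seq in parse_fasta(lines):
        groups[sample_id_from_header(header, delimiter)].append((header, seq))
    return dict(groups)
```

Raising an error on headers that don't match the convention seems safer than guessing, given how varied files from published studies are likely to be.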
Proposed Behavior
A potentially easier place to start would be to allow importing of demultiplexed FASTA files (i.e. one FASTA file per sample). This is probably sufficient for most people's needs: users would likely need to do some file manipulation and wrangling to make a non-demultiplexed file fit QIIME 2 format specifications anyway, so if they're doing that they might as well split it into separate files. Not sure, up for discussion!
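For reference, the kind of wrangling a user might do to get one FASTA file per sample could look roughly like the sketch below. It assumes headers of the hypothetical form `<sample-id>_<read-id>` and writes `<sample-id>.fasta` files; the function name, header convention, and output naming are all assumptions made for illustration, not a proposed QIIME 2 format.

```python
from pathlib import Path
from typing import Dict, Optional, TextIO


def split_fasta_by_sample(fasta_path, out_dir, delimiter: str = "_") -> Dict[str, int]:
    """Write one '<sample-id>.fasta' per sample; return {sample_id: record count}.

    Assumes (hypothetically) that each header starts with
    '<sample-id><delimiter><read-id>' and that sample IDs are safe file names.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    handles: Dict[str, TextIO] = {}
    counts: Dict[str, int] = {}
    current: Optional[TextIO] = None
    try:
        with open(fasta_path) as fasta:
            for line in fasta:
                if not line.strip():
                    continue  # ignore blank lines
                if line.startswith(">"):
                    fields = line[1:].split()
                    label = fields[0] if fields else ""
                    sample_id, sep, _read_id = label.rpartition(delimiter)
                    if not sep or not sample_id:
                        raise ValueError(f"no sample ID in header: {line.strip()!r}")
                    if sample_id not in handles:
                        handles[sample_id] = open(out_dir / f"{sample_id}.fasta", "w")
                        counts[sample_id] = 0
                    current = handles[sample_id]
                    counts[sample_id] += 1
                if current is not None:
                    # Normalize line endings so the last record ends with a newline.
                    current.write(line.rstrip("\r\n") + "\n")
    finally:
        for handle in handles.values():
            handle.close()
    return counts
```

Usage would be something like `split_fasta_by_sample("seqs.fasta", "per-sample/")`. Note that this keeps one open file handle per sample, so a study with a very large number of samples could hit the operating system's open-file limit; that trade-off was chosen only to keep the sketch short.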
References
discussion on revamping the import tutorial (PR#358 in docs)