Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Nesoni clip: is discarding sequence descriptions #3

Closed
tseemann opened this issue Aug 26, 2012 · 1 comment
Closed

Nesoni clip: is discarding sequence descriptions #3

tseemann opened this issue Aug 26, 2012 · 1 comment
Assignees
Labels

Comments

@tseemann
Copy link
Member

Illumina now produces FASTQ without the /1 and /2, and instead uses a fasta description (ID, space, DESC).

@M00855:4:000000000-A16FH:1:1101:14529:1450 2:N:0:1
TGGGCAGCAGCGACTTCTGCCACAGTGTCGGTGACATGCCAAACGGTGGGT

When passing through the clip: tool, the DESC is discarde

d@M00855:4:000000000-A16FH:1:1101:14529:1450
GGGCAGCCTCAGCGCCCCGATGGGCGGAATGGGCCTGTCGGGCGT

ie. the "2:N:0:1" is missing.

@ghost ghost assigned pfh Aug 26, 2012
@pfh
Copy link
Member

pfh commented Aug 29, 2012

Description will now be retained by "clip:".

However, many tools in nesoni and outside of nesoni expect reads to be uniquely named (unless they are explicitly treated as pairs).

nesoni.io.check_name_uniqueness can be used to check input. This is currently used by "shrimp:" and now also by "clip:".

@pfh pfh closed this as completed Aug 29, 2012
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants