Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

problem of using subseq #5

Closed
ymwur opened this issue Feb 5, 2013 · 1 comment
Closed

problem of using subseq #5

ymwur opened this issue Feb 5, 2013 · 1 comment

Comments

@ymwur
Copy link

ymwur commented Feb 5, 2013

Hi,
I try to extract sequences from a fastq file using the "subseq" in seqtk.
But the extract file contains only the 1st sequence but no others.
I am wondering whether my name.lst file does not fit with what seqtk needs.
I have names of each sequence without other symbols each line in the name.lst. But the fastq file starts each sequence name with a @. Should I add @ in front of each sequence name?
Or what other problem it can be?

Any suggestion is welcome.
Thanks,

Chih-Ming

@vsbuffalo
Copy link
Contributor

Hi Chih-Ming,

I am not the maintainer of seqtk, so take my advice with a grain of salt. subseq does not require '@'s before entries. See this example for usage:

$ echo ">test-1 test\\nGATCTATATGCT\\n>test-2\\nAGTATCATGAGCT\\n>test-3\\nGATTCTTATGCGG" > seqs.fa
$ cat seqs.fa
>test-1 test
GATCTATATGCT
>test-2
AGTATCATGAGCT
>test-3
GATTCTTATGCGG
$ echo "test-1\\ntest-3" > names.lst
$ cat names.lst
test-1
test-3
$ seqtk subseq seqs.fa names.lst
>test-1
GATCTATATGCT
>test-3
GATTCTTATGCGG

Hope this helps.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants