Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What parameters best resmble blastn #54

Closed
ghost opened this issue Nov 8, 2017 · 6 comments
Closed

What parameters best resmble blastn #54

ghost opened this issue Nov 8, 2017 · 6 comments
Labels

Comments

@ghost
Copy link

ghost commented Nov 8, 2017

I built a minimap2 index of the NCBI NT database.

I have a series of metagenomic contigs I want to align against said database. Using blast (megablast), around 40% of my sequences have mappings against the nt reference.

Using minimap2 with -x asm10, I only get ~20% contigs with mappings to my reference.

What parameters in minimap2 would best fit this task? I've tried reducing mismatch and gap penalties without success.

@lh3 lh3 added the question label Nov 8, 2017
@lh3
Copy link
Owner

lh3 commented Nov 8, 2017

What is the exact command line? I would use something close to the default setting like:

minimap2 -c nt.gz contigs.fa

Note that you need to rebuild the index as asm10 uses different -k and -w.

Let me know what you get. Thanks.

@lh3
Copy link
Owner

lh3 commented Nov 8, 2017

PS: asm10 uses -k 19 -w 19. The default uses -k 15 -w 10. It will be much more sensitive to divergent hits. If sensitivity is still an issue, try -k 15 -w 5.

@ghost
Copy link
Author

ghost commented Nov 8, 2017

Command line was:
index building:
minimap2 -x asm10 -d nt.mmi nt.gz

mapping
minimap2 -x asm10 -t 64 -K 10G -N 25 nt.mmi contigs.fa.gz > contigs.paf

Trying with default and -k 15 -w 5

@lh3
Copy link
Owner

lh3 commented Nov 24, 2017

Just curious: how is your experiment going? Can minimap2 achieve acceptable sensitivity?

@ghost
Copy link
Author

ghost commented Nov 24, 2017

Hi,
Have had some parsing problems with downstream visualization and MD tags, but so far I have had excellent results with k15 w5. Much faster and memory efficient than blast, last,lastz, and malt(http://ab.inf.uni-tuebingen.de/data/software/malt/download/welcome.html).

In terms of mappability, results are consistent with short reads. Now I'm checking if taxonomic profiles are also the same. Will let you know asap

@lh3
Copy link
Owner

lh3 commented Jan 18, 2018

I am closing this for now. When you have more information to share, feel free to post new information here or create a new issue. I will be notified either way. Thanks!

@lh3 lh3 closed this as completed Jan 18, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant