Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: window-length pros and cons #84

Open
JohnUrban opened this issue Aug 21, 2018 · 3 comments
Open

Question: window-length pros and cons #84

JohnUrban opened this issue Aug 21, 2018 · 3 comments

Comments

@JohnUrban
Copy link

Hi,

RaCon is great. I was wondering if there were pros and cons to the --window-length parameter, which is 500bp by default. I tested 5kb -- so I know it gets a lot slower with length, but I was hoping there would also be some arguments that favor using a longer window such as 5kb.

Also, between minimap2 and RaCon there is clearly an ultra-fast gap-filling solution. I am toying around w/ the 2 tools in a kind of clunky way to do so. Do you have any recommendations for that?

Best,

John

@rvaser
Copy link
Collaborator

rvaser commented Aug 22, 2018

Hi John,
longer window lengths should yield a bit higher accuracy, although there is no guarantee (I have a comparison somewhere, will update the comment next week). The speed downgrades more when the length of the longest sequence in a window multiplied with the maximal absolute value from the alignment scoring matrix exceeds the size of 16bit signed integer (~32k). 32bit integers are then used instead to avoid overflows in SIMD instructions. You can try window length 1500 with default scoring parameters.

If I understood correctly, you are trying to use minimap2 and racon to fill gaps of unfinished assemblies with TGS data? Or is it something else?

Best regards,
Robert

@JohnUrban
Copy link
Author

Hi Robert,

Thanks for the tips.

As for gap-filling -- yes I was able to accidentally fill small gaps using minimap2 and RaCon. It then occurred to me that Minimap2 and RaCon could be combined to have the same functionality (I think) as the commonly-used PBJelly2 gap-filler. Heck -- PBJelly2 could probably be updated w/ these 2 programs b/c now it uses BLASR and PBDagCon.

Best,

John

@rvaser
Copy link
Collaborator

rvaser commented Sep 13, 2018

If miminap2 was able to map a whole read over a short gap (without break points), then racon will probably fill the gap. Otherwise, you have to fill the gap with any reads before polishing so that other reads can map to it.

Best regards,
Robert

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants