Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Aligned Query Seq/Subject Seq to M8 output? #127

Closed
BGemler opened this issue Apr 1, 2019 · 7 comments
Closed

Add Aligned Query Seq/Subject Seq to M8 output? #127

BGemler opened this issue Apr 1, 2019 · 7 comments
Milestone

Comments

@BGemler
Copy link

BGemler commented Apr 1, 2019

Would it be possible to add an option to --outfmt (for .m8/.m9 output file extensions) for searchp and searchn to return the aligned portions of the query sequence/subject sequence?

Thank you

@h-2
Copy link
Member

h-2 commented Apr 3, 2019

In theory that's possible, however development focus has switched to making the new version of lambda: lambda3.

I will see how much work this feature would be for lambda2, though.

@BGemler
Copy link
Author

BGemler commented Apr 3, 2019

Understood. Thank you!

@BGemler
Copy link
Author

BGemler commented Jan 30, 2020

Hello,

Following up on the above item to see if it's in development for Lambda3. We're processing large amounts of data in pairwise format because we need the aligned portions of the query/subject sequences (including gaps when applicable) - if it would be possible to include these in the tabular output in Lambda3 it would be awesome!

@h-2 h-2 added this to the lambda3 milestone Aug 15, 2022
@h-2
Copy link
Member

h-2 commented Aug 15, 2022

We would need to get changes into SeqAn2 for this. I am not sure that is still possible, but I will investigate!

@BGemler
Copy link
Author

BGemler commented Feb 14, 2023

Hey @h-2 - saw some activity on this repo! Are there any updates on when LAMBDA3 will be released?

@h-2
Copy link
Member

h-2 commented Feb 14, 2023

Hi! Yes, there is quite a bit of development and many improvements and new features on the horizon. I don't have an exact date, yet, but the protein mode on the lambda3 branch is already fairly stable.

Regarding this feature request: We will still be using the SeqAn2 code for writing the output files. This means that we will not be able to do major changes to the Blast output, unfortunately.

I currently see the following options:

  1. Use .m0 and parse the strings from that. Not ideal, because it is an ugly format, but possible.
  2. Use the positions printed in .m8 and extract manually. I guess, you are doing something similar right now, but it is not very ergonomic of course.
  3. Use SAM or BAM output. They currently already offer printing the matching query sequence (SEQ field / qs field) + the hard-clip option. Interfaces for this are more modular, and we would be able to add an option for also printing the subject sequence if this is important to you!

@h-2
Copy link
Member

h-2 commented Jul 22, 2024

Lambda3 was released earlier this year. Closing this issue for now.

@h-2 h-2 closed this as completed Jul 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants