No output and "stoi" error with haplotagging data. #5

Closed
ZuyaoLiu opened this issue Mar 17, 2022 · 6 comments
Labels
bug Something isn't working help wanted Extra attention is needed

Comments

@ZuyaoLiu

Hi Pierre,

I'm using haplotagging data with LRez and LEVIATHAN.
However, I always get this error when trying to index the BAM files.
[Image: WeChat screenshot 20220317173942]
The BAM file looks like this:
[Image: WeChat screenshot 20220317174118]

Any idea about it?

Thank you !!

@ZuyaoLiu
Author

I just found out that the error disappears if I process only one chromosome, but then LEVIATHAN detects no SVs.
If I process all chromosomes together, the error shows up again.

@clemaitre
Collaborator

Hi,

I guess the problem comes from the BX tag in your BAM file. LRez may not recognize the data as haplotagging because the BX tag ends with "-1"; instead, it assumes 10x Genomics technology and expects the barcode to be an ACGT word.
We assumed the "-1" suffix at the end of the barcode was specific to 10x Genomics data, which seems not to be the case. Is the "-1" suffix also present in your FASTQ files, or did the mapper add it? By the way, which mapper did you use to produce your BAM file?

Claire
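
The ambiguity described above can be sketched in a few lines of Python. This is only an illustration, not LRez's actual code (LRez is written in C++): the function name `classify_barcode` is hypothetical, and the haplotagging pattern `AxxCxxBxxDxx` is an assumption about the barcode format, since the screenshot of the BAM file is not reproduced here. The idea is to decide the technology from the barcode itself after setting aside any trailing "-1"-style suffix.

```python
import re

# Assumed barcode shapes (illustrative only; not taken from LRez's source):
#   10x Genomics: a nucleotide word, often with a "-1" suffix, e.g. ACGTACGTACGTACGT-1
#   haplotagging: four labelled segments, e.g. A01C02B03D04
TENX_RE = re.compile(r"^[ACGTN]+$")
HAPLOTAG_RE = re.compile(r"^A\d{2}C\d{2}B\d{2}D\d{2}$")


def classify_barcode(bx: str) -> str:
    """Classify a BX tag value, ignoring a trailing "-<digits>" suffix."""
    core = re.sub(r"-\d+$", "", bx)  # drop e.g. "-1" before looking at the barcode
    if HAPLOTAG_RE.match(core):
        return "haplotagging"
    if TENX_RE.match(core):
        return "10x"
    return "unknown"
```

With this kind of check, a suffix added by the mapper would not by itself cause a haplotagging barcode to be treated as a nucleotide word.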

@ZuyaoLiu
Author

Hi clemaitre,

I aligned the reads to the reference using EMA. To do this, I manually converted the haplotagging barcodes to 16 bp sequences and ran the mapping. After that, I reverted the BX tags in the BAM files back to haplotagging barcodes. So the "-1" comes from the EMA mapper.
I just tried removing the "-1" with "samtools view -h ../1_map/MA_fem_13.BXnum.bam |perl -pe "s/-1\t/\t/" |samtools view -bS - -o MA_fem_13.new.bam", but the error was still there.

Thank you.
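
As a side note on the substitution above: "s/-1\t/\t/" replaces the first "-1" followed by a tab on each line, whichever field it occurs in, and it does not match when the BX tag is the last field on the line (where it is followed by a newline rather than a tab). A more targeted variant could look like the following Python sketch, which only touches the BX:Z: field and leaves header lines alone. The function name `strip_bx_suffix` is hypothetical, and the sketch assumes plain-text SAM lines as produced by "samtools view -h".

```python
import re

# Matches a BX:Z: field whose value ends in "-1", whether the field is
# followed by a tab or is the last field on the line.
BX_SUFFIX_RE = re.compile(r"(\tBX:Z:[^\t\n]*?)-1(?=\t|\n|$)")


def strip_bx_suffix(sam_line: str) -> str:
    """Remove a trailing "-1" from the BX tag of one SAM record line."""
    if sam_line.startswith("@"):
        return sam_line  # header lines are passed through unchanged
    return BX_SUFFIX_RE.sub(r"\1", sam_line, count=1)
```

Applied to every line read from standard input, such a function could sit between the two samtools view calls in place of the perl step.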

@clemaitre
Collaborator

Thank you @ZuyaoLiu for these helpful details.
It thus seems that the error is not related to the barcode format (even though it is currently a problem for LRez that haplotagging tags may end with "-1")...

Could you give us access to the problematic BAM file to help us debug? (For instance, you can send a download link to claire[dot]lemaitre[at]inria[dot]fr.)

Claire

@anne-gcd anne-gcd added bug Something isn't working help wanted Extra attention is needed labels Mar 18, 2022
@clemaitre
Collaborator

Hi @ZuyaoLiu,

The stoi error you got with LRez has been fixed; you should now be able to index your BAM files using the latest commits (and the "-1" at the end of the barcodes should no longer be a problem either). The stoi error was caused by contigs whose size is smaller than the windowSize (LRez uses windowing to multi-thread the indexing).

A similar piece of code was also present in LEVIATHAN (which also uses windowing, but for SV discovery, and thus had a potentially similar bug if contigSize<windowSize). This has also been fixed in the latest commit of LEVIATHAN. Could you please test on your data and tell us whether it runs without error?

Best,
Claire
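
To illustrate the kind of edge case mentioned above, here is a minimal Python sketch of splitting a contig into fixed-size windows while handling contigs shorter than the window. It is not the code used in LRez or LEVIATHAN (both are C++, and the thread does not show how the stoi failure arose internally); the function name `iter_windows` and the 0-based half-open coordinates are assumptions made for the example.

```python
from typing import Iterator, Tuple


def iter_windows(contig_size: int, window_size: int) -> Iterator[Tuple[int, int]]:
    """Yield 0-based half-open (start, end) windows covering a contig."""
    if window_size <= 0:
        raise ValueError("window_size must be positive")
    for start in range(0, contig_size, window_size):
        # Clamp the end so the last (or only) window never exceeds the contig.
        yield start, min(start + window_size, contig_size)
```

A contig of 40 bp with a window size of 100 then yields the single window (0, 40) instead of a region that runs past the end of the contig.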

@ZuyaoLiu
Author

ZuyaoLiu commented Mar 28, 2022 via email


3 participants