Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Chromosome name in FindPeaks // Help with Output #52

Closed
Romeo1-1 opened this issue Jul 21, 2022 · 3 comments
Closed

Chromosome name in FindPeaks // Help with Output #52

Romeo1-1 opened this issue Jul 21, 2022 · 3 comments

Comments

@Romeo1-1
Copy link

Hello, thank you for this package!

I encountered a problem during the FindPeaks part. In my bam file, my chromosomes don't have the "chr" prefix.

FindPeaks(output.file = "Sierra/P1832_peaks.txt",
          gtf.file = "genes.gtf",
          bamfile = "test.bam",
          junctions.file = "test_.bed",
          ncores = 20)

image

I managed to find a way around the problem by changing the names in my bam files using samtools & then re-indexing. This requires 2 extra step and I saw that there is a "chr_name" option. But I tried different combinations, like chr.names = c("1","2","3","4","5","6","7","8","9","10","11","12","13","14","15","16","17","18","19","20","21","22","X","Y", "M") but got no luck.

Do you have any idea that could help me ? Thank you very much in advance,

Rémi

@Romeo1-1 Romeo1-1 changed the title Chromosome name in FindPeaks Chromosome name in FindPeaks // Help with Output Jul 23, 2022
@Romeo1-1
Copy link
Author

Hello,

Since I have already opened an issue, I will ask my question here.

To give some context, I am comparing 2 samples from the same patient, before and after treatment (at diagnosis/relapse). I was able to use the package and adapt the vignette to my own data.

I have a question about the "PlotCoverage" graph. Is there any way to tell exactly if a specific alternative transcript is present or not?

image

Here for example, I am very interested in the ENST000000424814 transcript (with spliced exon 10). But the peaks don't really allow to differentiate all these transcripts.

Thanks a lot in advance,

@rj-patrick
Copy link
Contributor

Hi Romeo,

Regarding the first issue, the easiest thing if possible is to use the same GTF file that was used to generate the BAM file. Did you align the data yourself or did you acquire the BAM file as is?

Regarding the second issue, unfortunately Sierra does not have ability to evaluate specific transcripts. However, you might be able to check in IGV whether there are spliced reads skipping that exon of interest.

Cheers,
Ralph

@Romeo1-1
Copy link
Author

Romeo1-1 commented Aug 5, 2022

Hello @reprobate , sorry for the late answer ! By using the exact same GTF file I got rid of the issue ! I was able to see alternative splicing on IGV but I hoped to be able to quantify one specific transcrit.

Thank you for your help,

Remi

@Romeo1-1 Romeo1-1 closed this as completed Aug 5, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants