Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

issues with generating splice junction file #60

Open
ckong1806 opened this issue Mar 3, 2023 · 7 comments
Open

issues with generating splice junction file #60

ckong1806 opened this issue Mar 3, 2023 · 7 comments

Comments

@ckong1806
Copy link

I have 3' scRNAseq data from 10X genomics. I ran the data through cellranger so, I have the possorted bam ouput file. I tried to run regtools on the bam file but, the bed output file has ? in the strand column. Regtools wasn't able to detect strandedness since cellranger used STAR for alignment and STAR doesn't infer strandedness. From the vignette in Sierra, it seems you guys also used cellranger output - can you share how were you able to run regtools on your dataset?

@SebastianMHJohn
Copy link

Hi Sierr-Team,
I am having the exact same issue und would appreciate any information.
Best wishes

@rj-patrick
Copy link
Contributor

How have you run RegTools? Unless there's been an update, setting the -s parameter to 1 should work for 10x data. See the commands we use at: https://github.com/VCCRI/Sierra/wiki/Sierra-Vignette#splice-junctions-file

@SebastianMHJohn
Copy link

Hi,
I did it exactly as described in https://github.com/VCCRI/Sierra/wiki/Sierra-Vignette#splice-junctions-file.
Best wishes

@ckong1806
Copy link
Author

How have you run RegTools? Unless there's been an update, setting the -s parameter to 1 should work for 10x data. See the commands we use at: https://github.com/VCCRI/Sierra/wiki/Sierra-Vignette#splice-junctions-file

Hi, I ran the code exactly as the vignette but, when I opened the regtools output file, the strand column is filled with only question marks. Regtools also currently do not accept -s 1 as a parameter anymore. It's either RF (first strand) or FR (second strand) but, based on 10X Genomics website, STAR alignment does not include any strand info. I think that may be why regtools output is ? on the strand column. Did you guys have a different regtools output file (strand column) from 10X data? Was it all - or + on the strand column?

@rj-patrick
Copy link
Contributor

Thanks, I wasn't aware that RegTools had made that change, I'll update that wiki. If you specify -s RF that should work then - at least I was able to re-run it on an old BAM file and it appeared to replicate the old junctions file output. The strandedness refers to the library prep strategy, rather than STAR alignment, so for 10X you can use RF.

@ckong1806
Copy link
Author

Thanks, I wasn't aware that RegTools had made that change, I'll update that wiki. If you specify -s RF that should work then - at least I was able to re-run it on an old BAM file and it appeared to replicate the old junctions file output. The strandedness refers to the library prep strategy, rather than STAR alignment, so for 10X you can use RF.

I tried using -s RF and the regtools output file does have - on the strand column but many still have ?

How do you recommend I proceed? Should I filter out the ? rows?

@rj-patrick
Copy link
Contributor

Sorry for the slow response getting back to you. You should be fine to use that junction file for FindPeaks as it is, we don't utilise the strand column.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants