-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Low quantity of RNA editings found #37
Comments
Hi,
Could you send me you input and output files? Thanks.
Best,
Qing
2017-07-07 1:15 GMT-07:00 s-a-nersisyan <notifications@github.com>:
… Hi, Qing,
I'm using GIREMI to find some RNAE in my RNA-seq data. I have good aligned
file src.bam, reads are paired end (and properly paired). I'm performing
following steps:
1. Sorting bam in coordinate order and indexing it via samtools sort
and index. Output file: sorted.bam
2. Variant calling: samtools mpileup -ugf genome.fa sorted.bam -q 1 |
bcftools view -bvcg - > raw.bcf
3. Filtering results:
bcftools view raw.bcf | vcfutils.pl varFilter -D100 | awk '$6>=10' |
grep "0/1:" > filtered.vcf
In this line I remove homozygous SNV's by grep "0/1:"
4. Annotating filtered.vcf and generating SNV list file (if dbSNP,
gene name, strand etc) via snpEff and own simple python script
5. Running GIREMI: ./giremi -f genome.fa -l SNV_list.txt -o result
sorted.bam
My problem is that GIREMI generates 56616 lines .res file and only 3654 of
them are RNAE. I have meanMI:0.669357 sdMI:0.143034 and it seems to be
normal (as I read in previous issues). I think it's very low quantity of
RNA editings (my bam file is 15GB) and they don't normally match with
results of other RNA editing find tools. Maybe I'm doing something wrong or
missing some important steps? Looking forward for your answer and thank you
very much!
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#37>, or mute the thread
<https://github.com/notifications/unsubscribe-auth/ALL2zdAHaJ_1olyZ1necHmVoYCOlt2JHks5sLekggaJpZM4OQoVW>
.
|
Hello Qing, |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi, Qing,
I'm using GIREMI to find some RNAE in my RNA-seq data. I have good aligned file src.bam, reads are paired end (and properly paired). I'm performing following steps:
bcftools view raw.bcf | vcfutils.pl varFilter -D100 | awk '$6>=10' | grep "0/1:" > filtered.vcf
In this line I remove homozygous SNV's by grep "0/1:"
My problem is that GIREMI generates 56616 lines .res file and only 3654 of them are RNAE. I have meanMI:0.669357 sdMI:0.143034 and it seems to be normal (as I read in previous issues). I think it's very low quantity of RNA editings (my bam file is 15GB) and they don't normally match with results of other RNA editing find tools. Maybe I'm doing something wrong or missing some important steps? Looking forward for your answer and thank you very much!
The text was updated successfully, but these errors were encountered: