-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issues regarding flair #1
Comments
I am taking a guess that what's causing In regards to the minimap2 usage error, that's been fixed with the newest |
Hi @belgravia Thanks for your quick response. I will try the updated one now. Just one more query. While running the last version, I was having 723040 entries in my input *.psl, 722688 entries in *_strand.psl; 129339 entries in corrected.gp; and only 10941 entries in *_strand_corrected.psl. Does it looks ok? Thanks |
There is an expected drop in entries after correction since any reads that contain a splice junction not conforming to GT-AG splice motif and isn't within window of an annotated GT-AG is removed. A >40% drop in entries is larger than what I've normally seen. If
Potential fixes are: And there should be little to no drop for the first strand/gene inference step. To address that, I'd have to look at a the missing entries since it's unclear to me what could be causing that. |
Hi @belgravia I am still having issues with running it. Also I tried to check the intermidiate files.
|
After the FLAIR v1..0 release, I had pushed an updated version of On another note, I hope you have fixed your issue with the loss of reads after novel junction filtering! |
It working perfect with updates. -novel option solve the issues related to loss of reads. Thank you for your help. I am having one last query. During Are you considering the collapse.fa as the ref genome? Regards |
I'm glad it's working for you now. |
Thanks for the explanation and also for this nice tools. It makes life easy. |
Hi Alison,
I have a query indirectly related to flair script. In the flair output, we
get the fasta files on the NOVEL isoforms. Is there any way we can have the
fasta files of the KNOWN isoforms of the flair-identified NOVEL isoforms
from the input file?
Thanks and regards
…On Thu, Oct 18, 2018 at 12:36 PM Alison ***@***.***> wrote:
I'm glad it's working for you now.
Yes, that step is aligning the reads directly to the first-pass isoforms
to assign an isoform to each read. Isoforms that surpass some number
threshold of reads supporting them are retained in the final isoform output.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#1 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AopMlU0aleGY9QBh482riCA5rwUs6SMBks5umNgggaJpZM4Xc6oZ>
.
|
The fasta files should include all isoforms, including the annotated ones. If you run So for human data, the annotated isoforms should be renamed to 'ENST....' and the novel ones should have the nanopore read name as their name. Also I'm going to be pushing a faster version of this script and others quite soon, if you can wait a little bit. |
Hi Alison,
Thanks for your response. I can see and updated version in flair. Is this
the version you were talking about?
Regards
Ashok Patowary, PhD
University of California Los Angeles
635 Charles E Young Dr S
Los Angeles
CA 90095
http://www.ashokpatowary.com/
…On Thu, Nov 29, 2018 at 4:24 PM Alison ***@***.***> wrote:
The fasta files should include all isoforms, including the annotated ones.
If you run bin/identify_similar_annotated_isoforms.py on your final
isoform psl, it should rename the isoforms that are annotated to their
names in the annotation. This script is already run in the flair.py
wrapper.
So for human data, the annotated isoforms should be renamed to 'ENST....'
and the novel ones should have the nanopore read name as their name.
Also I'm going to be pushing a faster version of this script and others
quite soon, if you can wait a little bit.
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub
<#1 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AopMld6dAYxN0QZUtc7LNAvKCd4resrEks5u0HqggaJpZM4Xc6oZ>
.
|
Sorry for delay in response! Yes, that script is correct. I combined scripts and there's now a script called |
Hi @anbrooks @belgravia
I tried to run the flair pipeline on my MinION RNAseq dataset. I have faces some issues.
So I mapped the reads independently with minimap2. Thereafter while using the
collapse
optionspython /u/home/Resource/flair/flair.py collapse -r test.fastq -g GRCh37.primary_assembly.genome.fa -q test_strand_corrected.psl -m minimap2 -f GenCode_v28.gtf
, it gives me a blank output file. Also I tried usingpsl_to_sequence.py
tools separately, it gives a blank output file.Am I doing anything wrong?
The text was updated successfully, but these errors were encountered: