-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The EVM results were not consistent with PB data #46
Comments
hi,
If you send me the input data for this example and your command, I'll be
able to give some insights.
best,
***@***.***
~b
…On Sat, Sep 11, 2021 at 6:06 AM Yizhong Huang ***@***.***> wrote:
Hi, there.
I have used the EVM model to combine lots of data to the annotation of a
genome. I checked some results from the EVM results with the IGV. As shown
in the picture, I confused about the results.
Why the EVM results were not consistent with the PB data or the
transdecoder result. Note: I put the transdecoder results into the EVM
model.
[image: image]
<https://user-images.githubusercontent.com/31943359/132944144-878260c9-8dbe-498a-b01c-b34800a89acd.png>
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#46>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX47KDGOLW2GKBZN3S3UBMSZDANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
OK, thanks so much! I will prepare the data as soon as possible! |
Hi, there ${EVM}/EvmUtils/execute_EVM_commands.pl commands.list #参数 --weights | -w weights for evidence types file${EVM}/EvmUtils/recombine_EVM_partial_outputs.pl --partitions partitions_list.out --output_file_name evm.out ${EVM}/EvmUtils/convert_EVM_outputs_to_GFF3.pl --partitions partitions_list.out --output evm.out --genome ${genome} |
Thanks!
Can you share your weights.txt file please?
…On Sat, Sep 11, 2021 at 11:30 AM Yizhong Huang ***@***.***> wrote:
Hi, there
Thanks for your kind help ! I have attached the corresponding file for the
region that I have posted before.
The commands are listed as follows:
${EVM}/EvmUtils/partition_EVM_inputs.pl
--genome ${genome}
--gene_predictions gene_predictions.gff3
--transcript_alignments transcript_alignments.gff3
--protein_alignments protein_alignments.gff3
--segmentSize 500000 --overlapSize 10000
--partition_listing partitions_list.out
${EVM}/EvmUtils/write_EVM_commands.pl --genome ${genome} --weights
/home/goldenpigs/1.Huangyizhong/12.BMX/14.Hifi-data/7.merge/weights.txt
--gene_predictions gene_predictions.gff3
--transcript_alignments transcript_alignments.gff3
--protein_alignments protein_alignments.gff3
--output_file_name evm.out --partitions partitions_list.out > commands.list
${EVM}/EvmUtils/execute_EVM_commands.pl commands.list
#参数
--weights | -w weights for evidence types file
${EVM}/EvmUtils/recombine_EVM_partial_outputs.pl --partitions
partitions_list.out --output_file_name evm.out
${EVM}/EvmUtils/convert_EVM_outputs_to_GFF3.pl --partitions
partitions_list.out --output evm.out --genome ${genome}
find . -regex ".*evm.out.gff3" -exec cat {} ; > EVM.all.gff3
Thanks again for your help!
Sincerely
YIzhong Huang
test.zip
<https://github.com/EVidenceModeler/EVidenceModeler/files/7148092/test.zip>
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX4BLAX3Q2RFQP5LWH3UBNY27ANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
also, if you could send me the inputs that correspond to this partition and
the exact command corresponding to that partition, that'll help too.
I need to rerun it exactly as you did for this piece.
thx
~brian
…On Sun, Sep 12, 2021 at 9:56 AM Brian Haas ***@***.***> wrote:
Thanks!
Can you share your weights.txt file please?
On Sat, Sep 11, 2021 at 11:30 AM Yizhong Huang ***@***.***>
wrote:
> Hi, there
> Thanks for your kind help ! I have attached the corresponding file for
> the region that I have posted before.
> The commands are listed as follows:
> ${EVM}/EvmUtils/partition_EVM_inputs.pl
> --genome ${genome}
> --gene_predictions gene_predictions.gff3
> --transcript_alignments transcript_alignments.gff3
> --protein_alignments protein_alignments.gff3
> --segmentSize 500000 --overlapSize 10000
> --partition_listing partitions_list.out
> ${EVM}/EvmUtils/write_EVM_commands.pl --genome ${genome} --weights
> /home/goldenpigs/1.Huangyizhong/12.BMX/14.Hifi-data/7.merge/weights.txt
> --gene_predictions gene_predictions.gff3
> --transcript_alignments transcript_alignments.gff3
> --protein_alignments protein_alignments.gff3
> --output_file_name evm.out --partitions partitions_list.out >
> commands.list
>
> ${EVM}/EvmUtils/execute_EVM_commands.pl commands.list
>
> #参数
> --weights | -w weights for evidence types file
>
> ${EVM}/EvmUtils/recombine_EVM_partial_outputs.pl --partitions
> partitions_list.out --output_file_name evm.out
>
> ${EVM}/EvmUtils/convert_EVM_outputs_to_GFF3.pl --partitions
> partitions_list.out --output evm.out --genome ${genome}
> find . -regex ".*evm.out.gff3" -exec cat {} ; > EVM.all.gff3
> Thanks again for your help!
> Sincerely
> YIzhong Huang
>
> test.zip
> <https://github.com/EVidenceModeler/EVidenceModeler/files/7148092/test.zip>
>
> —
> You are receiving this because you commented.
> Reply to this email directly, view it on GitHub
> <#46 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/ABZRKX4BLAX3Q2RFQP5LWH3UBNY27ANCNFSM5D2ZVEIQ>
> .
> Triage notifications on the go with GitHub Mobile for iOS
> <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
> or Android
> <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
>
>
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Sorry, the weight.txt files are listed as follows:
OTHER_PREDICTION transdecoder 5
PROTEIN GeMoMa 5
ABINITIO_PREDICTION Augustus 2
ABINITIO_PREDICTION GeneMark.hmm 2
ABINITIO_PREDICTION GlimmerHMM 1
Thanks again for your kind help!
Best
Yizhong Huang
… 2021年9月12日 21:57,Brian Haas ***@***.***> 写道:
Thanks!
Can you share your weights.txt file please?
On Sat, Sep 11, 2021 at 11:30 AM Yizhong Huang ***@***.***>
wrote:
> Hi, there
> Thanks for your kind help ! I have attached the corresponding file for the
> region that I have posted before.
> The commands are listed as follows:
> ${EVM}/EvmUtils/partition_EVM_inputs.pl
> --genome ${genome}
> --gene_predictions gene_predictions.gff3
> --transcript_alignments transcript_alignments.gff3
> --protein_alignments protein_alignments.gff3
> --segmentSize 500000 --overlapSize 10000
> --partition_listing partitions_list.out
> ${EVM}/EvmUtils/write_EVM_commands.pl --genome ${genome} --weights
> /home/goldenpigs/1.Huangyizhong/12.BMX/14.Hifi-data/7.merge/weights.txt
> --gene_predictions gene_predictions.gff3
> --transcript_alignments transcript_alignments.gff3
> --protein_alignments protein_alignments.gff3
> --output_file_name evm.out --partitions partitions_list.out > commands.list
>
> ${EVM}/EvmUtils/execute_EVM_commands.pl commands.list
>
> #参数
> --weights | -w weights for evidence types file
>
> ${EVM}/EvmUtils/recombine_EVM_partial_outputs.pl --partitions
> partitions_list.out --output_file_name evm.out
>
> ${EVM}/EvmUtils/convert_EVM_outputs_to_GFF3.pl --partitions
> partitions_list.out --output evm.out --genome ${genome}
> find . -regex ".*evm.out.gff3" -exec cat {} ; > EVM.all.gff3
> Thanks again for your help!
> Sincerely
> YIzhong Huang
>
> test.zip
> <https://github.com/EVidenceModeler/EVidenceModeler/files/7148092/test.zip>
>
> —
> You are receiving this because you commented.
> Reply to this email directly, view it on GitHub
> <#46 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/ABZRKX4BLAX3Q2RFQP5LWH3UBNY27ANCNFSM5D2ZVEIQ>
> .
> Triage notifications on the go with GitHub Mobile for iOS
> <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
> or Android
> <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
>
>
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub <#46 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AHTWVP6EPYCV3YSOG6XVSILUBSWSZANCNFSM5D2ZVEIQ>.
Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
|
great!
Can you find the partitioned inputs for that segment?
On Sun, Sep 12, 2021 at 10:09 AM Yizhong Huang ***@***.***>
wrote:
… Sorry, the weight.txt files are listed as follows:
OTHER_PREDICTION transdecoder 5
PROTEIN GeMoMa 5
ABINITIO_PREDICTION Augustus 2
ABINITIO_PREDICTION GeneMark.hmm 2
ABINITIO_PREDICTION GlimmerHMM 1
Thanks again for your kind help!
Best
Yizhong Huang
> 2021年9月12日 21:57,Brian Haas ***@***.***> 写道:
>
>
> Thanks!
>
> Can you share your weights.txt file please?
>
> On Sat, Sep 11, 2021 at 11:30 AM Yizhong Huang ***@***.***>
> wrote:
>
> > Hi, there
> > Thanks for your kind help ! I have attached the corresponding file for
the
> > region that I have posted before.
> > The commands are listed as follows:
> > ${EVM}/EvmUtils/partition_EVM_inputs.pl
> > --genome ${genome}
> > --gene_predictions gene_predictions.gff3
> > --transcript_alignments transcript_alignments.gff3
> > --protein_alignments protein_alignments.gff3
> > --segmentSize 500000 --overlapSize 10000
> > --partition_listing partitions_list.out
> > ${EVM}/EvmUtils/write_EVM_commands.pl --genome ${genome} --weights
> > /home/goldenpigs/1.Huangyizhong/12.BMX/14.Hifi-data/7.merge/weights.txt
> > --gene_predictions gene_predictions.gff3
> > --transcript_alignments transcript_alignments.gff3
> > --protein_alignments protein_alignments.gff3
> > --output_file_name evm.out --partitions partitions_list.out >
commands.list
> >
> > ${EVM}/EvmUtils/execute_EVM_commands.pl commands.list
> >
> > #参数
> > --weights | -w weights for evidence types file
> >
> > ${EVM}/EvmUtils/recombine_EVM_partial_outputs.pl --partitions
> > partitions_list.out --output_file_name evm.out
> >
> > ${EVM}/EvmUtils/convert_EVM_outputs_to_GFF3.pl --partitions
> > partitions_list.out --output evm.out --genome ${genome}
> > find . -regex ".*evm.out.gff3" -exec cat {} ; > EVM.all.gff3
> > Thanks again for your help!
> > Sincerely
> > YIzhong Huang
> >
> > test.zip
> > <
https://github.com/EVidenceModeler/EVidenceModeler/files/7148092/test.zip>
> >
> > —
> > You are receiving this because you commented.
> > Reply to this email directly, view it on GitHub
> > <
#46 (comment)
>,
> > or unsubscribe
> > <
https://github.com/notifications/unsubscribe-auth/ABZRKX4BLAX3Q2RFQP5LWH3UBNY27ANCNFSM5D2ZVEIQ
>
> > .
> > Triage notifications on the go with GitHub Mobile for iOS
> > <
https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675
>
> > or Android
> > <
https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub
>.
> >
> >
>
>
> --
> --
> Brian J. Haas
> The Broad Institute
> http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
> —
> You are receiving this because you authored the thread.
> Reply to this email directly, view it on GitHub <
#46 (comment)>,
or unsubscribe <
https://github.com/notifications/unsubscribe-auth/AHTWVP6EPYCV3YSOG6XVSILUBSWSZANCNFSM5D2ZVEIQ
>.
> Triage notifications on the go with GitHub Mobile for iOS <
https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android <
https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
>
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKXZYFRPJ3VBL2YB2C2LUBSYDFANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Ok, I will find it and send it quickly!
… 2021年9月12日 22:11,Brian Haas ***@***.***> 写道:
great!
Can you find the partitioned inputs for that segment?
On Sun, Sep 12, 2021 at 10:09 AM Yizhong Huang ***@***.***>
wrote:
> Sorry, the weight.txt files are listed as follows:
> OTHER_PREDICTION transdecoder 5
> PROTEIN GeMoMa 5
> ABINITIO_PREDICTION Augustus 2
> ABINITIO_PREDICTION GeneMark.hmm 2
> ABINITIO_PREDICTION GlimmerHMM 1
> Thanks again for your kind help!
> Best
> Yizhong Huang
>
> > 2021年9月12日 21:57,Brian Haas ***@***.***> 写道:
> >
> >
> > Thanks!
> >
> > Can you share your weights.txt file please?
> >
> > On Sat, Sep 11, 2021 at 11:30 AM Yizhong Huang ***@***.***>
> > wrote:
> >
> > > Hi, there
> > > Thanks for your kind help ! I have attached the corresponding file for
> the
> > > region that I have posted before.
> > > The commands are listed as follows:
> > > ${EVM}/EvmUtils/partition_EVM_inputs.pl
> > > --genome ${genome}
> > > --gene_predictions gene_predictions.gff3
> > > --transcript_alignments transcript_alignments.gff3
> > > --protein_alignments protein_alignments.gff3
> > > --segmentSize 500000 --overlapSize 10000
> > > --partition_listing partitions_list.out
> > > ${EVM}/EvmUtils/write_EVM_commands.pl --genome ${genome} --weights
> > > /home/goldenpigs/1.Huangyizhong/12.BMX/14.Hifi-data/7.merge/weights.txt
> > > --gene_predictions gene_predictions.gff3
> > > --transcript_alignments transcript_alignments.gff3
> > > --protein_alignments protein_alignments.gff3
> > > --output_file_name evm.out --partitions partitions_list.out >
> commands.list
> > >
> > > ${EVM}/EvmUtils/execute_EVM_commands.pl commands.list
> > >
> > > #参数
> > > --weights | -w weights for evidence types file
> > >
> > > ${EVM}/EvmUtils/recombine_EVM_partial_outputs.pl --partitions
> > > partitions_list.out --output_file_name evm.out
> > >
> > > ${EVM}/EvmUtils/convert_EVM_outputs_to_GFF3.pl --partitions
> > > partitions_list.out --output evm.out --genome ${genome}
> > > find . -regex ".*evm.out.gff3" -exec cat {} ; > EVM.all.gff3
> > > Thanks again for your help!
> > > Sincerely
> > > YIzhong Huang
> > >
> > > test.zip
> > > <
> https://github.com/EVidenceModeler/EVidenceModeler/files/7148092/test.zip>
> > >
> > > —
> > > You are receiving this because you commented.
> > > Reply to this email directly, view it on GitHub
> > > <
> #46 (comment)
> >,
> > > or unsubscribe
> > > <
> https://github.com/notifications/unsubscribe-auth/ABZRKX4BLAX3Q2RFQP5LWH3UBNY27ANCNFSM5D2ZVEIQ
> >
> > > .
> > > Triage notifications on the go with GitHub Mobile for iOS
> > > <
> https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675
> >
> > > or Android
> > > <
> https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub
> >.
> > >
> > >
> >
> >
> > --
> > --
> > Brian J. Haas
> > The Broad Institute
> > http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
> > —
> > You are receiving this because you authored the thread.
> > Reply to this email directly, view it on GitHub <
> #46 (comment)>,
> or unsubscribe <
> https://github.com/notifications/unsubscribe-auth/AHTWVP6EPYCV3YSOG6XVSILUBSWSZANCNFSM5D2ZVEIQ
> >.
> > Triage notifications on the go with GitHub Mobile for iOS <
> https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
> or Android <
> https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
>
> >
>
> —
> You are receiving this because you commented.
> Reply to this email directly, view it on GitHub
> <#46 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/ABZRKXZYFRPJ3VBL2YB2C2LUBSYDFANCNFSM5D2ZVEIQ>
> .
> Triage notifications on the go with GitHub Mobile for iOS
> <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
> or Android
> <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
>
>
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub <#46 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AHTWVPZQVFTEFUEO3LKPC63UBSYJTANCNFSM5D2ZVEIQ>.
Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
|
The partitioned list out for this for segment is : The comamand is : Thanks |
do you find all the inputs at:
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000 ?
…On Sun, Sep 12, 2021 at 10:42 AM Yizhong Huang ***@***.***> wrote:
The partitioned list out for this for segment is :
chr1_MotherHap /work/6.EVM/chr1_MotherHap Y
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000
The comamand is :
/home/EVidenceModeler/EvmUtils/.././evidence_modeler.pl -G
genome_softmasked.fa -g gene_predictions.gff3 -w /work/6.EVM/weights.txt -e
transcript_alignments.gff3 -p protein_alignments.gff3 --exec_dir
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000 >
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000/evm.out 2>
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000/evm.out.log
all this files are right?
Thanks
Best
Yizhong. Huang
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKXZX33YGO3GQ4ZL4IO3UBS34BANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Yes, all thell the inputs at: /work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000 as attached as files below |
Got it. thanks!
I see a couple things here that could be a problem.
The transcript and protein alignments need to be in an alignment gff3
format (which is different from the ab initio prediction type). See the
example data that ships with EVM for formatting examples and you'll see
what I mean.
The pacbio data doesn't appear to be included - in case you were aiming to
do this. It would go into the transcript alignments input type.
The OTHER_PREDICTION type for transdecoder is fine, but you'd need to
include the transdecoder data in the gene predictions input file (not
transcript alignments file).
hope this helps,
~b
On Sun, Sep 12, 2021 at 11:11 AM Yizhong Huang ***@***.***>
wrote:
… do you find all the inputs at:
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000 ?
… <#m_4636621665775433958_>
On Sun, Sep 12, 2021 at 10:42 AM Yizhong Huang *@*.***> wrote: The
partitioned list out for this for segment is : chr1_MotherHap
/work/6.EVM/chr1_MotherHap Y
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000 The comamand is
: /home/EVidenceModeler/EvmUtils/.././evidence_modeler.pl -G
genome_softmasked.fa -g gene_predictions.gff3 -w /work/6.EVM/weights.txt -e
transcript_alignments.gff3 -p protein_alignments.gff3 --exec_dir
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000 >
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000/evm.out 2>
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000/evm.out.log all
this files are right? Thanks Best Yizhong. Huang — You are receiving this
because you commented. Reply to this email directly, view it on GitHub <#46
(comment)
<#46 (comment)>>,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ABZRKXZX33YGO3GQ4ZL4IO3UBS34BANCNFSM5D2ZVEIQ
. Triage notifications on the go with GitHub Mobile for iOS
https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675
or Android
https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub
.
-- -- Brian J. Haas The Broad Institute http://broadinstitute.org/~bhaas
http://broad.mit.edu/~bhaas
Yes, all thell the inputs at:
/work/6.EVM/chr1_MotherHap/chr1_MotherHap_92000001-93000000 as attached as
files below
region_92000001-93000000.zip
<https://github.com/EVidenceModeler/EVidenceModeler/files/7149786/region_92000001-93000000.zip>
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKXYZXKZ4FXYCTXFNAALUBS7K7ANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Thanks so much for your kind advice! I have change the EVM process as follows: Thanks again! |
I see. In this case, you might add the GeMoMa predictions into the gene
predictions input as well, and assign it as 'OTHER_EVIDENCE' type in the
weights file (like transdecoder).
…On Sun, Sep 12, 2021 at 9:46 PM Yizhong Huang ***@***.***> wrote:
Thanks so much for your kind advice! I have change the EVM process as
follows:
First, add the transdecoder file into the gene_predictions.gff3
Second, add the pacbio. data using the taco_gtf_to_alignment_gff3.pl into
the. transcript_alignments.gff3
But as for the protein results from the GeMoMa, the file that I used was
the same as the file in the example_data_files, then I used the
GeMoMa_gff_to_gff3.pl to convert it into the protein_alignments.gff3, the
final file is the same format as the gene_prediction. I also runned the
GeMoMa.example.gff by using the GeMoMa script, no changes in the final
results. How to deal with it? The new EVM process is running.
Thanks again!
yizhong Huang
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX22HXLIIR3Y73B3YADUBVJW7ANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Just discard the protein_alignments.gff3 if I have no other proteins evidence and then add the GeMoMa predictions into the gene predictions input files? Right ? |
that's right
On Mon, Sep 13, 2021 at 10:48 AM Yizhong Huang ***@***.***>
wrote:
… I see. In this case, you might add the GeMoMa predictions into the gene
predictions input as well, and assign it as 'OTHER_EVIDENCE' type in the
weights file (like transdecoder).
… <#m_-4895672658369520534_>
On Sun, Sep 12, 2021 at 9:46 PM Yizhong Huang *@*.***> wrote: Thanks so
much for your kind advice! I have change the EVM process as follows: First,
add the transdecoder file into the gene_predictions.gff3 Second, add the
pacbio. data using the taco_gtf_to_alignment_gff3.pl into the.
transcript_alignments.gff3 But as for the protein results from the GeMoMa,
the file that I used was the same as the file in the example_data_files,
then I used the GeMoMa_gff_to_gff3.pl to convert it into the
protein_alignments.gff3, the final file is the same format as the
gene_prediction. I also runned the GeMoMa.example.gff by using the GeMoMa
script, no changes in the final results. How to deal with it? The new EVM
process is running. Thanks again! yizhong Huang — You are receiving this
because you commented. Reply to this email directly, view it on GitHub <#46
(comment)
<#46 (comment)>>,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ABZRKX22HXLIIR3Y73B3YADUBVJW7ANCNFSM5D2ZVEIQ
. Triage notifications on the go with GitHub Mobile for iOS
https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675
or Android
https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub
.
-- -- Brian J. Haas The Broad Institute http://broadinstitute.org/~bhaas
http://broad.mit.edu/~bhaas
Just discard the protein_alignments.gff3 if I have no other proteins
evidence and then add the GeMoMa predictions into the gene predictions
input files? Right ?
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX26QRBLFZ2YWIEBHJLUBYFKRANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Thanks so much! Hope it can solve my problems! Thanks again for your kind help ! |
You're welcome!
On Mon, Sep 13, 2021 at 11:07 AM Yizhong Huang ***@***.***>
wrote:
… that's right On Mon, Sep 13, 2021 at 10:48 AM Yizhong Huang *@*.
*> wrote: … <#m_5047073197504118105_> I see. In this case, you might add
the GeMoMa predictions into the gene predictions input as well, and assign
it as 'OTHER_EVIDENCE' type in the weights file (like transdecoder). …
<#m_-4895672658369520534_> On Sun, Sep 12, 2021 at 9:46 PM Yizhong Huang @.*>
wrote: Thanks so much for your kind advice! I have change the EVM process
as follows: First, add the transdecoder file into the gene_predictions.gff3
Second, add the pacbio. data using the taco_gtf_to_alignment_gff3.pl into
the. transcript_alignments.gff3 But as for the protein results from the
GeMoMa, the file that I used was the same as the file in the
example_data_files, then I used the GeMoMa_gff_to_gff3.pl to convert it
into the protein_alignments.gff3, the final file is the same format as the
gene_prediction. I also runned the GeMoMa.example.gff by using the GeMoMa
script, no changes in the final results. How to deal with it? The new EVM
process is running. Thanks again! yizhong Huang — You are receiving this
because you commented. Reply to this email directly, view it on GitHub <
#46 <#46>
(comment) <#46 (comment)
<#46 (comment)>>>,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ABZRKX22HXLIIR3Y73B3YADUBVJW7ANCNFSM5D2ZVEIQ
. Triage notifications on the go with GitHub Mobile for iOS
https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675
or Android
https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub
. -- -- Brian J. Haas The Broad Institute http://broadinstitute.org/~bhaas
http://broad.mit.edu/~bhaas Just discard the protein_alignments.gff3 if I
have no other proteins evidence and then add the GeMoMa predictions into
the gene predictions input files? Right ? — You are receiving this because
you commented. Reply to this email directly, view it on GitHub <#46
(comment)
<#46 (comment)>>,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ABZRKX26QRBLFZ2YWIEBHJLUBYFKRANCNFSM5D2ZVEIQ
. Triage notifications on the go with GitHub Mobile for iOS
https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675
or Android
https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub
.
-- -- Brian J. Haas The Broad Institute http://broadinstitute.org/~bhaas
http://broad.mit.edu/~bhaas
Thanks so much! Hope it can solve my problems! Thanks again for your kind
help !
Sincerely
Yizhong.Huang
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX57LVZOQU6GU4SAVWTUBYHTNANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Hi, there! best~ |
One issue is that EVM doesn't model UTRs and will only model coding exons,
but you'll have UTRs in some of your inputs.
If you want to add UTRs, you could run PASA afterwards using the EVM data
as input annotations and the pacbio and other transcripts as inputs /
sources for the UTRs.
Another issue is that EVM requires complete ORFs unless the genes fall at
the ends of the contigs, in which case they can be 5' or 3' partials.
hope this helps,
~b
…On Tue, Sep 14, 2021 at 5:34 AM Yizhong Huang ***@***.***> wrote:
Hi, there!
With your help, the EVM results have become more accurate than before!
Thanks so much ! When I check the EVM final results with the igv, there is
still some mistakes, as showing in the follow pictures. The final results
were still lost some exons as comparing with the pb isoforms and the
transdecoder results. Need your help and thanks again! In this EVM process,
I just add the transcript_alignments.gff3 and gene_predictions.gff3 , and
all the transdecoder and GeMoMa results were combined into the
gene_predictions.gff3 file. The weights file were:
TRANSCRIPT pacbio 10
OTHER_PREDICTION transdecoder 8
OTHER_PREDICTION GeMoMa 5
[image: image]
<https://user-images.githubusercontent.com/31943359/133223891-6c7000a0-77a2-4732-960a-fe8e044f89f3.png>
chr15_MotherHap_12000001-13000000.zip
<https://github.com/EVidenceModeler/EVidenceModeler/files/7160680/chr15_MotherHap_12000001-13000000.zip>
best~
Yizhong Huang
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX4JHJ4YR3TK36RYF3LUB4JLZANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Yes,I know that the EVM only model the coding exons. As shown in the transdecoder file, the genes that marked with the red arrow is a complete ORFs and the left terminal is a exon ,may not a UTR. So, I confused that why the EVM results did not have the left exon? |
you can play around with the weights file and see how it impacts things.
If there's something serious, I can dig into it, but I've got a lot of
other work on my plate right now.
On Tue, Sep 14, 2021 at 8:27 AM Yizhong Huang ***@***.***>
wrote:
… One issue is that EVM doesn't model UTRs and will only model coding exons,
but you'll have UTRs in some of your inputs. If you want to add UTRs, you
could run PASA afterwards using the EVM data as input annotations and the
pacbio and other transcripts as inputs / sources for the UTRs. Another
issue is that EVM requires complete ORFs unless the genes fall at the ends
of the contigs, in which case they can be 5' or 3' partials. hope this
helps, b
… <#m_4129522838419141696_>
On Tue, Sep 14, 2021 at 5:34 AM Yizhong Huang *@*.***> wrote: Hi, there!
With your help, the EVM results have become more accurate than before!
Thanks so much ! When I check the EVM final results with the igv, there is
still some mistakes, as showing in the follow pictures. The final results
were still lost some exons as comparing with the pb isoforms and the
transdecoder results. Need your help and thanks again! In this EVM process,
I just add the transcript_alignments.gff3 and gene_predictions.gff3 , and
all the transdecoder and GeMoMa results were combined into the
gene_predictions.gff3 file. The weights file were: TRANSCRIPT pacbio 10
OTHER_PREDICTION transdecoder 8 OTHER_PREDICTION GeMoMa 5 [image: image]
https://user-images.githubusercontent.com/31943359/133223891-6c7000a0-77a2-4732-960a-fe8e044f89f3.png
chr15_MotherHap_12000001-13000000.zip
https://github.com/EVidenceModeler/EVidenceModeler/files/7160680/chr15_MotherHap_12000001-13000000.zip
best Yizhong Huang — You are receiving this because you commented. Reply
to this email directly, view it on GitHub <#46 (comment)
<#46 (comment)>>,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ABZRKX4JHJ4YR3TK36RYF3LUB4JLZANCNFSM5D2ZVEIQ
. Triage notifications on the go with GitHub Mobile for iOS
https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675
or Android
https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub
.
-- -- Brian J. Haas The Broad Institute http://broadinstitute.org/~bhaas
http://broad.mit.edu/~bhaas
Yes,I know that the EVM only model the coding exons. As shown in the
transdecoder file, the genes that marked with the red arrow is a complete
ORFs and the left terminal is a exon ,may not a UTR. So, I confused that
why the EVM results did not have the left exon?
Thanks so much!
yizhong Huang
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX7ZNOXRAVY3PNLMK43UB45RZANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Ok, thanks and sorry to disturb! |
hi,
can you send me the partitioned data for this one?
Also, do you have the gemoma data uploaded into igv?
…On Wed, Sep 15, 2021 at 9:23 AM Yizhong Huang ***@***.***> wrote:
Hi, there
Sorry to disturb you again. Today, I modify the weight files and the
results have been improved much. I also check the file in the IGV with the
EVm final files, pb data, transdecoder data and the protein data. As shown
in the pictures, all data support that this region has only one gene, while
the EVM model hass two genes. How to explain and solve it? The weights are:
TRANSCRIPT pacbio 8
OTHER_PREDICTION transdecoder 10
OTHER_PREDICTION GeMoMa 5
[image: image]
<https://user-images.githubusercontent.com/31943359/133441213-8a4a2df6-beb3-43db-b3b7-75d212fce881.png>
Thanks
Yizhong Huang
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX55TN2G5IIEPCF6L43UCCM6RANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
yes, I have added the genoma data into the igv and the results also support this region has only one gene. Thanks advance for your kind help! The partitioned data for this one is attahced! |
Got it.
So the issue with the split gene is because there's an AT--AC intron and
EVM doesn't work with those introns:
AG 929365-929461 *AT*
*AC* 930973-931145 GT
For the other one, when studying the transdecoder results, note that IGV is
showing both the coding and the noncoding parts of the transcript.
Ideally, you'll show just the coding region since that's what EVM is
working with - so just the CDS regions.
hope this helps,
~brian
On Wed, Sep 15, 2021 at 9:27 PM Yizhong Huang ***@***.***> wrote:
Hi, there
Another problem: how to deal with the EVM results which was the same as
the protein results but not with the pb and transdecoder result? As shown
in the following picture.
… Thanks
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or unsubscribe.
Triage notifications on the go with GitHub Mobile for iOS or Android.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas
|
Thanks, for the split gene with the splice site that EVM doesn't work with those introns, only I can do is manual curation? thanks so much! |
It isn't clear from the image where the coding regions are in the
transdecoder results - it's just showing the transcripts that were targeted
by transdecoder for orf detection.
EVM is generally used with a combination of ab initio predictors along with
these other evidence types, so you might consider running a tool like
augustus and including those predictions as well - could help in general.
For the non GT/GC--AG introns, we tend to use PASA to 'fix' those.
best of luck!
…On Thu, Sep 16, 2021 at 10:52 AM Yizhong Huang ***@***.***> wrote:
Thanks, for the split gene with the splice site that EVM doesn't work with
those introns, only I can do is manual curation?
As for the second question, we can see clearly that the EVM results is the
same as the GeMoMa results, not the pb or transdecoder results. How to deal
with it ?
thanks so much!
yIzhong Huang
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX5PFEEAASX24TK6DT3UCIAELANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Got it, when I do the EVM process, I have set the weight file for PB at the first, and then the transdecoder, the gemoma is the third. In my opinion, when the exons structures are the same in the three data, the final EVM results may the PB structures, so I confused about the results as shown in the final picture. Maby I am wrong! |
yes, you'd run PASA based on the transcripts, and import the EVM
predictions in as the annotation set. PASA will 'fix' any that need to be
adjusted based on transcript alignments and add UTRs and alt splice
variants.
…On Thu, Sep 16, 2021 at 12:47 PM Yizhong Huang ***@***.***> wrote:
Got it, when I do the EVM process, I have set the weight file for PB at
the first, and then the transdecoder, the gemoma is the third. In my
opinion, when the exons structures are the same in the three data, the
final EVM results may the PB structures, so I confused about the results as
shown in the final picture. Maby I am wrong!
I agree with you that the denovo prediction results should be added into
the EVM model. I have tested the results between with-BRAKER and no-BRAKER
(the BRAKER including the augustus result), I find that the EVM results are
more credible without the BRAKER result. Maybe I should add the denovo
results to do it again!
As for the PASA , do you mean I should firstly run the alignment by the
PASA process with the transcripts data and then do the update process ?
Thanks so much!
Yizhong Huang
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABZRKX2AIFVILHUD3L7LTE3UCINRJANCNFSM5D2ZVEIQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas <http://broad.mit.edu/~bhaas>
|
Hi, there.
![image](https://user-images.githubusercontent.com/31943359/132944144-878260c9-8dbe-498a-b01c-b34800a89acd.png)
I have used the EVM model to combine lots of data to the annotation of a genome. I checked some results from the EVM results with the IGV. As shown in the picture, I confused about the results.
Why the EVM results were not consistent with the PB data or the transdecoder result. Note: I put the transdecoder results into the EVM model.
Thanks so much !
Yizhong Huang
The text was updated successfully, but these errors were encountered: