Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PPR-Meta scoring #35

Open
hoelzer opened this issue Oct 27, 2020 · 2 comments
Open

PPR-Meta scoring #35

hoelzer opened this issue Oct 27, 2020 · 2 comments
Labels
documentation Improvements or additions to documentation enhancement New feature or request

Comments

@hoelzer
Copy link
Collaborator

hoelzer commented Oct 27, 2020

Currently, all sequences are considered as viruses that are reported as "phages" by PPR-Meta. However, we can additionally filter by a "phage score" provided by the tool:

Header,Length,phage_score,chromosome_score,plasmid_score,Possible_source
seq8,86578,0.658026557109837,0.323770475766357,0.0182029599535136,phage
seq11,63443,0.671362450565434,0.257167359821571,0.0714701900259453,phage
seq20,41715,0.945974168353953,0.0147801588566125,0.0392456778921355,phage
seq22,38841,0.999412552439551,1.51951318980135e-05,0.000572250124745809,phage
awk 'BEGIN{FS=","}{if($6=="phage" && $3>0.7){print $0}}' 01-viruses/pprmeta/kleiner_virome_2015_pprmeta.csv 

This is also done here

@hoelzer hoelzer added documentation Improvements or additions to documentation enhancement New feature or request labels Oct 27, 2020
@hoelzer
Copy link
Collaborator Author

hoelzer commented Oct 27, 2020

I checked this for the Kleiner and Neto data set. Kleiner is not affected at all (because we combine PPRmeta results with VF results anyway) but some changes for Neto:

Screenshot from 2020-10-27 12-02-14
Screenshot from 2020-10-27 12-02-20

For Neto we reduce the number of unclassified contigs from 105 to 92. We also lose some Imitervirales annotations though.

@hoelzer
Copy link
Collaborator Author

hoelzer commented Feb 28, 2022

I would still do this and implement a parameter for the PPR-Meta filtering and not just taking all hits that are phage into account. A good default seems >0.7

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant