Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Predict lifestyle for a new species #2

Open
akiselev-lrsv opened this issue Jan 14, 2021 · 3 comments
Open

Predict lifestyle for a new species #2

akiselev-lrsv opened this issue Jan 14, 2021 · 3 comments

Comments

@akiselev-lrsv
Copy link

Dear @danielzmbp ,
Your work looks amazing, congratulations.

I'm interested in using this tool, could you please briefly explain how I can use to predict a lifestyle for a new species outside your training set? Do I need to assign it to a specific group ad then test the hypothesis, or the software can assign a species to a group automatcally?

Thank you!

@danielzmbp
Copy link
Owner

danielzmbp commented Jan 14, 2021

@akiselev-lrsv Thanks a lot for your comment!
You don't need to assign it to any group, the model will do that for you. You just need to run the model on your annotation using the notebook lspred_testing.ipynb. Note that you need to have the input files in the table format from the genome-properties script. If you have further questions, please don't hesitate to get in touch!

@akiselev-lrsv
Copy link
Author

Thank you @danielzmbp ,
Where can I read about the output scores of the notebook lspred_testing.ipynb? And how to use to interpret the results?

[[0.00992905 0.00214383 0.10016803 0.8877591 ] [0.30474845 0.25972545 0.24817495 0.18735124]] ['saprotroph', 'necrotroph', 'hemibiotroph', 'biotroph']

Best.
Andrei

@danielzmbp
Copy link
Owner

danielzmbp commented Jan 14, 2021

@akiselev-lrsv

Where can I read about the output scores of the notebook lspred_testing.ipynb?

I'm not sure I understand the question. Do you mean what is the source of the examples?

And how to use to interpret the results?

Those are the probabilities for the four output classes. Meaning, the first one would be predicted biotroph, and the second one a saprotroph, but not as clearly as the first one. I will try to make this more explicit in the notebook.
Thanks again for your interest,
Daniel

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants