Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About fDOG running issues #32

Closed
majssssa opened this issue Mar 30, 2023 · 12 comments
Closed

About fDOG running issues #32

majssssa opened this issue Mar 30, 2023 · 12 comments

Comments

@majssssa
Copy link

I had the following problem running fDOG
(hamstr) sxq020@masterv2:~/fDOG/output39_1$ fdog.run --seqFile /media/ym/desk16/sxq020/fDOG/output39_1/431143.fa --jobName A_test --refspec ORYSA@4530@230330 --corepath /media/ym/desk16/sxq020/fDOG/output39_1/A39_1_39525/coreTaxa_dir --searchpath /media/ym/desk16/sxq020/fDOG/output39_1/A39_1_39525/searchTaxa_dir --annopath /media/ym/desk16/sxq020/fDOG/output39_1/A39_1_39525/annotation_dir

Identified seed ID: 4530_1
Compiling core set for A_test
Traceback (most recent call last):
File "/media/ym/desk16/sxq020/.local/bin/fdog.run", line 8, in
sys.exit(main())
File "/media/ym/desk16/sxq020/.local/lib/python3.8/site-packages/fdog/runSingle.py", line 225, in main
core_runtime = core_fn.run_compile_core([seqFile, seqName, refspec, seed_id, reuseCore,
File "/media/ym/desk16/sxq020/.local/lib/python3.8/site-packages/fdog/libs/corecompile.py", line 417, in run_compile_core
compile_core([seqFile, seqName, refspec, seed_id, coreArgs, pathArgs,
File "/media/ym/desk16/sxq020/.local/lib/python3.8/site-packages/fdog/libs/corecompile.py", line 178, in compile_core
tree = ncbi.get_topology(tax_ids.keys(), intermediate_nodes = True)
File "/media/ym/desk16/sxq020/.local/lib/python3.8/site-packages/ete3/ncbi_taxonomy/ncbiquery.py", line 442, in get_topology
lineage = id2lineage[sp]
KeyError: 39525383

Since the species added to the taxa does not have a taxa_id number in the ncbi, I numbered it 39525383.

@trvinh
Copy link
Member

trvinh commented Mar 30, 2023

Hi,
as written at https://github.com/BIONF/fDOG/wiki/FAQ#can-fdog-work-with-non-ncbi-taxa, the core compilation step of fDOG cannot work with non-ncbi taxa. I suppose that you have your new species with the ID 39525383 in the blast databases folder (named coreTaxa_dir by default).
Best,
Vinh
P.S.: I see that you open multiple issues to ask about the problems regarding only to the usage of fDOG, i.e. they have always the same topic. It would be more convenient for us if you just post them in one ONE issue entry. Many thanks!

@majssssa
Copy link
Author

I am very sorry for the inconvenience caused to you. If you have any questions in the future, I will do as you said.

@majssssa
Copy link
Author

I am very sorry for the inconvenience caused to you. If I have any questions in the future, I will do as you said.I'm sorry for the typo in my last answer. I hope it won't cause your misunderstanding.

@majssssa
Copy link
Author

majssssa commented May 4, 2023

Hi,
There was a strick option in an earlier version of Hamstr. I wonder if there is an option in fDOG that has similar functionality to strick.Or in other words how to make the software run more accurately.

@trvinh
Copy link
Member

trvinh commented May 4, 2023

The -strict option is not available anymore in fDOG. I would suggest to choose from the core taxa the reference species as the most closely related species to your search taxa.

@majssssa
Copy link
Author

majssssa commented May 4, 2023

Ok, thank you very much for your reply

@majssssa
Copy link
Author

majssssa commented May 4, 2023

Sorry to bother you again, but the choice for the --evalHmmer parameter should be 0.00005 or 1e-6

@trvinh
Copy link
Member

trvinh commented May 4, 2023

you must use the decimal format (e.g. 0.00005 for 5e-5)

@trvinh
Copy link
Member

trvinh commented May 4, 2023

Hi,
There was a strick option in an earlier version of Hamstr. I wonder if there is an option in fDOG that has similar functionality to strick.Or in other words how to make the software run more accurately.

Hi @majssssa ,
just another info, we found that fDOG performed with a high specificity. The -strict option will just reduce the sensitivity, which is not necessary. You will find this benchmark in the fDOG manuscript, which will be soon published.
Regards,
Vinh

@majssssa
Copy link
Author

majssssa commented May 5, 2023

HI,
When running fDOG, I added the --rbh option instead of the --rsp option. I found that only one gene was selected for each species. Is this because the conditions are more stringent after adding --rbh, so only one gene meets the criteria, or does the --rbh option select only the most perfect candidate based on the output?

@ebersber
Copy link

ebersber commented May 5, 2023 via email

@majssssa
Copy link
Author

majssssa commented May 5, 2023 via email

@trvinh trvinh closed this as completed Mar 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants