Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create HGTrobustness_parsing #18

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

CaroleBelliardo
Copy link

add HGTrobustness_parsing
This script, HGTrobustness_parsing.py, is designed to Facilitate a phylogenetic validation of putative Horizontal Gene Transfer (HGT) events based on a comprehensive analysis involving protein counts and taxonomic ratios within sister branches of phylogenetic trees.

Usage
It is run from the command line and requires several arguments:

--fasttree_tree_results or -a: This is the file path to fasttree_tree_results from AvP results. This argument is required.

--fasttree or -t: This is the directory path containing trees in Newick format. This argument is required.

--nb_prot or -n: This is the total number of proteins in the sister branch and ancestral sister branch. This argument is optional and defaults to 3 if not provided.

--clade_ratio or -n: This is the clade ratio, which is the total number of proteins in the sister branch and ancestral sister branch. This argument is optional and defaults to 0.8 if not provided.

--output_names or -o: This is the name of the output files. This argument is required.

The script also includes a function import_tree that imports a tree structure from a Newick file and roots the tree at the midpoint. This function takes as an argument the name of the file containing the tree structure in Newick format.

To run the script, navigate to the directory containing the script and use the following command, replacing the argument values with those appropriate for your use case:

$ python Parse_tree_2024.py --fasttree_tree_results [your_results_file/file path] --fasttree [your_tree_directory/dir path] --nb_prot [your_protein_number/int] --clade_ratio [your_clade_ratio/float] --output_names [your_output_file_name/path]

add HGTrobustness_parsing
This script, HGTrobustness_parsing.py, is designed to facilitate a phylogenetic validation of putative Horizontal Gene Transfer (HGT) events based on a comprehensive analysis involving protein counts and taxonomic ratios within sister branches.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant