Skip to content
DENG Ziqi edited this page Jul 14, 2022 · 23 revisions

Overview

Welcome! PhyloCloud is an easy-to-use online platform for operating and interpreting phylogenetic trees and multiple sequence alignments, serving as a bridge between complex phylogenomic frameworks and non-expert users. Users can browse the public tree collections or build and save your own trees at your own workplace both registerially and anonymously. https://phylocloud.cgmlab.org/

If you use PhyloCloud, please cite

Ziqi Deng, Jorge Botas, Carlos P Cantalapiedra, Ana Hernández-Plaza, Jordi Burguet-Castell, 
Jaime Huerta-Cepas, PhyloCloud: an online platform for making sense of phylogenomic data, 
Nucleic Acids Research ,Volume 50, Issue W1, 5 July 2022, Pages W577–W582, 
https://doi.org/10.1093/nar/gkac324

uploadTree

Citation

Welcome! PhyloCloud is an easy-to-use online platform for operating and interpreting phylogenetic trees and multiple sequence alignments, serving as a bridge between complex phylogenomic frameworks and non-expert users. Users can browse the public tree collections or build and save your own trees at your own workplace both registerially and anonymously. https://phylocloud.cgmlab.org/

Quick Start

PhyloCloud provides entry point to annotated data and tools for phylogenomic research, regardless of whether registered or anonymous users. The home page of PhyloCloud indicates the most typical features so you can quickly start using PhyloCloud:

Besides of common usage for trees management, we also provide various phylogenetic tools to conduct various analysis

If you already have a tree, PhyloCloud implemented the state-of-art tree visualizing engine and provide various options for further modification or analysis.

UploadTree

One of the core features in PhyloCloud is to allow user upload multiple trees as a collection. Click the Upload tree button at the top right corner of the page to open the upload panel.

uploadTree

Input trees are expected to be in plain text files which not bigger than 50MB(~500,000 tips), and at the moment we support Newick format and parse with ETE Toolkit, for further imformation about newick format, please check ETE Toolkit

Phylocloud also allows user to attach alignment (<500MB) with corresponding tree. If you upload alignment files alongside with treefiles, please note that they must share the same prefix, otherwise alignments are not able to detect as attachment.

Besides of data input, users can also choose if their collection private or public for other users. Collection name is optional, once a collection is created, it will generate a collection id and it will be the collection title if it was not named. Anonymous user will be directed to the collection page where storage the trees you just upload. For registered users, they will be redirected to my cloud where your private cloud is.

*Noted Anonymous users should remember the collection id otherwise it will be lost.

BrowseTree

In phylocloud we have storage a several of featured collections which is public and open for browsing and download. Click Featured Collection to enter the public collection portal. Once your interested collection, click the collection title and entered the collection page.

Here we show example of eggNOG5 collection:

eggNOG5Collection

  • Trees of collection are displayed as tree cards with thumnails, with treecard you can find basic information of each tree, such as owner, title, and download button of tree and alignment(if there is)
  • User can search trees by tree title or node's name against public collection or own collection. When users are at collection page, the search filter will switch to Current Collection where allows user to search within collection

Phylogenomic tools

BuildTree

Phylocloud provides a automated phylogenetic workflows to construct phylogeneies with a simple click. Submit nucleotide or amino acids sequences in FASTA format and infer a phylogeny using ETE-Toolkit predefined workflows. Click here to explore more

Once the job has submit, it will be redirect to result page with job id. After a waiting patiently for a moment, result tree will be render and visualized on the result page.

*Noted users should remember the job id otherwise it will be lost.

CompareTree

Phylocloud allows users to compute topological distances between trees and compare differences side-by-side.

Comparison result will be shown below, it consists of two part:

treecompare1

the first part is the topological comparisons,

  • once you hover to the target node, it shows the features of the node and the corresponding note in the comparing target will be highlighted.
  • mismatches can be tracked up to internal nodes which are highlighted as red dot

treecompare2

the second part is a statistic summary which contains the information of statistic difference. The reported values are:

  • rf, Robinson-Foulds symmetric distance
  • max_rf, maximum Robinson-Foulds value for this comparison
  • norm_rf, Normalized Robinson-Foulds distance (RF/maxRF)
  • eucl dist, Euclidean distance between two trees
  • mismatches, number of mismatches that compare
  • %src_br frequency of edges in target tree found in the reference (1.00 = 100% of branches are found)
  • %ref_br frequency of edges in the reference tree found in target (1.00 = 100% of branches are found)
  • effective_tree_size, Effective tree size used for comparisons (after pruning not shared items)
  • treekoD Average distance among all possible subtrees in the original target trees to the reference tree
  • subtrees Number of subtrees used for the comparison (applies only when duplicated items are use to decomposed target trees)

TaxaQuery

Phylocloud allows to download, parse and query a local copy of the NCBI/GTDB taxonomy database.

Query by Ids: taxaexample1 taxaexample2

Taxa tree result will be shown below taxaquery

TreePage

Once upload your tree, you are able to visualize it on the TreePage quickstart

A) General Interface

PhyloCloud uses the new visualisation framework implemented in ETE 4.0, which allows for the interactive exploration of huge phylogenies based on a context-based adaptive zooming strategy. It allows for quick searches within large tree topologies, where each search can be associated to a different label and colour. By hover over target node or clade, it shows properties information(if there is any) of related node. By default, ecah node contain three basic properties, name, dist, and support.

B) Control Panel

Control Panel of tree view is consists of three major tabs:

  • Layout includes the basic options in regards to general tree layout information, including tree display mode, nodes collapse degree, annotation layout, etc
  • Select & Search provides options for choose colors for selected nodes or clades. It also provide function for searching target node with regular expression by clicking "new search" button. Noted that searching function can be also triggered with the shortcut key "/" or clicking a direct button to the tree view panel
  • Advance tab provides more specific options in regards to scale level, width, height from tree view to minimap.

C) Node Panel

The node editor panel provides access to node-specific actions, such as creating subtrees, collapsing, pruning, rooting, and more.

D) Tree Analysis

In order to perform any analysis or annotations, such as taxonomic or evolutionary detection, click the Analyze from the navigator Currently we provide four options of analysis.

Topology Editing

Here you can find topology modifications such as automatic re-rooting, pruning, ladderizing branches, resolving polytomies or converting tree topology into an ultrametric.

Tree Annotation

1) Taxonomic annotation

This action will extract taxa information from the tree leaves and annotated the tree accordingly.

By default, NCBI/GTDB TaxIDs will be extracted from the leaf names by splitting by the dot symbol (e.g, delimiter='.'), and taking the first element of the string (index=0), and choose NCBI or GTDB for Taxa DB.

NCBI annotation example https://phylocloud.cgmlab.org/tree_page/61c3061f33448902b5c2882b/

GTDB annotation example https://phylocloud.cgmlab.org/tree_page/6234703aa614694328db1570/

taxaexample1

After select correctly column info, click Annotate

taxaexample1

Taxa info will be activated in tree layouts at aligned panel

*Notes that PhyloCloud adopts GTDB Release 202, annotate it with different GTDB ids may encounter issues 

2) Evolutionary annotation

This action will scan tree topology and use species information of each leaf to infer speciation and duplication events.

By deafult, species names are extracted by splitting leaf names by the dot symbol (delimiter='.'), and taking the first element (index=0)

Check example https://phylocloud.cgmlab.org/tree_page/62458ebb0d85d1b583543600/

taxaexample1

In this example case, delimiter is "_", so we need to replace "." with _ .

taxaexample1

After annotation, Speciation events are lable as blue dots, and Duplication is red dots at the last comman ancestors

3) Link to Alignment

This action will link a multiple sequence alignment to the current tree.

check example tree https://phylocloud.cgmlab.org/tree_page/6245c056721288d60d0041ab/ with demo msa

After added alignment file, click Alignment layout on Control panel, alignment layout will be displayed taxaexample1

*Noted the above changes can be either discarded after testing or saved permanently into the database by clicking Save Changes on the top right corner

TreeExport

the Export tab provide tree export functions. Depending on the export format selected, several additional options are available:

taxaexample1

  • Text where users can choose to download plain newick file or full annotated newick containning all features information

  • Image where PhyloCloud provides PDF and HTML format for tree image

  • Tree view users can share treeview session by clicking share button to copy url link to clipboard. *note that for registered users, tree status must be set to Public(default) to be able to be read by other users