# Root-to-tip plots for fellowship applications

September 20, 2018 

I generated a tree with FastTree from the H5 HA dataset that I used for my BEAST run on 2018-08-23. I opened the tree file in TempEst, chose "best-fitting root", and exported the data as a text file. Here I read that file in and plot root-to-tip distance against collection date.

In [1]:
# import necessary modules
import sys, subprocess, glob, os, shutil, re, importlib, Bio, csv
import numpy as np
import pandas as pd
import rpy2
%load_ext rpy2.ipython 

In [2]:
# read in file 
infile = "/Users/lmoncla/Documents/H5N1_Cambodian_outbreak_study/beast/dta-2018-08-23/root-to-tip-2018-09-20.txt"

In [3]:
# read into a dataframe
df = pd.read_csv(infile, sep="\t", header=0)
df.head()

Unnamed: 0,tip,date,distance,residual
0,a/muscovyduck/vietnam/lbm113/2012|a|h5n1|2.3.2...,2012.101093,0.207833,0.018957
1,a/muscovyduck/vietnam/lbm260/2012|a|h5n1|2.3.2...,2012.5,0.210973,0.018978
2,a/muscovyduck/vietnam/lbm258/2012|a|h5n1|2.3.2...,2012.5,0.210983,0.018988
3,a/muscovyduck/vietnam/ncvd1220/2012|a|h5n1||ha...,2012.103825,0.207293,0.018396
4,a/duck/vietnam/ncvd1210/2012|a|h5n1||ha|4|not_...,2012.103825,0.208473,0.019576


In [5]:
%%R -w 800 -h 300 -u px -i df
# -w, -h and -u set the plot size in pixels; otherwise the plot runs off the page
library(ggplot2)

p <- ggplot(data=df, aes(x=date, y=distance)) + 
    geom_point(color="#51B695")+
    geom_smooth(method='lm',formula=y~x, color = "black", se=FALSE)+
    scale_x_continuous(limits=c(2008, 2018), breaks=c(2008,2010,2012,2014,2016,2018))+ 
    scale_y_continuous(limits=c(0,0.3))+
    labs(x="collection date",y="root to tip distance")+
    theme(plot.title = element_text(size=20, hjust=0.5))+
    theme(panel.grid.major.y=element_line(colour=NA))+
    theme(panel.grid.minor=element_line(colour=NA,size=NA))+    
    theme(strip.background = element_rect(colour=NA, fill=NA))+
    theme(axis.line.x=element_line(colour="black"))+
    theme(axis.line.y=element_line(colour="black"))+
    theme(strip.text.x=element_text(size=16))+
    theme(axis.title.y=element_text(size=16, vjust=8))+
    theme(axis.title.x=element_text(size=16, vjust=-12))+
    theme(axis.text=element_text(size=16, colour="black"))+
    theme(axis.text.x=element_text(size=16))+
    theme(legend.text=element_text(size=16))+
    theme(legend.title=element_text(size=16, face="plain"))+
    theme(panel.spacing=unit(1, "lines"))+  # panel.margin is deprecated in favour of panel.spacing
    theme(plot.margin=unit(c(1,1,1,1),"cm"))+
    theme(legend.key.size=unit(0.7, "cm"))+
    theme(panel.background=element_rect(fill=NA))+
    theme(legend.key=element_rect(fill=NA))

ggsave("root-to-tip-2018-10-07.pdf", p, width = 5, height = 3.5, device=pdf, path="/Users/lmoncla/Documents/Fellowships/K99/figures")
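
The black line drawn by `geom_smooth(method='lm')` is an ordinary least-squares fit of distance on date. In a root-to-tip regression the slope of that line is usually read as the evolutionary rate (substitutions per site per year) and the x-intercept as the estimated date of the root. The plot itself does not report those numbers, so below is a minimal sketch of how they could be recovered in Python. The function name `root_to_tip_regression` and the toy values are hypothetical and are not taken from the H5 HA dataset; with the real data, the inputs would be `df["date"]` and `df["distance"]`.

```python
import numpy as np

def root_to_tip_regression(dates, distances):
    """Least-squares fit of root-to-tip distance against sampling date.

    Hypothetical helper for illustration; it is not part of TempEst or BEAST.
    """
    dates = np.asarray(dates, dtype=float)
    distances = np.asarray(distances, dtype=float)

    # Degree-1 polynomial fit: distance = slope * date + intercept
    slope, intercept = np.polyfit(dates, distances, deg=1)

    # Coefficient of determination, a rough measure of clock-like signal
    predicted = slope * dates + intercept
    ss_res = np.sum((distances - predicted) ** 2)
    ss_tot = np.sum((distances - distances.mean()) ** 2)

    return {
        "rate": slope,                    # substitutions/site/year
        "root_date": -intercept / slope,  # x-intercept of the fitted line
        "r_squared": 1 - ss_res / ss_tot,
    }

# Made-up values for illustration only (not from the real dataset)
toy_dates = [2010.0, 2012.0, 2014.0, 2016.0]
toy_distances = [0.10, 0.11, 0.12, 0.13]

fit = root_to_tip_regression(toy_dates, toy_distances)
print(fit)
```

On the real data this would be called as `root_to_tip_regression(df["date"], df["distance"])`. The values should be close to what TempEst reports for the same rooting, though small differences are possible if TempEst handles anything differently from a plain least-squares fit.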
