# Table of Contents
1. [Introduction](#Introduction)
2. [Setting Up Environment](#Setting-Up-Environment)
3. [Getting Data](#Getting-Data)
4. [Descriptive Analysis](#Descriptive-Analysis)
    1. [Independent Variables](#Independent-Variables)
    2. [Dependent Variables](#Dependent-Variables)
5. [Inferential Analysis](#Inferential-Analysis)
    1. [_Sceloporus jarrovii_](#Sceloporus-jarrovii)
    2. [_Sceloporus virgatus_](#Sceloporus-virgatus)
6. [Conclusions](#Conclusions)
7. [Discussion](#Discussion)

## Introduction 

[Back to TOC](#Table-of-Contents)

## Setting Up Environment

[Back to TOC](#Table-of-Contents)

In [1]:
import pandas as pd
import numpy as np
import scipy.stats as ss
import os, glob, logging
# Local helper module; this notebook uses its distribution() and review() functions.
from summary_functions import *
import plotly
import chart_studio.plotly as py
import plotly.figure_factory as ff
import plotly.graph_objs as go
from plotly.offline import download_plotlyjs, init_notebook_mode, plot, iplot

init_notebook_mode(connected=True)
pd.options.display.max_columns = 100
pd.options.display.max_colwidth = 1500


## Getting Data

[Back to TOC](#Table-of-Contents)

In [2]:
df = pd.read_csv('TRT 2008-2016 analysis.csv')

In [5]:
print('There are {} rows in the file and {} columns.\n{}'.format(df.shape[0], df.shape[1], list(df.columns)))

There are 144 rows in the file and 15 columns.
['Unnamed: 0', 'year', 'Treatment', 'paintmark', 'TRTmin', 'Species', 'sex', 'SVL', 'TL', 'RTL', 'RemTL', 'RemMass', 'propRemTL', 'Treatment2', 'TRTminLog']
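
The `Unnamed: 0` column in the listing above is typically what pandas produces when a CSV that was saved with its row index is read back without `index_col`. A minimal sketch, assuming that is what happened here (the small in-memory CSV is made up for illustration and is not the study data):

```python
import io
import pandas as pd

# Hypothetical stand-in for a CSV that was written with its row index.
csv_text = ",year,Species,TRTmin\n0,2008,sv,0.26\n1,2011,sj,0.29\n"

# Default read: the unlabeled index column comes back as 'Unnamed: 0'.
raw = pd.read_csv(io.StringIO(csv_text))

# Treating the first column as the index avoids the extra column.
clean = pd.read_csv(io.StringIO(csv_text), index_col=0)

print(list(raw.columns))
print(list(clean.columns))
```

If the first column of the real file turns out to hold something other than a saved index, it should be inspected before it is discarded.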


## Descriptive Analysis 
1. [Independent Variables](#Independent-Variables)
2. [Dependent Variables](#Dependent-Variables)

[Back to TOC](#Table-of-Contents)

### Independent Variables 

The independent variables include 'year', 'Treatment', 'paintmark', 'Species', 'sex', 'SVL', 'TL', 'RTL', 'RemTL', 'RemMass', 'propRemTL', and 'Treatment2'.

[Back to Descriptive Analysis](#Descriptive-Analysis); [Back to TOC](#Table-of-Contents)

In [21]:
IV = ['year', 'Treatment', 'paintmark', 'Species', 'sex', 'SVL', 'TL', 'RTL', 'RemTL', 'RemMass', 'propRemTL', 'Treatment2']
for v in IV:
    # Summarize each variable separately for each species.
    grouped = df.groupby('Species')[v]
    print("\n\nReport for '{}':\nDESCRIBE: \n{}\nREVIEW: \n{} "
          .format(v, grouped.apply(distribution), grouped.apply(review)))

ERROR:root:x must contain only numeric, or NoneType variables:
 x:
0        wI1b
1       wJ14b
2        wK2b
3        wK4b
4        wL3b
5        yA1b
6        yA6b
7        yA7b
8        yB3b
9        yC1b
10       yG3b
11       yG5b
25       wJ3b
26       wL2b
27       wL5b
28       yA2b
29       yB1b
30       yB2b
31       yC4b
32     yD2b.t
33       yE4b
43       y21b
44        y9b
54       y10b
56       y13b
59        y2b
60        y4b
73       wI4b
74      wJ11b
75      wJ12b
        ...  
94       o19b
97       o22b
99       o24b
102      o28b
104      o30b
106      o33b
107      o34b
108      o37b
115      w44b
120       w4b
121       w5b
122       w7b
125       w8b
127       w9b
128      w10b
129      w11b
130      w12b
131      w13b
132      w15b
133      w16b
134      w18b
135      w17b
136      w19b
137      w20b
138      w21b
139      w22b
140      w23b
141      w24b
142      w25b
143      w26b
Name: sj, Length: 75, dtype: object
 the following values in x are non-numeric:

ERROR:root:x must contain only numeric, or NoneType variables:
 x:
12     Intact
13     Intact
14     Intact
15     Intact
16     Intact
17     Intact
18     Intact
19     Intact
20     Intact
21     Intact
22     Intact
23     Intact
24     Intact
34        50%
35        50%
36        50%
37        50%
38        50%
39        50%
40        50%
41        50%
42        50%
45        25%
46        25%
47        25%
48        25%
49        25%
50        NaN
51        25%
52        25%
        ...  
66        75%
67        75%
68        75%
69        75%
70        75%
71        75%
72        75%
89        25%
92        NaN
93        50%
95        25%
96        25%
98        25%
100       25%
101       25%
103       25%
105       25%
109       25%
110       25%
111       25%
112       25%
113       50%
114       25%
116       25%
117       50%
118       50%
119       50%
123       25%
124       50%
126       25%
Name: sv, Length: 69, dtype: object
 the following values in x are non-numeric:



Report for 'year':
DESCRIBE: 
            n  minimum  maximum  median  siqr         mean     stdev
Species                                                             
sj      0  75     2007     2016  2011.0   4.5  2011.440000  4.156532
sv      0  69     2008     2016  2012.0   1.5  2012.753623  2.068004
REVIEW: 
Species
sj                (Unique types include the following: {<class 'int'>}, Unique values include:{2016, 2015, 2011, 2007}, OK)
sv    (Unique types include the following: {<class 'int'>}, Unique values include:{2016, 2008, 2010, 2011, 2012, 2015}, OK)
Name: year, dtype: object 


Report for 'Treatment':
DESCRIBE: 
            n  minimum  maximum  median  siqr      mean     stdev
Species                                                          
sj      0  75      1.0      5.0     4.0   1.0  3.266667  1.297954
sv      0  69      1.0      4.0     3.0   1.0  2.840580  1.195865
REVIEW: 
Species
sj    (Unique types include the following: {<class 'float'>}, Unique values includ
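
The two `ERROR:root` messages above are logged when a numeric summary is applied to columns of text labels (paint marks such as `wI1b`, treatment labels such as `Intact` or `50%`). One way to avoid them, sketched here under the assumption that only numeric columns need `distribution`, is to split the variable list by dtype first. The toy frame below is hypothetical and only mimics the column types; its `SVL` values are invented.

```python
import pandas as pd
from pandas.api.types import is_numeric_dtype

# Hypothetical toy frame mimicking the dtypes in the real data.
toy = pd.DataFrame({
    'Species': ['sj', 'sj', 'sv', 'sv'],
    'SVL': [61.0, 58.5, 49.0, 52.5],
    'paintmark': ['wI1b', 'yA1b', 'o19b', 'w4b'],
    'Treatment2': ['Intact', '50%', '25%', None],
})

candidates = ['SVL', 'paintmark', 'Treatment2']
numeric_vars = [c for c in candidates if is_numeric_dtype(toy[c])]
label_vars = [c for c in candidates if c not in numeric_vars]

# Numeric columns get summary statistics; label columns get counts per group.
print(toy.groupby('Species')[numeric_vars].describe())
for c in label_vars:
    print(toy.groupby('Species')[c].value_counts(dropna=False))
```

Counting labels per species keeps missing values visible (`dropna=False`), which matters here because the treatment labels contain `NaN` entries.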

### Dependent Variables 

[Back to Descriptive Analysis](#Descriptive-Analysis); [Back to TOC](#Table-of-Contents)

In [20]:
DV = ['TRTmin', 'TRTminLog']
for v in DV:
    # Summarize each variable separately for each species.
    grouped = df.groupby('Species')[v]
    print("\n\nReport for '{}':\nDESCRIBE: \n{}\nREVIEW: \n{} "
          .format(v, grouped.apply(distribution), grouped.apply(review)))



Report for 'TRTmin':
DESCRIBE: 
            n  minimum  maximum  median    siqr      mean     stdev
Species                                                            
sj      0  75     0.13     0.71    0.29  0.0675  0.289200  0.120805
sv      0  69     0.03     0.51    0.26  0.0700  0.258986  0.100968
REVIEW: 
Species
sj                                  (Unique types include the following: {<class 'float'>}, Unique values include:{0.13, 0.32, 0.36, 0.29, 0.31, 0.24, 0.42, 0.19, 0.25, 0.7, 0.34, 0.26, 0.18, 0.43, 0.68, 0.28, 0.2, 0.17, 0.21, 0.15, 0.23, 0.16, 0.46, 0.71, 0.30000000000000004, 0.14, 0.4, 0.41, 0.33}, OK)
sv    (Unique types include the following: {<class 'float'>}, Unique values include:{0.19, 0.07, 0.24, 0.25, 0.27, 0.28, 0.4, 0.3, 0.26, 0.43, 0.34, 0.51, 0.18, 0.11, 0.36, 0.2, 0.37, 0.29, 0.15, 0.22, 0.17, 0.09, 0.16, 0.23, 0.21, 0.03, 0.38, 0.30000000000000004, 0.14, 0.31, 0.08, 0.32, 0.49, 0.33}, OK)
Name: TRTmin, dtype: object 


Report for 'TRTminLog':
DESCRIBE: 
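
The `DESCRIBE` tables report `n`, `minimum`, `maximum`, `median`, `siqr`, `mean`, and `stdev` per species. The `distribution` helper itself is not shown in this notebook, so the following is only a sketch of how such a table could be produced, assuming `siqr` means the semi-interquartile range, (Q3 - Q1) / 2, and `stdev` is the sample standard deviation. The function name `summarize` and the toy values are hypothetical.

```python
import pandas as pd

def summarize(x: pd.Series) -> pd.Series:
    """Summary statistics for one numeric series (missing values ignored)."""
    x = x.dropna()
    q1, q3 = x.quantile(0.25), x.quantile(0.75)
    return pd.Series({
        'n': x.size,
        'minimum': x.min(),
        'maximum': x.max(),
        'median': x.median(),
        'siqr': (q3 - q1) / 2,   # assumed definition: semi-interquartile range
        'mean': x.mean(),
        'stdev': x.std(ddof=1),  # assumed: sample standard deviation
    })

# Hypothetical toy data, not the study measurements.
toy = pd.DataFrame({
    'Species': ['sj', 'sj', 'sj', 'sj', 'sv', 'sv', 'sv', 'sv'],
    'TRTmin': [0.1, 0.2, 0.3, 0.4, 0.2, 0.2, 0.4, 0.4],
})

# One row of statistics per species.
table = toy.groupby('Species')['TRTmin'].apply(summarize).unstack()
print(table)
```

The median and semi-interquartile range are less sensitive to extreme values than the mean and standard deviation, which is presumably why both pairs appear in the report.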

## Inferential Analysis

1. [_Sceloporus jarrovii_](#Sceloporus-jarrovii)
2. [_Sceloporus virgatus_](#Sceloporus-virgatus)

[Back to TOC](#Table-of-Contents)

### _Sceloporus jarrovii_

[Back to Inferential Analysis](#Inferential-Analysis); [Back to TOC](#Table-of-Contents)

### _Sceloporus virgatus_

[Back to Inferential Analysis](#Inferential-Analysis); [Back to TOC](#Table-of-Contents)

## Conclusions

[Back to TOC](#Table-of-Contents)

## Discussion

[Back to TOC](#Table-of-Contents)