-
Notifications
You must be signed in to change notification settings - Fork 5
/
index.Rmd
390 lines (239 loc) · 57.1 KB
/
index.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
---
title: "An integrated phenology modelling framework in R"
subtitle: "modelling vegetation phenology with phenor"
author: "Koen Hufkens, David Bassler, Tom milliman, Eli Melaas, Andrew D. Richardson"
site: "bookdown::bookdown_site"
output:
bookdown::gitbook:
config:
toc:
collapse: subsection
scroll_highlight: yes
before: null
after: null
toolbar:
position: fixed
edit : null
download: null
search: yes
fontsettings:
theme: white
family: sans
size: 2
sharing:
facebook: yes
twitter: yes
all: ['facebook', 'twitter']
documentclass: book
---
# Abstract
1. Phenology is a first order control on productivity and mediates the biophysical environment by altering albedo, surface roughness length and evapotranspiration. Accurate and transparent modelling of vegetation phenology is therefore key in understanding feedbacks between the biosphere and the climate system.
2. Here, we present the phenor R package and modelling framework. The framework leverages measurements of vegetation phenology from four common phenology observation datasets, the PhenoCam network, the USA National Phenology Network (USA-NPN), the Pan European Phenology Project (PEP725), MODIS phenology (MCD12Q2) combined with (global) retrospective and projected climate data.
3. We show an example analysis using the phenor modelling framework which quickly and easily compares 20 included spring phenology models for three plant functional types. An analysis of model skill using the root mean squared (RMSE) error shows little or no difference regardless of model structure, corroborating previous studies. We argue that addressing this issue will require novel model development combined with easy data assimilation as facilitated by our framework.
4. In conclusion, we hope the phenor phenology modelling framework in the R language and environment for statistical computing will facilitate reproducibility and community driven phenology model development, in order to increase their overall predictive power, and leverage an ever growing number of phenology data products.
Note that this is a submitted version of the final manuscript so small changes can apply. However, the main message has remained largely unaltered.
# Introduction
Seasonal leaf development, or vegetation phenology, is strongly linked to seasonal changes in temperature and considered an indicator of climate change. Currently, rising temperatures due to climate change have moved spring forward in time by 2.3 to 5.2 days per decade since the 1970s (Rosenzweig et al. 2007). Vegetation phenology hereby does not only disproportionately influence ecosystem productivity by advancing and delaying the season (Richardson et al. 2010, 2013), it also changes canopy properties such as albedo and atmospheric boundary layer properties (Sakai et al. 1997; Hollinger et al. 1999). As such, models of seasonal leaf development, rigorously validated against in-situ observations, are key to understanding how climate change will affect ecosystem productivity and biophysical vegetation properties.
Luckily, phenology has been recorded by amateurs and professionals, such as national meteorological institutions, supporting contemporary analysis of past or ongoing climate change (Chuine et al. 2004). Recently, individual observations have been formalized into rigorous citizen science efforts through for example USA National Phenology Network (USA-NPN; https://www.usanpn.org/; Betancourt et al. 2005), Project Budburst (http://budburst.org/). In addition, automated camera networks (i.e., the PhenoCam network, https://phenocam.sr.unh.edu/; (Richardson et al. 2018)) or remote sensing (Zhang et al. 2003) provide a canopy wide continuous way of evaluating the development of vegetation across larger areas in a consistent and continuous fashion (White et al. 2009; Melaas et al. 2016). Numerous studies have demonstrated the value of the PhenoCam derived Gcc index, a measure of vegetation greenness as percentage green within a digital image, for characterizing the seasonal trajectory of vegetation color and activity (Keenan et al. 2014; Klosterman et al. 2014; Toomey et al. 2015; Hufkens et al. 2016). Similarly., the MODIS MCD12Q2 phenology product has been a proven source of phenological data (Chen et al. 2016).
These observations of vegetation phenology allow us to estimate changes in the timing of vegetation development in response year to year variation in weather as well as climate change and climate variability (Chuine et al. 2004; Vitasse et al. 2009; Melaas et al. 2016). Most process-based models try to simulate the various internal and environmental influences and, to various degrees, take into account whole plant physiological status (paradormancy), internal factors of developing bud (endodormancy) and external factors driving or suppressing seasonal development (ecodormancy)(Lang et al. 1987).
One of the first such ecodormancy models was the growing degree day model as proposed by De Reaumur dating back to 1735. Although vegetation phenology is often driven by temperature multiple additional constraints have been proposed including daylength, chilling degrees, precipitation, relative humidity or vapour pressure deficit (Hunter & Lechowicz 1992; Chuine & Cour 1999; García-Mozo et al. 2009; Laube et al. 2013, 2014; Xin et al. 2015). Similarly, fall senescence has been modelled using chilling degree days with additional constraints such as daylength (Archetti et al. 2013; Jeong & Medvigy 2014; Gill et al. 2015). These various models are either used in isolation to address particular physiological questions or included in land surface models to scale phenological processes (Richardson et al. 2011). Model development, in isolation or coupled to larger land surface models, often integrate multiple environmental drivers which increases model complexity (Jeong & Medvigy 2014; Chen et al. 2016). Yet, models which include more complex concepts, based upon growing degree days, do not necessarily perform better than a simple regression based approach. As such, model structures still explain a limited amount of the year-to-year variability, and fail to generalize well (Schaber & Badeck 2003; Linkosalo et al. 2006; Fisher et al. 2007; Clark et al. 2014; Basler 2016). For example, model studies have shown that biologically “incorrect” models can be parameterized to provide good predictions but lacking any biological representation (Hunter & Lechowicz 1992). A study by Migliavacca et al. (2012) has shown that between-model differences by the end of the century are almost as large as differences between-climate scenario values. As a consequence, different model assumptions will behave disproportionately different under future scenarios affecting their potential impacts and uncertainties (Migliavacca et al. 2012).
With vegetation phenology as a first order control on ecosystem productivity, accurate and transparent model predictions of vegetation phenology in a changing climate are key. In order to facilitate easy model comparison and future development of new models we developed the phenor model framework for the R language and environment for statistical computing (R Core Team 2016). The phenor R package assimilates four important phenological records across a variety of ecosystem and plant functional types. The assimilated datasets provide extensive coverage in the US and Europe and results can be easily scaled globally using various gridded data products made accessible through the software. Here, we provide a worked example for the phenor R package using the recent standardized PhenoCam dataset (Richardson et al. 2017; http://phenocam.us) to demonstrate the ease with which a suite of phenological models can be evaluated and scaled up from sites to regions and biomes, and extrapolated in both forecast and hindcast modes.
# Material and Methods
## The phenor R package
The phenor R package assimilates four important phenological records of either observational, near-surface and satellite remote sensing based records across a variety of ecosystem and plant functional types. The phenor R package combines data from near-surface remote sensing through the PhenoCam network using phenocamr and daymetr R packages into a phenology modelling framework which covers data preparation, model optimization and model visualization and consists of a number of key functions. In addition, data from the USA National Phenology Network (USA-NPN), the Pan European Phenology Project (PEP725) and the MODIS land surface phenology product (MCD12Q2) can be ingested. In the interest of brevity both phenocamr, daymetr R packages and the PhenoCam source data are described in Appendix S1.
The format_phenocam() phenor function combines phenophases, downloaded and generated by the phenocamr R package, with the climate data downloaded using the daymetr R package. The function requires the location (path) of the generated phenophase output files, together with parameters specifying the phenophase (direction = rising; with rising for spring or falling for autumn) and the threshold value used (threshold = 25), the Gcc percentile to use (gcc_value = 90, Sonnentag et al. 2012) and the offset as a day-of-year value. The offset is the day-of-year in the previous year on which to start reporting climate data, running until this day in the subsequent year. The function returns model calibration/validation and driver data a nested list of data frames (df), used in subsequent model optimization (see description of optimize_parameters() and model_calibration() below).
df = format_phenocam( path = “/path/to/phenocamr/phenophases/”,
direction = “rising”,
gcc_value = “gcc_90”,
threshold = 25,
offset = 264)
Similarly, the format_pep725() function uses PEP725 observational data together with European E-OBS climate data (Haylock et al. 2008) to compile a consistent calibration/validation dataset for European observational records (e.g., Basler 2016). Data can be downloaded using the download_pep725() function. We provide similar functionality for the USA-NPN data. Data can be downloaded through the USA-NPN application programming interface using download_npn() and correctly formatted with format_npn(). Furthermore, the format_modis() function correctly formats a directory of MODIS MCD12Q2 land surface phenology data (i.e., phenophases, Zhang et al. 2003) as downloaded with the MODISTools R package (Tuck et al. 2014).
Spatially scaling of model results is facilitated through a number of functions. The format_daymet() function uses gridded pre-processed Daymet tiles to generate spatially explicit driver data (download_daymet_tiles() and daymet_tmean() functions of daymetr, Appendix S1). The format_eobs() function provides the same functionality for the E-OBS climate data. Yet another source of hindcast data is compiled using the format_berkeley_earth() function, which allows the user to subset 1x1 degree global historical climate data for any year since 1850 through the Berkeley Earth project (Rohde et al. 2012). Similarly, the format_cmip5() function formats 1/4th degree NASA Earth Exchange (NEX) global gridded Coupled Model Intercomparison Project (CMIP5) data of historical reanalysis and representative concentration pathway (RCP 4.5 and RCP8.5) projections. Unlike format_phenocam() or format_modis() no calibration/validation data is included in the gridded spatial data.
The resulting dataset of all formatting functions is a nested list with the following layout:
phenor_data_structure
└─── doy (vector)
└─── site (vector, NULL for spatial data)
└─── location (matrix, latitude and longitude by row)
└─── ltm (vector, value: degrees C)
└─── Ti (matrix, columns: years, rows: doy, value: degrees C)
└─── Tmaxi (matrix, columns: years, rows: doy, value: degrees C)
└─── Tmini (matrix, columns: years, rows: doy, value: degrees C)
└─── Li (matrix, columns: years, rows: doy, value: hours/day)
└─── VPDi (matrix, columns: years, rows: doy, value: Pa)
└─── Pi (matrix, columns: years, rows: doy, value: mm/day)
└─── transition_dates (vector, NULL for spatial data)
└─── georeferencing (NULL for PhenoCam, MODIS, USA-NPN or PEP725 data)
└─── size (size of the spatial data)
└─── extent (extent of the spatial data)
└─── projection (projection of the spatial data)
Within the phenor data structure the top level is a particular site. For each site critical parameters such as the day-of-year range (doy, as specified by the offset), the geographic location (or georeferencing), and matrices holding, minimum temperature (Tmini), maximum temperature (Tmaxi) and mean daily temperature (Ti), precipitation (Pi), vapour pressure deficit (VPDi), daylength in hours (Li), and calibration/validation transition date (transition_dates) data are provided. Matrices are organized with columns representing a given year, and rows representing a given day-of-year. Other data are represented as vectors matching the number of columns present in the climate data matrices. Where necessary, data is truncated to match the available climate data. When certain data sources are missing the content of a field is set to NULL.
The optimize_parameters() function in the package allows for the easy optimization of model parameters. This function uses two common optimizers, GenSA (Xiang et al. 2013) and rgenoud (Mebane & Sekhon 2011). The GenSA algorithm combines both the Boltzmann machine and faster Cauchy machine simulated annealing approaches for fast optimizations (Tsallis & Stariolo 1995), while the genoud routine combines an evolutionary algorithm with a derivative based (quasi-Newton) method to solve difficult optimization problems (Mebane & Sekhon 2011). To optimize a calibration/validation dataset (df) one specifies a particular model (e.g., the Thermal Time model, TT), a defined optimizer (e.g., GenSA), an objective function such the root mean squared error (RMSE) and upper-lower parameter limits as well as parameter starting values when required. Additional control parameters, such as the maximum number or iterations (e.g., max.call), can be provided using a list of options to the control parameter. An example function call to optimize a the TT model is provided below.
optimal_par = optimize_parameters( par = NULL,
data = df,
cost = rmse,
model = “TT”,
method = “GenSA”,
lower = c(1,-5,0),
upper = c(365,10,2000),
control = list(max.call = 40000))
Final predicted values for the optimized parameters can be retrieved by running the estimate_phenology() function with the optimized parameters.
results = estimate_phenology(par = optimal_par$par,
data = df,
model = “TT”)
The output will automatically be formatted as a map of phenology dates or a vector, depending on the input data class. However, running models across all grid cells of spatial data would provide a naively broad representation of land surface phenology. For example, only a small subset of the US is dominated by any particular plant functional type (PFT), such as deciduous broadleaf forests. In order to better differentiate between different dominant PFT we include a function land_cover_density() which calculates the percentage coverage of a particular MODIS MCD12Q1 IGBP land cover class (Friedl et al. 2010) within a given raster cell for a given location (i.e., CMIP5 data, see Figure \@ref(fig:spatial_scaling)).
A wrapper function, model_calibration(), is provided for both the optimize_parameters() and estimate_phenology() functions which integrates the previously described steps providing both summary statistics (RMSE and AICc) and a plot (Figure \@ref(fig:validation) ) of the model fit. Likewise, the model_comparison() function serves as a wrapper for multi-model parameter optimization runs. For a visual comparison limited to two model runs we provide an arrow plot function, arrow_plot(), displaying directional changes in the modelled values between the model outputs (Figure \@ref(fig:arrow) ).
```{r validation, echo = FALSE, fig.cap = 'Model calibration output'}
knitr::include_graphics("output/Figure_1_model_validation.png")
```
```{r arrow, echo = FALSE, fig.cap = 'Arrow plot'}
knitr::include_graphics("output/Figure_2_arrow_plot.png")
```
## A worked example: a quick model comparison
As a worked example we partially recreate the spring phenology model comparison by Basler (2016) using PhenoCam data. However, we note that a similar exercise could be executed with any of the other phenology data sources available through phenor. The model structures included in this worked example can be described by the three broad categories: (1) as simple linear regression to spring temperature (2) models explaining ecodormancy release only, (3) models explaining the release of endo- and ecodormancy. A reference NULL model assumes a fixed mean date of leaf unfolding.
A total of 22 phenology models are included in the package (table 1). These include 20 spring phenology models including precipitation driven models, one fall senescence chilling degree day model and one grassland pollen release model. In our worked example of the phenor R package we will focus only on the 20 spring phenology models. A full list of the model structures and parameter ranges for the models are provided in Table 2 of Appendix S2 and included in the phenor library.
TABLE 1
For this study we combined spring phenology dates based on PhenoCam 3-day summary data from the standardized PhenoCam Dataset (Richardson et al. 2017) with Daymet data (Thornton et al. 2017) for three common PFTs, deciduous broadleaf forests, evergreen needleleaf forest and grasslands. A total of 102 sites and 508 site years were included in our calibration/validation dataset, of which 63 were deciduous broadleaf forest sites (358 site years), 18 were evergreen needleleaf forest sites (63 site years) and 21 were grasslands (88 site years). Deciduous broadleaf sites which are moisture limited, and all sites outside Daymet coverage, were excluded from our analysis. The final selected sites span a large geographic area ranging from New Mexico to Southern Alaska, and Maine to California (Figure \@ref(fig:location)).
```{r location, echo = FALSE, fig.cap = 'Site locations'}
knitr::include_graphics("output/Figure_3_site_location_map.png")
```
We acknowledge that phenological development as measured using PhenoCam data represent different physiological processes for different PFTs. For example, the phenology of deciduous forests or grasslands is closely linked to the development of new leaf tissues (Keenan et al. 2014; Hufkens et al. 2016) where evergreen forest phenology is determined by dehardening of existing needles at the end of the winter season. Thus, optimized model parameters and their interpretation are specific to each PFT.
For all PFTs, model optimization were executed using the default generalized simulated annealing (GenSA) package minimizing the RMSE between the greenness rising PhenoCam phenophase estimations and model predictions (see Appendix S1). The optimizer was run for 40 000 iterations with a starting temperature of 10 000. To determine the influence of locations at the margin of the forest biome on model optimizations a subset of sites centrally located within the deciduous forest biome was created (Melaas et al. 2016, Appendix S2 Table 2). This subset was optimized separately and compared to results for the complete deciduous broadleaf dataset. We assess proper convergence of the optimized parameters by initializing the optimizer using 12 random sets of parameters. We report mean and standard deviations of the RMSE between observations and predictions on the optimized parameter values for all PFTs and the subset generated for deciduous forest sites. We compare model performance with a log transformed ANOVA, combined with a post-hoc Tukey HSD test. Model errors are evaluated for normality using a Shapiro-Wilk test.
For illustrative purposes we produce overview maps (Figure \@ref(fig:spatial_scaling)) of spatial patterns both in hindcast and forecast mode. In hindcast mode we use 1x1 km Daymet gridded data across New England, while we present the difference in predicted spring phenology (ΔDOY) between years 2100 and 2011 for forecast CMIP5 IPSL-CM5A-MR (Mid-Resolution Institut Pierre Simon Laplace Climate Model 5) model runs, across the contiguous US. The Thermal Time (TT) and Accumulated Growing Season Index models were optimized for deciduous broadleaf and grassland PhenoCam data, respectively. For forecast data only pixels dominated by their particular PFT are displayed, limiting a naively broad interpretation of the results.
Our comparison of 20 spring phenology models across three PFT showed that most models were significantly different from the NULL model, with the exception of the SGSI model in the evergreen PFT (post-hoc Tukey HSD test, p < 0.001, Figure \@ref(fig:comparison)). The model performance of the centrally located deciduous broadleaf sites was marginally greater (RMSE: ~7.6±0.7 days) compared to the complete deciduous broadleaf dataset (RMSE: ~7.9±1.2 days). This difference between the full deciduous broadleaf forest dataset and those more centrally within the biome suggests an influence of geographic location on model error.
```{r comparison, echo = FALSE, fig.cap = 'model comparison'}
knitr::include_graphics("output/Figure_4_model_comparison.png")
```
The influence of different model structures on individual values was visualized using the arrow plot (Figure \@ref(fig:arrow)) between two model runs. When visually comparing the Thermal Time (TT) and the Photothermal Time (PTT) models small changes are noted (\@ref(fig:arrow)). Both models accumulate growing degree days where the PTT model adds a photoperiod component to the original TT model. In the PTT model leaf unfolding is therefore in part dependent on a daylength requirement in addition to thermal forcing. In our example, the addition of a daylength requirement shifts model results for early and late developing plants toward earlier leaf out dates, while at the same time shifting mid season developing plants toward later leaf out dates. Despite these changes overall model accuracy remains the same. A description of all statistical results is provided in Appendix S1, section 5.
```{r spatial_scaling, echo = FALSE, fig.cap = 'spatial scaling'}
knitr::include_graphics("output/Figure_5_spatial_runs.png")
```
# Discussion
Here, we have demonstrated how the phenor R package and its included “model zoo”, together with consistent estimates of vegetation phenology through PhenoCam network (e.g., phenocamr, Appendix S1) or other phenology data sources can be leveraged for a fast and transparent model comparisons. More so, easy access to various gridded data sources allows for quick spatial scaling of optimized models in both hindcast and forecast mode (Figure \@ref(fig:spatial_scaling)). For example, the code required to partially reproduce a study by Basler (2016) relied on a mere 15 R commands (see run_model_comparison.r in the phenor manuscript github repository), while the models used are easily readable and well documented. Furthermore, adding model structures is easy compared to other frameworks which rely on either low level languages, are closed source or do not work cross-platform (Hänninen & Kramer 2007; Chuine et al. 2013; Brown et al. 2014)`. More so, to execute our complete case study reasonable processing times were recorded (~ 48 CPU hours on a recent desktop workstation) although relying on a slower scripting language, while computational loads for data generation and processing at a global scale remain marginal. The case study demonstrates the ease with which we executed our model comparisons in phenor, corroborating previous studies and once more highlighting the limitations of current model structures in explaining year-to-year variability (Linkosalo et al. 2006; Fisher et al. 2007; Clark et al. 2014; Basler 2016). This result therefore underscores the need for tools such as phenor to facilitate easy and transparent model development and comparison. Furthermore, new visualization tools such as the arrow plot might help in this process. The arrow plot (Figure \@ref(fig:arrow)) suggests that the assessment of model skill through summarizing values such as RMSE seem suboptimal, hiding structural differences in model performance hiding important information on model performance. The non-normal distribution of model errors within all models (p < 0.001) suggests as much.
# Known limitations
We acknowledge that previous efforts have been made to provide phenology model frameworks (Hänninen & Kramer 2007; Chuine et al. 2013; Brown et al. 2014). Yet, their use and interoperability and scaleability is limited due to platform restrictions or their closed source nature. We recognize that the models as currently presented are by no means an exhaustive list of all model structures found in literature. However, the phenor R package is open source and adding model structures is easy compared to other low level languages (e.g., C / Fortran) and is actively encouraged. We are aware that the phenor R package does not include all possible phenological climatological drivers or phenological calibration/validation data either, although we stress that access to four larger freely available phenological data sources is provided by the phenor R package. Similarly, other sources of climate data, such as the ERA-interim re-analysis data (Dee et al. 2011), could be integrated as long as the described data structure is followed.
# Conclusion
Accurately representing vegetation phenology, a first order control on ecosystem productivity, under future conditions is key in understanding feedbacks between the climate and the biosphere. Here, we demonstrated the advantages of the phenor R package and modelling framework, through a worked example, by quickly and easily comparing 20 spring phenology models and their model skill for three plant functional types. Our results corroborate previous analysis, showing little or no difference in predictive power between models, which suggest convenient tools for further analysis or novel model development are needed to capture current and future phenological changes as well as their underlying physiological processes. We hope the phenor phenology modelling framework in R will allow for a better integration of observational and experimental data providing opportunities to better understand the environmental factors driving seasonality, and past and future responses of vegetation to climate change and variability.
# Contributions
K.H. conceived and designed all three packages with contributions of D.B.; T.M. developed the application programming interface queried by phenocamr; K.H. analyzed the data and interpreted the results; K.H., D.B., E.M., T.M., A.D.R. interpreted the results. K.H. drafted the manuscript. All authors commented on and approved the final manuscript.
# Acknowledgements
The Richardson Lab acknowledges support from the NSF Macrosystems Biology programme (awards EF-1065029 and EF-1702697). D. B. acknowledges the Harvard Forest Bullard Fellowship programme. K.H. acknowledges support from the LabEx COTE MicroMic project and BELSPO Brain programme (project BR/175/A3/COBECORE). We thank ORNL, the Daymet team and Michele M. Thornton for the continued support in developing daymetr. We thank the World Climate Research Program’s Working Group on Coupled Modelling, which is responsible for CMIP, and the climate modelling groups for producing and making available their model output. We are grateful to the US Department of Energy’s Program for Climate Model Diagnosis and Inter-comparison provides coordinating support and led development of software infrastructure in partnership with the Global Organization for Earth System Science Portals. Climate scenarios used were from the NEX-GDDP dataset, prepared by the Climate Analytics Group and NASA Ames Research Center using the NASA Earth Exchange, and distributed by the NASA Center for Climate Simulation (NCCS). We thank the NASA Earth Exchange project for making these data available. Finally, we thank our many collaborators, including site PIs and technicians, for their efforts in support of PhenoCam.
# Code & Data availability
All three packages can be found on the author's [personal github](http://github.com/khufkens) and are easily installed and loaded in the R statistical software using the following commands.
if(!require(devtools)){ install.package(devtools) }
devtools::install_github("khufkens/daymetr")
devtools::install_github("khufkens/phenocamr")
devtools::install_github("khufkens/phenor")
The data used in the worked examples are freely available as a curated dataset (Richardson et al. 2017). Manuscript data and figures can be generated using the R scripts listed in the [manuscript repository](https://github.com/khufkens/phenor_manuscript/).
# Reference
Archetti, M., Richardson, A.D., O’Keefe, J. & Delpierre, N. (2013). Predicting climate change impacts on the amount and duration of autumn colors in a New England forest. PloS one, 8, e57373.
Basler, D. (2016). Evaluating phenological models for the prediction of leaf-out dates in six temperate tree species across central Europe. Agricultural and Forest Meteorology, 217, 10–21.
Betancourt, J.L., Schwarz, M., Breshears, D.D., Cayan, D.R., Dettinger, M., Inouye, D.W., Post, E. & Reed, B.C. (2005). Implementing a U.S. National Phenology Network. Eos, Transactions American Geophysical Union, 86, 538.
Blümel, K. & Chmielewski, F.-M. (2012). Shortcomings of classical phenological forcing models and a way to overcome them. Agricultural and Forest Meteorology, 164, 10–19.
Brown, H.E., Huth, N.I., Holzworth, D.P., Teixeira, E.I., Zyskowski, R.F., Hargreaves, J.N.G. & Moot, D.J. (2014). Plant Modelling Framework: Software for building and running crop models on the APSIM platform. Environmental Modelling and Software, 62, 385–398.
Cannell, M.G.. G.R., Smith, R.I.I., Society, B.E. & Ecology, A. (1983). Thermal time, chill days and prediction of budburst in Picea sitchensis. Journal of applied Ecology, 20, 951–963.
Chen, M., Melaas, E.K., Gray, J.M., Friedl, M.A. & Richardson, A.D. (2016). A new seasonal-deciduous spring phenology submodel in the Community Land Model 4.5: impacts on carbon and water cycling under future climate scenarios. Global Change Biology, 22, 3675–3688.
Chuine, I. (2000). A unified model for budburst of trees. Journal of theoretical biology, 207, 337–47.
Chuine, I. & Cour, P. (1999). Climatic determinants of budburst seasonality in four temperate-zone tree species. New Phytologist, 143, 339–349.
Chuine, I., Cour, P. & Rousseau, D.D. (1999). Selecting models to predict the timing of flowering of temperate trees: implications for tree phenology modelling. Plant, Cell and Environment, 22, 1–13.
Chuine, I., Garcia de Cortazar Atauri, I., Kramer, K. & Hänninen, H. (2013). Plant Development Models. In: (ed. Schwarz MD). , Dordrecht, Netherlands, pp. 275-293. Phenology: An Integrative Environmental Science (ed M.D. Schwartz), pp. 275–293. Springer, Dordrecht.
Chuine, I., Yiou, P., Viovy, N., Seguin, B., Daux, V. & Ladurie, E.L.R. (2004). Historical phenology: Grape ripening as a past climate indicator. Nature, 432, 289–290.
Clark, J.S., Salk, C., Melillo, J. & Mohan, J. (2014). Tree phenology responses to winter chilling, spring warming, at north and south range limits (N. Anten, Ed.). Functional Ecology, 28, 1344–1355.
Črepinšek, Z., Kajfež-Bogataj, L. & Bergant, K. (2006). Modelling of weather variability effect on fitophenology. Ecological Modelling, 194, 256–265.
Dee, D.P., Uppala, S.M., Simmons, A.J., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M.A., Balsamo, G., Bauer, P., Bechtold, P., Beljaars, A.C.M., van de Berg, L., Bidlot, J., Bormann, N., Delsol, C., Dragani, R., Fuentes, M., Geer, A.J., Haimberger, L., Healy, S.B., Hersbach, H., Holm, E. V., Isaksen, L., K??llberg, P., K??hler, M., Matricardi, M., Mcnally, A.P., Monge-Sanz, B.M., Morcrette, J.J., Park, B.K., Peubey, C., de Rosnay, P., Tavolato, C., Th??paut, J.N. & Vitart, F. (2011). The ERA-Interim reanalysis: Configuration and performance of the data assimilation system. Quarterly Journal of the Royal Meteorological Society, 137, 553–597.
Fisher, J.I., Richardson, A.D. & Mustard, J.F. (2007). Phenology model from surface meteorology does not capture satellite-based greenup estimations. Global Change Biology, 13, 707–721.
Friedl, M.A., Sulla-Menashe, D., Tan, B., Schneider, A., Ramankutty, N., Sibley, A. & Huang, X.M. (2010). MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote Sensing of Environment, 114, 168–182.
García-Mozo, H., Galán, C., Belmonte, J., Bermejo, D., Candau, P., Díaz de la Guardia, C., Elvira, B., Gutiérrez, M., Jato, V., Silva, I., Trigo, M.M., Valencia, R. & Chuine, I. (2009). Predicting the start and peak dates of the Poaceae pollen season in Spain using process-based models. Agricultural and Forest Meteorology, 149, 256–262.
Gill, A.L., Gallinat, A.S., Sanders-DeMott, R., Rigden, A.J., Short Gianotti, D.J., Mantooth, J.A. & Templer, P.H. (2015). Changes in autumn senescence in northern hemisphere deciduous trees: A meta-analysis of autumn phenology studies. Annals of Botany, 116, 875–888.
Hänninen, H. (1990). Modelling bud dormancy release in trees from cool and temperate regions. Acta Forestalia Fennica, 213, 1–47.
Hänninen, H. & Kramer, K. (2007). A framework for modelling the annual cycle of trees in boreal and temperate regions. Silva Fennica, 41, 167–205.
Haylock, M.R., Hofstra, N., Klein Tank, A.M.G., Klok, E.J., Jones, P.D. & New, M. (2008). A European daily high-resolution gridded data set of surface temperature and precipitation for 1950-2006. Journal of Geophysical Research Atmospheres, 113.
Hollinger, D.Y., Goltz, S.M., Davidson, E.A., Lee, J.T., Tu, K. & Valentine, H.T. (1999). Seasonal patterns and environmental control of carbon dioxide and water vapour exchange in an ecotonal boreal forest. Global Change Biology, 5, 891–902.
Hufkens, K., Keenan, T.F., Flanagan, L.B., Scott, R.L., Bernacchi, C.J., Joo, E., Brunsell, N.A., Verfaillie, J. & Richardson, A.D. (2016). Productivity of North American grasslands is increased under future climate scenarios despite rising aridity. Nature Climate Change.
Hunter, A.F. & Lechowicz, M.J. (1992). Predicting the timing of budburst in temperate trees. Journal of Applied Ecology, 29, 597–604.
Jeong, S. & Medvigy, D. (2014). Macroscale prediction of autumn leaf coloration throughout the continental United States. Global Ecology and Biogeography, 23, 1245–1254.
Keenan, T., Darby, B., Felts, E., Sonnentag, O., Friedl, M., Hufkens, K., O’Keefe, J., Munger, J.W., Toomey, M. & Richardson, A.D. (2014). Tracking forest phenology and seasonal physiology using digital repeat photography: a critical assessment. Ecological Applications.
Klosterman, S.T., Hufkens, K., Gray, J.M., Melaas, E., Sonnentag, O., Lavine, I., Mitchell, L., Norman, R., Friedl, M. a. & Richardson, a. D. (2014). Evaluating remote sensing of deciduous forest phenology at multiple spatial scales using PhenoCam imagery. Biogeosciences Discussions, 11, 2305–2342.
Kramer, K. (1994). Selecting a model to predict the onset of growth of Fagus-sylvatica. Journal of Applied Ecology, 31, 172–181.
Landsberg, J.J. (1974). Apple Fruit Bud Development and Growth; Analysis and an Empirical Model. Annals of Botany, 38, 1013–1023.
Lang, G., Early, J., Martin, G. & Darnell, R. (1987). Endo-, para-, and ecodormancy: physiological terminology and classification for dormancy research. HortScience, 22, 371–377.
Laube, J., Sparks, T.H., Estrella, N., Höfler, J., Ankerst, D.P. & Menzel, A. (2013). Chilling outweighs photoperiod in preventing precocious spring development. Global Change Biology, n/a-n/a.
Laube, J., Sparks, T.H., Estrella, N. & Menzel, A. (2014). Does humidity trigger tree phenology? Proposal for an air humidity based framework for bud development in spring. New Phytologist, 202, 350–355.
Leinonen, I., Repo, T. & Hänninen, H. (1997). Changing Environmental Effects on Frost Hardiness of Scots Pine During Dehardening. Annals of Botany, 79, 133–137.
Linkosalo, T., Häkkinen, R. & Hänninen, H. (2006). Models of the spring phenology of boreal and temperate trees: Is there something missing? Tree physiology, 26, 1165–72.
Masle, J., Doussinault, G., Farquhar, G.D. & Sun, B. (1989). Foliar stage in wheat correlates better to photothermal time than to thermal time. Plant, Cell & Environment, 12, 235–247.
Mebane, W.R.J. & Sekhon, J.S. (2011). Genetic Optimization Using Derivatives: The rgenoud Package for R. Journal of Statistical Software, 42, 1–26.
Melaas, E.K., Friedl, M.A. & Richardson, A.D. (2016). Multiscale modeling of spring phenology across Deciduous Forests in the Eastern United States. Global Change Biology, 22, 792–805.
Migliavacca, M., Sonnentag, O., Keenan, T.F., Cescatti, a., O’Keefe, J. & Richardson, a. D. (2012). On the uncertainty of phenological responses to climate change, and implications for a terrestrial biosphere model. Biogeosciences, 9, 2063–2083.
Murray, M.B., Cannell, M.G.R. & Smith, R.I. (1989). Date of budburst of fifteen tree species in Britain following climatic warming. Journal of Applied Ecology, 26, 693–700.
R Core Team. (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.
De Reaumur, R.-A. (1735). Observations du thermometre, faites a Paris pendant l′annee 1735 comparees avec celles qui onte faites sous la ligne et al′Ile de France,a Alger et en quelques-unes de nos ıles de l′Amerique. Memoires de l’Academie Royale des Sciences de Paris, 1735, 545–576.
Richardson, A.D., Anderson, R.S., AltafArain, M., Barr, A.G., Bohrer, G., Chen, G., Chen, J.M., Ciais, P., Davis, K.J., Desai, A.R., Dietze, M.C., Dragoni, D., Maayar, M. El, Garrity, S., Gough, C.M., Grant, R., Hollinger, D.Y., Margolis, H. a., McCaughey, H., Migliavacca, M., Monson, R.K., William Munger, J., Poulter, B., Raczka, B.M., Ricciuto, D.M., Sahoo, A.K., Schaefer, K., Tian, H., Vargas, R., Verbeeck, H., Xiao, J. & Xue, Y. (2011). Terrestrial biosphere models need better representation of vegetation phenology: Results from the North American Carbon Program. Global Change Biology, n/a-n/a.
Richardson, A.D., Black, T.A., Ciais, P., Delbart, N., Friedl, M. a, Gobron, N., Hollinger, D.Y., Kutsch, W.L., Longdoz, B., Luyssaert, S., Migliavacca, M., Montagnani, L., Munger, J.W., Moors, E., Piao, S., Rebmann, C., Reichstein, M., Saigusa, N., Tomelleri, E., Vargas, R. & Varlagin, A. (2010). Influence of spring and autumn phenological transitions on forest ecosystem productivity. Philosophical transactions of the Royal Society of London. Series B, Biological sciences, 365, 3227–46.
Richardson, A.D., Hufkens, K., Milliman, T., Aubrecht, D.M., Chen, M., Gray, J.M., Johnston, M.R., Keenan, T.F., Klosterman, S.T., Kosmala, M., Melaas, E.K., Friedl, M.A. & Frolking, S. (2018). Tracking vegetation phenology across diverse North American biomes using PhenoCam imagery. Scientific Data.
Richardson, A.D., Keenan, T.F., Migliavacca, M., Ryu, Y., Sonnentag, O. & Toomey, M. (2013). Climate change, phenology, and phenological control of vegetation feedbacks to the climate system. Agricultural and Forest Meteorology, 169, 156–173.
Rohde, R., Muller, R., Jacobsen, R., Muller, E., Groom, D. & Wickham, C. (2012). A New Estimate of the Average Earth Surface Land Temperature Spanning 1753 to 2011. Geoinformatic & Geostatistics: An Overview, 1, 1–7.
Rosenzweig, C., Casassa, G., Karoly, D.J., Imeson, A., Liu, C., Menzel, A., Rawlins, S., Root, T.L., Seguin, B. & Tryjanowski, P. (2007). Climate change 2007: impacts, adaptation and vulnerability: Working Group II contribution to the Fourth Assessment Report of the IPCC Intergovernmental Panel on Climate Change. Working Group II Contribution to the Intergovernmental Panel on Climate Change Fourth Assessment Report (eds M.L. Parry, O.F. Canziani, J.P. Palutikof, P.J. van der Linden & C.E. Hanson), p. 976.
Sakai, R.K., Fitzjarrald, D.R. & Moore, K.E. (1997). Detecting leaf area and surface resistance during transition seasons. Agricultural and Forest Meteorology, 84, 273–284.
Schaber, J. & Badeck, F.-W. (2003). Physiology-based phenology models for forest tree species in Germany. International journal of biometeorology, 47, 193–201.
Thornton, P.E., Thornton, M.M., Mayer, B.W., Wilhelmi, N., Wei, Y., Devarakonda, R. & Cook, R.B. (2017). Daymet: Daily Surface Weather Data on a 1-km Grid for North America, Version 3. Data set. Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, USA. URL https://daac.ornl.gov/cgi-bin/dsviewer.pl?ds_id=1328
Toomey, M., Friedl, M. a, Frolking, S., Hufkens, K., Klosterman, S., Sonnentag, O., Baldocchi, D.D., Bernacchi, C.J., Biraud, S.C. & Richardson, A.D. (2015). Greenness indices from digital cameras predict the timing and seasonal dynamics of canopy-scale photosynthesis. Ecological Applications, 25, 99–115.
Tsallis, C. & Stariolo, D.A. (1995). Generalized Simulated Annealing. 233, 395–406.
Tuck, S.L., Phillips, H.R.P., Hintzen, R.E., Scharlemann, J.P.W., Purvis, A. & Hudson, L.N. (2014). MODISTools - downloading and processing MODIS remotely sensed data in R. Ecology and Evolution, 4, 4658–4668.
Vitasse, Y., Porté, A.J., Kremer, A., Michalet, R. & Delzon, S. (2009). Responses of canopy duration to temperature changes in four temperate tree species: relative contributions of spring and autumn leaf phenology. Oecologia, 161, 187–98.
Wang, J.Y. (1960). A Critique of the Heat Unit Approach to Plant Response Studies. Ecology, 41, 785–790.
White, M. a., DeBeurs, K.M., Didan, K., Inouye, D.W., Richardson, A.D., Jensen, O.P., O’Keefe, J., Zhang, G., Nemani, R.R., van LEEUWEN, W.J.D., Brown, J.F., de WIT, A., Schaepman, M., Lin, X., Dettinger, M., Bailey, A.S., Kimball, J., Schwartz, M.D., Baldocchi, D.D., Lee, J.T. & Lauenroth, W.K. (2009). Intercomparison, interpretation, and assessment of spring phenology in North America estimated from remote sensing for 1982-2006. Global Change Biology, 15, 2335–2359.
Xiang, Y., Gubian, S., Suomela, B. & Hoeng, J. (2013). Generalized simulated annealing for global optimization: the GenSA Package. R Journal, 5, 13–28.
Xin, Q., Broich, M., Zhu, P. & Gong, P. (2015). Modeling grassland spring onset across the Western United States using climate variables and MODIS-derived phenology metrics. Remote Sensing of Environment, 161, 63–77.
Zhang, X., Friedl, M.A., Schaaf, C.B., Strahler, A.H., Hodges, J.C.F., Gao, F., Reed, B.C. & Huete, A. (2003). Monitoring vegetation phenology using MODIS. Remote Sensing of Environment, 84, 471–475.
# APPENDIX
## The PhenoCam network, data processing and selection
The PhenoCam network (https://phenocam.sr.unh.edu) uses digital camera imagery to monitor ecosystem dynamics over time. The network currently consists of over 400 cameras in the conterminous US, tracking vegetation development. Camera imagery is uploaded and stored on the PhenoCam server. Any new images that have been uploaded to the server during the previous 24 hours are converted into greenness time series from RGB triplets of the recorded JPEG images.
Variability in RGB triplets, due to changing scene illumination, can be largely suppressed by converting the DN triplets to their respective chromatic coordinates (e.g. the green chromatic coordinate, Gcc) and subsequent selection of the 90th percentile of the values across a 1 or 3-day period (Richardson et al. 2007, 2009; Sonnentag et al. 2012). Numerous studies have demonstrated the value of the Gcc index (and the corresponding red chromatic coordinate, Rcc) for characterizing the seasonal trajectory of vegetation color and activity (Keenan et al. 2014; Klosterman et al. 2014; Toomey et al. 2015). Furthermore an appropriate “region of interest” (ROI) is defined, corresponding to the area within each digital image for which colour information will be extracted according to the dominant vegetation type in each image. We therefore calculated Gcc as in eq. 1, where X DN denotes the mean (across the ROI) digital number for colour channel X. The generation of these PhenoCam Gcc 1- or 3-day summary product time series is done nightly on the PhenoCam servers.
Gcc = green DN / (red DN + green DN + blue DN) (eq.1)
## The phenocamr R package: methodology
### Outlier detection and smoothing
A first step in the post-processing of the PhenoCam data after downloading the data is the detection of outliers which might confound further analysis. For the outlier detection we rely on an iterative spline-based outlier detection algorithm on each of the GCC time series (mean, 50th, 75th, and 90th quantiles) included in the 1- and 3-day summary product files. First, using a range of smoothing factors, we fit a family of cubic smoothing splines to each of time series. We selected the optimal spline from within this family using Akaike’s Information Criterion (AIC, (Akaike 1974)), which balances goodness-of-fit against model complexity in order to avoid over-fitting (Hurvich et al. 1998). Then, in an iterative process, we identified and flagged data points lying more than 2x standard deviations (based on the residual variance,) from the spline. With these outliers excluded, we then repeated the spline fitting process. This was done up to 20 times or until no further outliers were detected from one iteration to the next. The spline fitting process is repeated one final time, so that together with the outlier flags, a smoothed and interpolated time series (with uncertainty estimates) could be used for transition date estimation. The procedure described above can be applied to either 1- or 3-day summary file using both the detect_outliers() and smooth_ts() routines separately. However, it is preferred to specify the proper post-processing routines upon downloading the PhenoCam data using the download_phenocam() function.
### Phenophase estimation
Changes in the cycle of vegetation activity are identified as change points using the Pruned Exact Linear Time (PELT) method (Killick et al. 2011) for each cycle with a penalty factor (beta = 0.5) and a minimum segment length of 14 days. Extracted phenophases are calculated as changes in the signal between these change points and intended to define the start of the “greenness rising” and end of the “greenness falling” stage for a full cycle of vegetation activity (i.e., from dormancy, through green-up or “greenness rising”, peak activity, senescence or “greenness falling”, and back to dormancy). Thus, unlike sigmoid-based approaches that have been typically used in the literature (Zhang et al. 2003; Hufkens et al. 2012; Klosterman et al. 2014; Filippa et al. 2016), our method is not limited to ecosystems with a single annual cycle of green-up and senescence and invariant to changes in baseline values due to varying camera settings over time (see Figure \@ref(fig:timeseries)).
The AIC-selected smoothing spline, described in the previous section, is central to this method. The spline is first used to determine the GCC minima (the baseline before a “greenness rising” stage or after a “greenness falling” stage) and maxima (the peak between “rising” and “falling” stages). From the GCC minima and maxima associated with each stage, we then used the spline to identify the dates when 10%, 25%, and 50% of the amplitude (= maxima – minima) were reached. Different threshold values may be specified by calling the underlying function transition_dates(). Transition date uncertainties were then derived from the confidence interval (1.96 x SE) around the smoothing spline, with a minimum uncertainty extending to the immediately preceding or following observation (i.e., ± 1 and ± 3d for the 1- and 3-day GCC time series, respectively; larger in the case of missing data on either side of the estimated transition date). By default all dates are reported in unix time with a start date at (1970-01-01).
## The phenocamr R package: usage
The phenocamr package, most prominently, interfaces with the PhenoCam server to facilitate easy downloading of pre-processed Gcc time series (see section 2) into the R environment. Subsequently, the package screens for outliers in the downloaded time series, smooths the time series and finally estimates different phenophases in a seamless routine. The phenocamr package currently provides all post-processing of PhenoCam standardized datasets (e.g. V1.0; Richardson et al. 2017, In Review). The download_phenocam() function in the package covers most of this functionality. An example of the basic function is provided here:
download_phenocam(site = “bartlett”,
vegetation = “DB”,
roi_id = 0003,
frequency = 3,
outlier_detection = TRUE,
smooth = TRUE,
phenophase = TRUE)
These instruction download 3-day summary data (frequency = 3) at the Bartlett experimental forest site (site = “bartlett”) for a deciduous broadleaf (DB) ROI (vegetation = “DB”) with ID 0003 (roi_id = 0003). In addition, we specify that outlier detection and smoothing should be applied (outlier_detection, and smooth set to TRUE). Finally, phenophases are estimated for this particular time series (phenophase = TRUE). By default transition dates for the 10, 25 and 50% amplitude threshold are estimated for both rising and falling parts of a Gcc time series, for a detailed description we refer to Appendix S1. A processed Gcc time series at a 3-day time step together with the 25% amplitude of the rising falling stages of each cycle of vegetation activity at the Bartlett experimental forest are shown in \@ref(fig:timeseries). In the interest of brevity we refer to the package documentation for additional parameter options.
## The daymetr R package
Daymet provides gridded estimates of daily weather parameters across North America (Thornton et al. 1997), and makes these data available through an ORNL DAAC web service, to which the daymetr R package provides an interface. A simple download command for a particular location can be executed using the following syntax:
download_daymet( site = "bartlett",
lat = 44.0646,
lon = -71.2881,
start = 1980,
end = 2010,
internal = TRUE)
Here we specify the filename to assign to the data to be downloaded (e.g. “bartlett”) as specified by a given latitude and longitude and a start and end year (start = 1980 and end = 2010, respectively). The 'internal' parameter specifies if data should be loaded into the current R workspace (TRUE) or merely downloaded as a comma delimited file (CSV, FALSE) to disk. A similar syntax is used for batch downloading data from a list of locations stored in a CSV file using download_daymet_batch() or downloading gridded (spatial) data through the download_daymet_tiles() and download_daymet_ncss() functions.
## Worked example statistical results
Comparing model performance across the 20 spring phenology models and all the three PFT showed that all models were significantly different from the NULL model, except for the SGSI model for the evergreen PFT (post-hoc Tukey HSD test, p < 0.001, Figure 4). We also note that the NULL model for the grassland PFT performed worse compared to either evergreen or deciduous forest PFT (Figure 4d). For grasslands the RMSE of the NULL model equals 32.7 days compared to only 14.5 and 16.1 days for the deciduous and evergreen forest sites respectively. Excluding the NULL model, model RMSE varied greatly among PFTs. For deciduous broadleaf sites an average RMSE of ~9.1 ± 1.2 days was observed (Figure 4a). For the evergreen needleleaf forest and the grassland sites the RMSE values almost doubled (~14.4 ± 0.8 days and ~13.6 ± 2 days respectively, Figure 4c-d). A separate model optimization on centrally located deciduous broadleaf sites resulted in a RMSE ~7.6 ± 0.7 days (Figure 4b). Limiting the deciduous broadleaf sites to locations to those centrally located within the biome point to a marginally higher RMSE of ~7.9 ± 1.2 days. Differences between the full deciduous broadleaf forest dataset and those more centrally within the biome allude to the influence of geographic location on model error. Although our analysis quickly and easily reproduced a limited version of previous work by Basler (2016), only including a subset of the models discussed and omitting extensive cross-validation therein, our study extends this work in other ways. Mainly, we increased the geographic range, covering boreal, temperate and subtropical regions, and included three plant functional types. However, even with this greater range in latitude and the continental scale of the study, as well as three plant functional types, does not change previous findings.
Variability between in RMSE between the PFT can largely be explained by differences in phenological processes. Grassland phenology is largely determined by both temperature and plant available water. Yet, two grassland spring phenology models included in phenor which account for water stress through VPD (SGSI, AGSI; Xin et al. 2015) do not perform significantly better than other described models. Heterogeneity, between grassland locations might explain this lack of performance. However, an in depth analysis is outside the scope of this manuscript. Evergreen forest phenology as observed by PhenoCams is not determined by developing leaves but by dehardening of existing needles. Estimates of evergreen phenology therefore might benefit from a model structure focussed on the dehardenning process (Hänninen & Kramer 2007) not yet included in the described models.
## References
Akaike, H. (1974). A new look at the statistical model identification. Automatic Control, IEEE Transactions on, 19, 716–723.
Filippa, G., Cremonese, E., Migliavacca, M., Galvagno, M., Forkel, M., Wingate, L., Tomelleri, E., Morra di Cella, U. & Richardson, A.D. (2016). Phenopix: A R package for image-based vegetation phenology. Agricultural and Forest Meteorology, 220, 141–150.
Hufkens, K., Friedl, M., Sonnentag, O., Braswell, B.H., Milliman, T. & Richardson, A.D. (2012). Linking near-surface and satellite remote sensing measurements of deciduous broadleaf forest phenology. Remote Sensing of Environment, 117, 307–321.
Hurvich, C.M., Simonoff, J.S. & Tsai, C.-L. (1998). Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 60, 271–293.
Keenan, T., Darby, B., Felts, E., Sonnentag, O., Friedl, M., Hufkens, K., O’Keefe, J., Munger, J.W., Toomey, M. & Richardson, A.D. (2014). Tracking forest phenology and seasonal physiology using digital repeat photography: a critical assessment. Ecological Applications.
Killick, R., Fearnhead, P. & Eckley, I.A. (2011). Optimal detection of changepoints with a linear computational cost. Journal of the American Statistical Association, 1459.
Klosterman, S.T., Hufkens, K., Gray, J.M., Melaas, E., Sonnentag, O., Lavine, I., Mitchell, L., Norman, R., Friedl, M. a. & Richardson, a. D. (2014). Evaluating remote sensing of deciduous forest phenology at multiple spatial scales using PhenoCam imagery. Biogeosciences Discussions, 11, 2305–2342.
Melaas, E.K., Friedl, M.A. & Richardson, A.D. (2016). Multiscale modeling of spring phenology across Deciduous Forests in the Eastern United States. Global Change Biology, 22, 792–805.
Richardson, A.D., Braswell, B.H., Hollinger, D.Y., Jenkins, J.P. & Ollinger, S. V. (2009). Near-surface remote sensing of spatial and temporal variation in canopy phenology. Ecological applications: a publication of the Ecological Society of America, 19, 1417–28.
Richardson, A.D., Hufkens, K., Milliman, T., Aubrecht, D.M., Chen, M., Gray, J.M., Johnston, M.R., Keenan, T.F., Klosterman, S.T., Kosmala, M., Melaas, E.K., Friedl, M.A. & Frolking, S. (2017). Tracking vegetation phenology across diverse North American biomes using PhenoCam imagery. Scientific Data.
Richardson, A.D., Jenkins, J.P., Braswell, B.H., Hollinger, D.Y., Ollinger, S. V & Smith, M.-L. (2007). Use of digital webcam images to track spring green-up in a deciduous broadleaf forest. Oecologia, 152, 323–34.
Sonnentag, O., Hufkens, K., Teshera-Sterne, C., Young, A.M., Friedl, M., Braswell, B.H., Milliman, T., O’Keefe, J. & Richardson, A.D. (2012). Digital repeat photography for phenological research in forest ecosystems. Agricultural and Forest Meteorology, 152, 159–177.
Thornton, P.E., Running, S.W. & White, M.A. (1997). Generating surfaces of daily meteorological variables over large regions of complex terrain. Journal of Hydrology, 190, 214–251.
Toomey, M., Friedl, M. a, Frolking, S., Hufkens, K., Klosterman, S., Sonnentag, O., Baldocchi, D.D., Bernacchi, C.J., Biraud, S.C. & Richardson, A.D. (2015). Greenness indices from digital cameras predict the timing and seasonal dynamics of canopy-scale photosynthesis. Ecological Applications, 25, 99–115.
Zhang, X., Friedl, M.A., Schaaf, C.B., Strahler, A.H., Hodges, J.C.F., Gao, F., Reed, B.C. & Huete, A. (2003). Monitoring vegetation phenology using MODIS. Remote Sensing of Environment, 84, 471–475.
```{r timeseries, echo = FALSE, fig.cap = 'Example Gcc PhenoCam Dataset V1.0 time series'}
knitr::include_graphics("output/Appendix_S1_figure_1_time_series_example.png")
```