Skip to content

Latest commit

 

History

History
225 lines (159 loc) · 16.1 KB

CalculateVehicleTrips.md

File metadata and controls

225 lines (159 loc) · 16.1 KB

CalculateVehicleTrips Module

November 23, 2018

This module calculates average daily vehicle trips for households consistent with the household DVMT. An average trip length model is applied to estimate average length of household trips reflecting the characteristics of the household and the place where they live. The average trip length is divided into the average household DVMT to get an estimate of average number of daily vehicle trips.

Model Parameter Estimation

Metropolitan and non-metropolitan household trip length models are estimated from the Hh_df dataset in the VE2001NHTS package. Separate metropolitan and non-metropolitan models are estimated because the metropolitan model is sensitive to added variables that are not present for non-metropolitan household data. The models are estimated using the records of households that had some light-duty vehicle travel on the survey day.

The models are estimated in 2 steps. In the first step, a linear model is estimated to predict the survey day average trip length. This model is applied stochastically over 1000 days to calculate the average trip length. In the second step, a linear model of the averages is estimated. These steps are described in more detail below.

In the first step, linear models of survey day average vehicle trip length are estimated for metropolitan and non-metropolitan households. Since the distribution of vehicle trip length is highly skewed with a long right-hand tail, the model is estimated for predicting power-transformed values. The power is estimated to minimize the skewness of the distribution. Following is a summary of the estimation statistics for non-metropolitan households:


Call:
lm(formula = ModelFormula, data = Data_df)

Residuals:
     Min       1Q   Median       3Q      Max 
-0.43677 -0.06198 -0.00375  0.05986  0.43820 

Coefficients:
                       Estimate Std. Error t value Pr(>|t|)    
(Intercept)           0.9966442  0.0197085  50.569  < 2e-16 ***
Drivers               0.0823890  0.0103433   7.965 1.70e-15 ***
NonDrivers            0.0305891  0.0075363   4.059 4.94e-05 ***
LogIncome             0.0254076  0.0018770  13.536  < 2e-16 ***
LogDensity           -0.0146889  0.0003151 -46.623  < 2e-16 ***
VehLtDvr             -0.0146301  0.0020280  -7.214 5.56e-13 ***
Drivers:LogIncome    -0.0066623  0.0009650  -6.904 5.16e-12 ***
NonDrivers:LogIncome -0.0028721  0.0007053  -4.072 4.67e-05 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 0.1 on 31205 degrees of freedom
Multiple R-squared:  0.0934,	Adjusted R-squared:  0.0932 
F-statistic: 459.3 on 7 and 31205 DF,  p-value: < 2.2e-16

All variables are highly significant and the coefficients have expected signs. Trip lengths increase with the numbers of drivers and non-drivers in the household and the income of the household. The number of drivers in the household has a stronger effect than the number of non-drivers. Trip lengths decrease at higher population densities and if households have fewer vehicles than drivers. The negative coefficients for the interaction of income with drivers and non-drivers indicates that the rate of increase in trip length diminishes as income and the numbers of drivers and non-drivers increase.

The metropolitan model has the same independent variables as the non-metropolitan model and also includes freeway lane-miles per capita and urban mixed-use neighborhood variables. Following is a summary of the estimation statistics for this model:


Call:
lm(formula = ModelFormula, data = Data_df)

Residuals:
     Min       1Q   Median       3Q      Max 
-0.69842 -0.10741 -0.00495  0.10100  0.72511 

Coefficients:
                            Estimate Std. Error t value Pr(>|t|)    
(Intercept)                 0.901819   0.044739  20.157  < 2e-16 ***
Drivers                     0.140673   0.023458   5.997 2.05e-09 ***
NonDrivers                  0.059420   0.016324   3.640 0.000273 ***
LogIncome                   0.051143   0.004077  12.544  < 2e-16 ***
LogDensity                 -0.022121   0.001060 -20.873  < 2e-16 ***
IsUrbanMixNbrhd            -0.261479   0.034807  -7.512 6.11e-14 ***
FwyLaneMiPC                19.946129   7.085153   2.815 0.004881 ** 
VehLtDvr                   -0.027993   0.004361  -6.419 1.41e-10 ***
Drivers:LogIncome          -0.011388   0.002129  -5.349 8.96e-08 ***
NonDrivers:LogIncome       -0.005853   0.001498  -3.907 9.36e-05 ***
LogDensity:IsUrbanMixNbrhd  0.027941   0.003759   7.434 1.11e-13 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 0.1646 on 16299 degrees of freedom
Multiple R-squared:  0.0805,	Adjusted R-squared:  0.07993 
F-statistic: 142.7 on 10 and 16299 DF,  p-value: < 2.2e-16

The coefficients for the variables shared with the non-metropolitan model have the same signs. The added freeway lane-mile and urban mixed-use neighborhood variables are also highly significant and have expected signs. The average vehicle trip length increases with greater freeway lane-miles and decreases in urban mixed-use neighborhoods. The added interaction between density and urban mixed-use has a positive sign which indicates that the rate of decrease in trip length diminishes as density increases.

As can be seen from the statistics, the models predict a small portion of the observed variation in vehicle trip lengths. Because of this and because the model is a model of power-transformed trip length (where trip length has a long right-hand tail), the mean of the modeled values is substantially less than the observed mean. To address this, the modeled predictions are treated as the mean values of distributions for which a standard deviation is calculated that results in a distribution of predicted values that has the same variance as the observed distribution. The model is run 1000 times for each survey household where each time a value is chosen at random from a normal distribution having the modeled mean and the estimated standard deviation. The household average is computed from the results.

In the second step, linear models to predict the simulated averages for metropolitan and non-metropolitan households are estimated. The independent variables in these models are the same as in the survey day models. Following is are the estimation statistics for the metropolitan household model.


Call:
lm(formula = makeFormula(EndTerms_), data = EstData_df)

Residuals:
       Min         1Q     Median         3Q        Max 
-0.0142881 -0.0021256 -0.0000383  0.0021285  0.0138720 

Coefficients:
                             Estimate Std. Error t value Pr(>|t|)    
(Intercept)                 9.928e-01  8.621e-04 1151.70   <2e-16 ***
Drivers                     7.863e-02  4.520e-04  173.98   <2e-16 ***
NonDrivers                  3.230e-02  3.145e-04  102.71   <2e-16 ***
LogIncome                   2.820e-02  7.855e-05  359.00   <2e-16 ***
LogDensity                 -1.198e-02  2.046e-05 -585.52   <2e-16 ***
IsUrbanMixNbrhd            -1.424e-01  6.709e-04 -212.28   <2e-16 ***
FwyLaneMiPC                 1.080e+01  1.365e-01   79.15   <2e-16 ***
VehLtDvr                   -1.531e-02  8.403e-05 -182.17   <2e-16 ***
Drivers:LogIncome          -6.382e-03  4.102e-05 -155.60   <2e-16 ***
NonDrivers:LogIncome       -3.180e-03  2.886e-05 -110.19   <2e-16 ***
LogDensity:IsUrbanMixNbrhd  1.520e-02  7.245e-05  209.86   <2e-16 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 0.003172 on 16299 degrees of freedom
Multiple R-squared:  0.9859,	Adjusted R-squared:  0.9859 
F-statistic: 1.139e+05 on 10 and 16299 DF,  p-value: < 2.2e-16

The high R-squared value indicates that the linear model can be substituted for the synthesis method for predicting the average trip length for households. The following figure illustrates this by comparing the distributions of the values produced by the linear model and the simulated averages.

metro_veh-trp-len_dist.png

Following are the estimation statistics for the non-metropolitan household model.


Call:
lm(formula = makeFormula(EndTerms_), data = EstData_df)

Residuals:
     Min       1Q   Median       3Q      Max 
-0.22242 -0.03391 -0.00118  0.03213  0.29653 

Coefficients:
                       Estimate Std. Error  t value Pr(>|t|)    
(Intercept)           0.9159244  0.0098446    93.04   <2e-16 ***
Drivers               0.8770513  0.0051662   169.77   <2e-16 ***
NonDrivers            0.3688203  0.0037642    97.98   <2e-16 ***
LogIncome             0.2799887  0.0009375   298.64   <2e-16 ***
LogDensity           -0.1795620  0.0001579 -1136.86   <2e-16 ***
VehLtDvr             -0.1724488  0.0010129  -170.25   <2e-16 ***
Drivers:LogIncome    -0.0694627  0.0004820  -144.11   <2e-16 ***
NonDrivers:LogIncome -0.0347099  0.0003523   -98.52   <2e-16 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 0.04995 on 31205 degrees of freedom
Multiple R-squared:  0.9836,	Adjusted R-squared:  0.9835 
F-statistic: 2.666e+05 on 7 and 31205 DF,  p-value: < 2.2e-16

The high r-squared value indicates that this model is also a suitable substitute for the simulation of average trip length. The following figure shows that the fit of the non-metropolitan model is not as good as the fit of the metropolitan model.

nonmetro_veh-trp-len_dist.png

The following table compares the mean modeled values for average DVMT, average trip length, and average daily vehicle trips with average survey values for DVMT, trip length, and vehicle trips. The survey values are capped at the 99th percentile value as are the modeled values. The mean number of trips for the modeled and survey values is calculated by dividing the mean DVMT by the mean trip length. As can be seen, the modeled mean trip length is close to the survey mean. However, since the mean of the modeled average household DVMT is less than the survey day mean DVMT, the modeled average number of vehicle trips is less than the survey mean.

DVMT (miles) Trip Length (miles) Trips
Model Means 58.6 8.8 6.7
Survey Means 62.6 8.7 7.2

How the Module Works

The module applies the estimated metropolitan and non-metropolitan average vehicle trip length models to calculate the average vehicle trip length for each household. The metropolitan model is applied to all household located in the urbanized area. The non-metropolitan model is applied to other households. The average number of vehicle trips is calculated for each household by dividing the household DVMT by the average trip length.

User Inputs

This module has no user input requirements.

Datasets Used by the Module

The following table documents each dataset that is retrieved from the datastore and used by the module. Each row in the table describes a dataset. All the datasets must be present in the datastore. One or more of these datasets may be entered into the datastore from the user input files. The table names and their meanings are as follows:

NAME - The dataset name.

TABLE - The table in the datastore that the data is retrieved from.

GROUP - The group in the datastore where the table is located. Note that the datastore has a group named 'Global' and groups for every model run year. For example, if the model run years are 2010 and 2050, then the datastore will have a group named '2010' and a group named '2050'. If the value for 'GROUP' is 'Year', then the dataset will exist in each model run year group. If the value for 'GROUP' is 'BaseYear' then the dataset will only exist in the base year group (e.g. '2010'). If the value for 'GROUP' is 'Global' then the dataset will only exist in the 'Global' group.

TYPE - The data type. The framework uses the type to check units and inputs. Refer to the model system design and users guide for information on allowed types.

UNITS - The units that input values need to represent. Some data types have defined units that are represented as abbreviations or combinations of abbreviations. For example 'MI/HR' means miles per hour. Many of these abbreviations are self evident, but the VisionEval model system design and users guide should be consulted.

PROHIBIT - Values that are prohibited. Values in the datastore do not meet any of the listed conditions.

ISELEMENTOF - Categorical values that are permitted. Values in the datastore are one or more of the listed values.

NAME TABLE GROUP TYPE UNITS PROHIBIT ISELEMENTOF
Marea Marea Year character ID
FwyLaneMiPC Marea Year compound MI/PRSN NA, < 0
Marea Bzone Year character ID
Bzone Bzone Year character ID
D1B Bzone Year compound PRSN/SQMI NA, < 0
Marea Household Year character ID
Bzone Household Year character ID
LocType Household Year character category NA Urban, Town, Rural
Income Household Year currency USD.2001 NA, < 0
Drivers Household Year people PRSN NA, < 0
Vehicles Household Year vehicles VEH NA, < 0
HhSize Household Year people PRSN NA, <= 0
IsUrbanMixNbrhd Household Year integer binary NA 0, 1
Dvmt Household Year compound MI/DAY NA, < 0

Datasets Produced by the Module

The following table documents each dataset that is retrieved from the datastore and used by the module. Each row in the table describes a dataset. All the datasets must be present in the datastore. One or more of these datasets may be entered into the datastore from the user input files. The table names and their meanings are as follows:

NAME - The dataset name.

TABLE - The table in the datastore that the data is retrieved from.

GROUP - The group in the datastore where the table is located. Note that the datastore has a group named 'Global' and groups for every model run year. For example, if the model run years are 2010 and 2050, then the datastore will have a group named '2010' and a group named '2050'. If the value for 'GROUP' is 'Year', then the dataset will exist in each model run year. If the value for 'GROUP' is 'BaseYear' then the dataset will only exist in the base year group (e.g. '2010'). If the value for 'GROUP' is 'Global' then the dataset will only exist in the 'Global' group.

TYPE - The data type. The framework uses the type to check units and inputs. Refer to the model system design and users guide for information on allowed types.

UNITS - The units that input values need to represent. Some data types have defined units that are represented as abbreviations or combinations of abbreviations. For example 'MI/HR' means miles per hour. Many of these abbreviations are self evident, but the VisionEval model system design and users guide should be consulted.

PROHIBIT - Values that are prohibited. Values in the datastore do not meet any of the listed conditions.

ISELEMENTOF - Categorical values that are permitted. Values in the datastore are one or more of the listed values.

DESCRIPTION - A description of the data.

NAME TABLE GROUP TYPE UNITS PROHIBIT ISELEMENTOF DESCRIPTION
VehicleTrips Household Year compound TRIP/DAY NA, < 0 Average number of vehicle trips per day by household members
AveVehTripLen Household Year distance MI NA, < 0 Average household vehicle trip length in miles