### [Forecasting Economic Aggregates Using Dynamic Component Grouping](https://mpra.ub.uni-muenchen.de/81585/1/MPRA_paper_81585.pdf)

__Abstract__

In terms of aggregate accuracy, whether it is worth the effort of modelling a disaggregate process, instead of forecasting the aggregate directly, depends on the
properties of the data. Forecasting the aggregate directly and forecasting each of
the components separately, however, are not the only options. This paper develops
a framework to forecast an aggregate that dynamically chooses groupings of components based on the properties of the data to benefit from both the advantages
of aggregation and disaggregation. With this objective in mind, the dimension of
the problem is reduced by selecting a subset of possible groupings through the use
of agglomerative hierarchical clustering. The definitive forecast is then produced
based on this subset. The results from an empirical application using CPI data for
France, Germany and the UK suggest that the grouping methods can improve both
aggregate and disaggregate accuracy.

#### Intro

The usual argument behind using the components is that
allowing for different specifications across disaggregate variables may capture more
precisely the dynamics of a process that becomes too complex through aggregation.
The argument in favour of forecasting the aggregate directly is that it would be less affected by disaggregate misspecification, data measurement error and structural breaks. Ultimately, whether it is
better to forecast components together or separately depends on the particular forecasting models and data.


In this context, it might be
possible to find specific groupings that avoid the problems associated with disaggregate
forecasting while still allowing for distinct disaggregate dynamics to be picked up in the
process. With this objective we develop a two-stage method that combines statistical
learning techniques and traditional economic forecasting evaluation. 

- In the first stage, we use agglomerative hierarchical clustering to reduce the dimension of the problem by choosing a subset of feasible groupings based on the commonality among the different components. 
- In the second stage, we try different selection procedures on the resulting hierarchy to produce the final aggregate forecast. These selection procedures include choosing a single grouping based on some criterion and combining the whole subset of groups.
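
The two stages above can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the paper's implementation: the distance measure (1 minus the Pearson correlation), the average linkage, the intercept-free AR(1) forecast and the single-observation holdout criterion are all choices made here for illustration, and the function names are hypothetical. Components are assumed to be already weighted contributions, so the aggregate is their plain sum, and no series is constant.

```python
import math
from itertools import combinations


def pearson(x, y):
    # Pearson correlation of two equally long, non-constant series.
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def hierarchy_of_groupings(series):
    """Stage 1: average-linkage agglomerative clustering.

    `series` maps component name -> list of observations. Returns the nested
    groupings, from every component alone down to one all-inclusive group.
    """
    names = sorted(series)
    dist = {frozenset(p): 1.0 - pearson(series[p[0]], series[p[1]])
            for p in combinations(names, 2)}

    def linkage(a, b):
        # Average pairwise distance between the members of clusters a and b.
        return sum(dist[frozenset((x, y))] for x in a for y in b) / (len(a) * len(b))

    clusters = [(n,) for n in names]
    groupings = [list(clusters)]
    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2), key=lambda p: linkage(*p))
        clusters = [c for c in clusters if c not in (a, b)] + [tuple(sorted(a + b))]
        groupings.append(list(clusters))
    return groupings


def ar1_forecast(y):
    # One-step AR(1) forecast without intercept, fitted by least squares
    # (an illustrative stand-in for whatever forecasting model is used).
    num = sum(cur * prev for prev, cur in zip(y, y[1:]))
    den = sum(prev * prev for prev in y[:-1])
    return (num / den) * y[-1] if den else y[-1]


def aggregate_forecast(series, grouping, forecast=ar1_forecast):
    # Sum the components inside each group, forecast each group, add up.
    total = 0.0
    for group in grouping:
        summed = [sum(obs) for obs in zip(*(series[name] for name in group))]
        total += forecast(summed)
    return total


def select_grouping(series, groupings, forecast=ar1_forecast):
    """Stage 2 (single-selection variant): hold out the last observation and
    keep the grouping whose aggregate forecast has the smallest absolute error."""
    train = {name: obs[:-1] for name, obs in series.items()}
    actual = sum(obs[-1] for obs in series.values())
    return min(groupings,
               key=lambda g: abs(aggregate_forecast(train, g, forecast) - actual))
```

For `n` components the hierarchy contains only `n` candidate groupings, which is what makes the second stage tractable. The sketch covers only the "choose a single grouping" branch of the second stage; combining the forecasts from the whole subset of groupings would replace `select_grouping` with some form of averaging.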

The results from an empirical application using CPI data for France, Germany and the
UK show that the grouping method can improve overall accuracy.
In particular, some of the methods that selected a unique grouping performed better than the
best performing non-grouping method, both in terms of aggregate and disaggregate
accuracy.

The options available for forecasting are many, even when only the level of disaggregation is considered. 
The usual argument behind using the components to forecast an aggregate is that allowing for different specifications across disaggregate variables may capture more precisely the dynamics of a process that becomes too complex through aggregation. In support of this view, Granger (1990) shows that summing
many simple stationary processes can produce a fractionally integrated aggregate, while
Bermingham and D’Agostino (2014) show that the dispersion of the persistence of individual series has an accelerating effect on the increase of complexity in the aggregate.

The argument in favour of forecasting the aggregate directly is that, in practical applications, the
disaggregate processes are likely to suffer from misspecification. For example, if the
disaggregate models neglect that a number of components share common factors, the
forecasting errors will tend to cluster, with a negative effect on the aggregate forecast. The direct aggregate forecast would be less affected by these features
of the data and by other problems, such as those resulting from data measurement error and
structural breaks.
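
A short worked equation shows why clustered errors hurt the bottom-up forecast. The notation is an assumption introduced here, not taken from the paper: let $e_i$ be the forecast error of component $i$, with the aggregate error being the sum $e = \sum_{i=1}^{n} e_i$. Then

$$
\operatorname{Var}(e) = \sum_{i=1}^{n} \operatorname{Var}(e_i) + 2 \sum_{i<j} \operatorname{Cov}(e_i, e_j).
$$

In the simplified case where every component error has variance $\sigma^2$ and every pair has correlation $\rho$, this reduces to

$$
\operatorname{Var}(e) = n\sigma^2 \bigl(1 + (n-1)\rho\bigr).
$$

With uncorrelated errors ($\rho = 0$) the variance is $n\sigma^2$, whereas a neglected common factor that induces $\rho > 0$ inflates it by the factor $1 + (n-1)\rho$, which grows with the number of components.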

The theoretical literature supports using the disaggregate forecasts, or bottom-up approach, but the results in the empirical literature are mixed. Ultimately, whether the
magnitude of the aggregation error compensates for the specification errors in the disaggregate model depends on the particular forecasting models and data.

One option to improve forecasting performance in this setting is to work on the modelling, for example by including disaggregate information in a direct aggregate approach or by including common factors
in a bottom-up approach. Another, less obvious, way is to look for data transformations
that allow existing models to perform better.

> Examples: Marcellino et al. (2003), Hahn and Skudelny (2008), Burriel (2012) and Esteves
(2013) for European GDP growth; and Zellner and Tobias (2000), Perevalov and Maier (2010) and Drechsel
and Scheufele (2013) for GDP growth in specific industrialized countries.

__As mentioned before, adding components together results in new series with characteristics that may differ quite significantly from those of the originating ones. In this
context, it may be possible to purposefully find specific groupings that show more desirable properties than those of the individual components and the aggregate.__

Some authors have proposed using purpose-built groupings to increase overall forecasting accuracy, but it would seem that, at least in economic forecasting, the idea has had
little impact. A reason for this may be that the number of possible
groupings grows exponentially with the number of components, meaning that traditional methods, which would usually rely on evaluating all possible outcomes, are really only usable for problems with relatively few components. For larger problems, a different
approach becomes necessary.
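
To get a feel for how quickly exhaustive evaluation becomes infeasible, the number of ways to split `n` components into non-empty groups can be computed directly. That count is the Bell number `B_n`, a standard combinatorial result rather than something taken from the paper. The sketch below computes it with the Bell triangle, and the function name is chosen here for illustration.

```python
def bell_numbers(n_max):
    """Return [B_1, ..., B_n_max], the number of possible groupings
    (set partitions) of 1, ..., n_max components, via the Bell triangle."""
    row = [1]
    bells = [1]
    for _ in range(n_max - 1):
        # Each new row starts with the last entry of the previous row;
        # every further entry adds the entry above-left to its left neighbour.
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        row = new_row
        bells.append(row[-1])
    return bells


for n, count in enumerate(bell_numbers(12), start=1):
    print(f"{n:2d} components -> {count} possible groupings")
```

With 10 components there are already 115,975 possible groupings, and with 12 there are 4,213,597, so evaluating a forecasting model on every one of them is rarely practical.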

One approach that has been relatively successful recently, particularly given the growing popularity of Big Data methods, performs the grouping conditional on some feature of the original data. The success of these methods, however, depends on the chosen feature being useful in obtaining the desired outcome.
- The assumption on which many of these models are built is that, by grouping series that behave in a similar way, the idiosyncratic errors within groups will tend to offset each other while the more relevant individual dynamics are retained to be modelled.

Although these problems are set in a different context, the purpose of the methods is
very similar to that of grouping components to increase the forecasting accuracy of an
economic aggregate. They belong, however, to an area of statistical learning research
that has focused almost exclusively on extracting information from very large datasets.
Many relevant economic aggregates, like GDP and CPI, do not fall into this category, and
it is unclear whether these methods will work with relatively small samples.