Skip to content

EddyShao/QBO-1d

Repository files navigation

Data-Driven Gravity Wave Parameterization in QBO-1d model

Motivation, Background, and Setup

In this section, we will provides some information about the motivation and background of the project.

Gravity Waves Parametrization

The tropical stratospheric Quasi-Biennial Oscillation is driven largely by parametrized gravity wave breaking, which makes its parametrization an interesting and necessary task to do.

Currently, the parametrization is largely based on the physics. We hope by adopting machine learning, we can have a parametrization with improved accuracy and computation feasibility.

Quasi-Biennual Oscillation

Our project is located in a very interesting fact in Atomespheric Fluid Dynamics called Quasi-Biennual Oscillation, QBO for short. Equatorial zonal wind oscillates between easterlies and westerlies in the tropical stratosphere with a mean period of 28 to 29 months (quasi-biennial) due to gravity waves forcing.

Our job, therefore, is to parametrize the gravity waves inside the oscillation. In particular, our work is largely based on the 1-dim QBO model.

QBO-1d

The QBO-1d model is largely a hybrid of the models used by Holton and Linden (1972) and Plumb (1977). The (advection-diffussion) equation reads:

$$\frac{\partial u}{\partial t} + w \frac{\partial u}{\partial z} - \kappa \frac{\partial^2 u}{\partial z^2} = - S(u, z)$$

where $u$ is the zonal wind, $w$ and $k$ are the advection and difussion constants, and $z$ is the vertical coordinate. In particular, $S$ is the gravity wave forcing term, which is our focus.

Physical Parametrization with Stochasticity

Currently, the forcing term is parametrized in the following way.

$$S(u, z) = \frac{1}{\rho} \frac{\partial}{\partial z} F(u, z)$$

where the wave flux 𝐹(𝑢,𝑧) is parameterized as follows: $$F(u, z) = \sum_{i} A_{i} \exp{ - \int_{z_1}^{z} g_{i}(u, z') , dz' }$$

and

$$A(c) = \text{sgn}(c) B_m \exp\left[- \ln{2} \left(\frac{c}{c_w}\right)^2 \right]$$

Note that, when $z = z_1$, $F(u, z_1) = \sum_i A_i$, denoted $F_{S_0}$(total source flux).

The stochasticity enters the story by by making $F_{S_0}$(total source flux) and $C_w$ (spectral width) Random Variables of time.

Data-Driven parametrizations

The pipeline of the project can be visualized as the follow image. Pipeline

Goals: Online and Offline

We have goals from two perspectives.

  • Offline: Machine Learning model’s prediction on test dataset should be satisfactory. For offline performance we mainly use (level-wise mean) / RMSE as the metric.

  • Online: Machine Learning model should function similarly as the Physical Model when inserted into the PDE, with better accuracy and efficiency (hopefully).

Methodology

Despite human beings' great breakthrough on classification, regression is still a hard task. Here is the list of all the machine learning algorithms for regression that I can think of:

  • Linear regression
  • Neural Nets
  • Regression trees/Forest
  • Supported Vector Machine

Linear models will by no means be used in practice, we will treat it as a baseline. Neural nets and trees and forest have been done by other members in the group. Therefore, our main focus is the application of supported vector machine for regression(SVR for short) on our problem.

For an introduction of supporetd vector machine and regression, please refer to my presentation slides and weekly report.

Results

Offline Results

With some highly nontrivial data preprocessing and hyperparameters tuning, SVR model are able to achieve a fairly satisfactory performance on the dataset ($R^{2} = 0.95$), which is much better than the linear models ($R^{2} = 0.88$)

Online Results

With some highly nontrivial hyper-parameters tuning, SVR models can successfully emulate the physical-based gravity waves, thus producing the correct oscillation. SVR's Online performance

Unfortunately, the efficiency of SVR models are not satisfactory compared with pre-existing NN models (which is very conter-intuitive). Its low efficiency is mainly due to the computation of exponential function inside the kernel.

Also, it has some ability to 'generalize'. For the definition of generalization and further details, please refer to my presentation slides.

Conclusion

  • Supported Vector Regression model can emulate physics-based gravity wave parametrization (in QBO-1d model)
  • The relation between offline performance and online performance is unknown.
  • Supported Vector Regression model has some ability of generalization.
  • A small but dense model outperforms a big but sparse model in efficiency with comparable online emulation results. However, SVR is essentially not a method with efficiency.

About

Research on QBO-1d

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published