Skip to content

MEDIDA: Model Error Discovery with Interpretability and Data Assimilation

Notifications You must be signed in to change notification settings

envfluids/MEDIDA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MEDIDA: Model Error Discovery with Interpretability and Data Assimilation

Table of contents

Introduction

Models used for many important engineering and natural systems are imperfect. The discrepancy between the mathematical representations of a true physical system and its imperfect model is called the model error. These model errors can lead to substantial difference between the numerical solutions of the model and the observations of the system, particularly in those involving nonlinear, multi-scale phenomena. Thus, there is substantial interest in reducing model errors, particularly through understanding their physics and sources and leveraging the rapid growth of observational data. Here we introduce a framework named MEDIDA: Model Error Discovery with Interpretability and Data Assimilation. MEDIDA only requires a working numerical solver of the model and a small number of noise-free or noisy sporadic observations of the system. In MEDIDA, first the model error is estimated from differences between the observed states and model-predicted states (the latter are obtained from a number of one-time-step numerical integrations from the previous observed states). If observations are noisy, a data assimilation (DA) technique such as ensemble Kalman filter (EnKF) is first used to provide a noise-free analysis state of the system, which is then used in estimating the model error. Finally, an equation-discovery technique, such as the relevance vector machine (RVM), a sparsity-promoting Bayesian method, is used to identify an interpretable, parsimonious, closed-form representation of the model error. Using the chaotic Kuramoto-Sivashinsky (KS) system as the test case, we demonstrate the excellent performance of MEDIDA in discovering different types of structural/parametric model errors, representing different types of missing physics, using noise-free and noisy observations.

Requirements

'*' Included in the utils

Experiments

System (perfect model): KS

KS Case KS Location

Run main.py with all the input arguments

python3 -u main.py $CASE_NO $NOISE_MAG $N_ENS $N_SPORADIC $EQN $IF_SIMULATION $IF_RVM

where,

  • CASE_NO (type: integer) is the case number [to do: change case numbers as they appear in the paper)

  • NOISE_MAG (type: real/float) is the standard deviation of the noise ratio (std of noise to std of singal at each time step)

  • N_ENS (type: integer) is the size of ensembles (number of the ensemble members)

  • N_SPORADIC (type: integer) is the number of time-steps between each sample pair

  • EQN (type: string) type of the (imperect) model equation: "KS" for KS, "KSpu3x" for KS+c_4 'u_{xxx}'$ KSpu3x_Du3 for KS + c_4 $'u^2u_{x}'$

  • IF_SIMULATION (type: boolean) is set to TRUE for generating the ensembles and running the imperect model (otherwise FALSE)

  • IF_RVM (type: boolean) is set to TRUE to run RVM to discover the missing physics (otherwise FALSE; set to TRUE if IF_SIMULATION==TRUE or the corresponding saved files exists )

Example: Only run RVM on (already generated data files) case number 1 with 1% noise with $N=10M=10240$ sampling once (one pair) every 10,000 time steps, where the imperfect model is of KS type

python3 -u main.py 1 0.01 10240 10000 KS False True

Note: Download, extract, and include "KS.pkl" in ./experiments/KS/save

Citation

References

Following open-source libraries/codes are used

  • [i] rvm-find

    • Laure Zanna, and Thomas Bolton. "Data-Driven Equation Discovery of Ocean Mesoscale Closures ", Geophysical Research Letters 47, 17 (2020).(url)
      BibTeX
      @article{Zanna_GRL_2020,
      author = {Zanna, Laure and Bolton, Thomas},
      doi = {10.1029/2020GL088376},
      issn = {19448007},
      journal = {Geophysical Research Letters},
      number = 17,
      pages = 1--13,
      title = {Data-Driven Equation Discovery of Ocean Mesoscale Closures},
      volume = 47,
      year = 2020,
      }
  • [ii] PDE-FIND

    • Samuel H Rudy, Steven L Brunton, Joshua L Proctor, and Nathan J Kutz. "Data-driven discovery of partial differential equations", Science Advances 3, 4 (2017).(url)
      BibTeX
      @article{Rudy_SA_2017,
      author = {Rudy, Samuel H and Brunton, Steven L and Proctor, Joshua L and Kutz, J Nathan},
      journal = {Science Advances},
      number = 4,
      publisher = {American Association for the Advancement of Science},
      title = {Data-driven discovery of partial differential equations},
      url = { https://advances.sciencemag.org/content/3/4/e1602614 },
      doi = { 10.1126/sciadv.1602614 },
      volume = 3,
      year = 2017,
      }

Releases

No releases published

Packages

No packages published

Languages