SLAMS: Score-based Latent Assimilation in Multimodal Setting


🎊🎊 We won the best student paper award at CVPR EarthVision 2024!! 🎊🎊

📚 Paper: https://arxiv.org/abs/2404.06665

🔍 Overview: We recast data assimilation in a multimodal setting using a deep generative framework. In particular, we implement a latent score-based diffusion model: heterogeneous states and observations are projected into a unified latent space, where the forward and reverse conditional diffusion processes take place. Across varying ablation studies with coarse, noisy, and sparse conditioning inputs, we find our method to be robust and physically consistent. Part of this implementation builds upon components originally developed by Rozet, F., & Louppe, G. [paper, code] under the MIT License, which we have adapted and extended to fit our research framework.
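To give a rough feel for the core idea (this is a conceptual toy, not the repository's implementation), the sketch below samples a posterior in a one-dimensional "latent" space using a Langevin-style stand-in for the conditional reverse process: the prior score is analytic (standard Gaussian), and a noisy observation enters through a likelihood-gradient guidance term. All variable names and constants here are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: latent prior z ~ N(0, 1), observation y = z + noise.
y = 1.5          # noisy observation of the latent state
obs_var = 0.25   # observation noise variance
n_steps = 1000
dt = 1.0 / n_steps

def score_prior(z):
    # Analytic score of a standard Gaussian prior: grad log p(z) = -z.
    return -z

def score_likelihood(z):
    # grad_z log N(y | z, obs_var): pulls samples toward the observation.
    return (y - z) / obs_var

# Euler-Maruyama on an overdamped Langevin SDE targeting the posterior.
z = rng.standard_normal(64)  # ensemble of latent samples
for _ in range(n_steps):
    drift = score_prior(z) + score_likelihood(z)
    z = z + drift * dt + np.sqrt(2 * dt) * rng.standard_normal(z.shape)

# For this conjugate Gaussian case the posterior mean is known in closed form,
# so we can check the ensemble against it.
post_mean = y / (1 + obs_var)
print(z.mean(), post_mean)
```

In the actual method, the analytic prior score is replaced by a learned score network, and the latent space is produced by trained encoders over heterogeneous modalities; the guidance-by-observation structure is the common thread.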

Quickstart

  1. Install dependencies using pip or conda:

```
pip install -r requirements.txt
```

  2. Run the sample notebooks under `notebooks/` marked with the `01_` prefix. These examples are extended from [1] to benchmark against our latent approach.
  • a: Lorenz '63 system
  • b: Kolmogorov fluid
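For orientation, the Lorenz '63 system benchmarked in notebook a is the classic three-variable chaotic ODE. A minimal standalone integration (independent of the repository code; parameters are the standard textbook values) looks like:

```python
import numpy as np

def lorenz63(state, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    # Classic Lorenz '63 right-hand side with standard parameters.
    x, y, z = state
    return np.array([sigma * (y - x), x * (rho - z), x * y - beta * z])

def rk4_step(f, state, dt):
    # One fourth-order Runge-Kutta step.
    k1 = f(state)
    k2 = f(state + 0.5 * dt * k1)
    k3 = f(state + 0.5 * dt * k2)
    k4 = f(state + dt * k3)
    return state + (dt / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)

state = np.array([1.0, 1.0, 1.0])
traj = [state]
for _ in range(2000):  # integrate 20 model time units at dt = 0.01
    state = rk4_step(lorenz63, state, 0.01)
    traj.append(state)
traj = np.array(traj)
print(traj.shape)  # (2001, 3)
```

Because the system is chaotic yet low-dimensional, it is a common first benchmark for assimilation methods before moving to the higher-dimensional Kolmogorov fluid.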

Full Experiments

To reproduce the results in the paper, first acquire the necessary data:

  1. Process the in-situ data: `python process_cpc.py`
  2. Process the ex-situ data: `python process_noaa.py`
  3. Process the ERA5 data from https://leap-stc.github.io/ChaosBench/quickstart.html, in particular:

```
cd data/
wget https://huggingface.co/datasets/LEAP/ChaosBench/resolve/main/process.sh
chmod +x process.sh

./process.sh era5
./process.sh climatology
```

  4. Update the `slams/config.py` field `ERA_DATADIR = <YOUR_ERA5_DIR>`, for instance `<PROJECT_DIR>/SLAMS/data`.

  5. All evaluations are summarized in a series of notebooks under `notebooks/` marked with the `02_` prefix.

    • a: Pixel-based data assimilation
    • b: Latent-based data assimilation NO observation (only background states)
    • c: Latent-based data assimilation with +1 observation (in-situ)
    • d: Latent-based data assimilation with +2 observation (in-situ + ex-situ)
    • e: Figures and tables generation

NOTE: Training your own model is simple; the training loop is defined in train_da.py. First, define your latent model in slams/nn.py or your score network in slams/score.py. Then, unify both under slams/model_da.py. An example, as described in the paper, is provided for your reference.
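The division of labor described above (latent model, score network, unifying assimilation model) can be sketched with a deliberately tiny NumPy toy. Every class and method name below is hypothetical and only mirrors the role of the corresponding file; it is not the repository's API.

```python
import numpy as np

class ToyLatentModel:
    """Stand-in for the latent (auto)encoder role of slams/nn.py."""
    def __init__(self, dim_in, dim_latent, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.standard_normal((dim_latent, dim_in)) / np.sqrt(dim_in)

    def encode(self, x):
        return self.W @ x

    def decode(self, z):
        # Pseudo-inverse decode; a real model learns this mapping.
        return np.linalg.pinv(self.W) @ z

class ToyScoreNetwork:
    """Stand-in for the score-network role of slams/score.py."""
    def __call__(self, z, t):
        # Placeholder score of a standard Gaussian; a real network is learned.
        return -z

class ToyModelDA:
    """Unifies both components, mirroring the role of slams/model_da.py."""
    def __init__(self, latent_model, score_network):
        self.latent = latent_model
        self.score = score_network

    def assimilate(self, x, n_steps=100, dt=0.01, seed=1):
        rng = np.random.default_rng(seed)
        z = self.latent.encode(x)
        for _ in range(n_steps):  # Langevin-style refinement in latent space
            z = z + self.score(z, 0.0) * dt \
                  + np.sqrt(2 * dt) * rng.standard_normal(z.shape)
        return self.latent.decode(z)

model = ToyModelDA(ToyLatentModel(8, 3), ToyScoreNetwork())
x_analysis = model.assimilate(np.ones(8))
print(x_analysis.shape)  # (8,)
```

The point of the sketch is the composition: encode, refine in latent space with a score model, decode back to state space.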

Citation

If you find any of the code useful, please consider citing these works:

@misc{qu2024deep,
      title={Deep Generative Data Assimilation in Multimodal Setting}, 
      author={Yongquan Qu and Juan Nathaniel and Shuolin Li and Pierre Gentine},
      year={2024},
      eprint={2404.06665},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

@article{rozet2024score,
  title={Score-based data assimilation},
  author={Rozet, Fran{\c{c}}ois and Louppe, Gilles},
  journal={Advances in Neural Information Processing Systems},
  volume={36},
  year={2024}
}
