# What is LEAP-Pangeo?

<img src="https://leap.columbia.edu/wp-content/uploads/2024/05/LEAP-Pangeo.png" width="600"/>

> LEAP-Pangeo is our modern cloud-based data and computing infrastructure for climate data science.

Check out our [technical docs](https://leap-stc.github.io/intro.html) for more information!

## Components

- **Data**: [Cloud Storage Buckets](https://leap-stc.github.io/leap-pangeo/jupyterhub.html#leap-pangeo-buckets) enable researchers to store and share data within LEAP (and soon with the entire world). We produce, transform, or link to Analysis-Ready Cloud-Optimized (ARCO) data within LEAP.
- **Compute**: We like to work with data through JupyterLab/Notebooks as the interface, with scalable compute backed by Dask running on Kubernetes. But you can bring your own compute if you like! Or just take a look at the data with the Dataviewer.
    - All our compute and data access relies on the same central cloud storage location and on clearly documented standards and interfaces.
- **Discovery**: The different [types of data](https://leap-stc.github.io/policies/data_policy.html#types-of-data-used-at-leap) are collected in our [LEAP Data Catalog](https://catalog.leap.carbonplan.org/) (final URL still pending).

Read more about our [design principles](https://leap-stc.github.io/leap-pangeo/architecture.html#design-principles).

## What do researchers get from this?
- No hassle setting up your own computer: all you need locally is a web browser!
- You don't even need a fast internet connection: I have accessed the Hub from an Amtrak train just fine 😁.
- A familiar environment where you can work the way you like, wherever you are now or will be in the future!

```python
from IPython.display import IFrame

# Embed the slide deck, opening at slide 16 (width 600 px, height 340 px)
IFrame("https://speakerdeck.com/player/33656c048003411ba1f8e7e7dfd03676?slide=16", 600, 340)
```

## How is all of this different from working on my institution's server/HPC?

- Full reproducibility: As long as you record the version of the software environment and keep your code under version control, you will be able to reproduce your results every time.
- Easy collaboration within and outside LEAP: [Hermans et al. 2024](https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2023EF004188?af=R) was the product of a transatlantic collaboration that felt just like being in the same city!
- Easier support: Just send a gist, snippet, or repository to the [Data and Compute Team]() and we can quickly debug, suggest fixes, and get scientists back to what matters.
- Consistent teaching: Teach with the same data and environments that you use for research.
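
One lightweight way to act on the reproducibility point above is to record the software environment next to the version-controlled code. The sketch below uses only the Python standard library. The function names `environment_snapshot` and `fingerprint` are hypothetical helpers introduced for illustration, not part of any LEAP tooling, and the sketch assumes the packages of interest are installed as regular Python distributions.

```python
import hashlib
import json
import platform
from importlib import metadata


def environment_snapshot() -> dict:
    """Collect the Python version and the versions of installed packages."""
    packages = {}
    for dist in metadata.distributions():
        name = dist.metadata.get("Name")
        if name:  # skip distributions whose metadata lacks a name
            packages[name.lower()] = dist.version
    return {
        "python": platform.python_version(),
        "packages": dict(sorted(packages.items())),
    }


def fingerprint(snapshot: dict) -> str:
    """Hash the snapshot so two environments can be compared at a glance."""
    payload = json.dumps(snapshot, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


snapshot = environment_snapshot()
print(snapshot["python"], len(snapshot["packages"]), fingerprint(snapshot)[:12])
```

Saving the snapshot as JSON in the same repository as the analysis code gives collaborators something concrete to compare against. Matching fingerprints suggest, but do not prove, identical environments, because system libraries are not captured.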

## What we will demo today
We can always go off-script and live code, so feel free to ask questions at **any time**!

- Quick intro to notebooks.
- Write data to cloud buckets, read it back, and share some data with others.
- Open a dataset from the catalog and visualize it in a couple of different ways.
- Reproduce an IPCC plot from CMIP6 data live, plus optional extra CMIP6 data analysis.
- Optional: Scale up to large data by working with high-resolution ocean model output.
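
As a preview of the bucket step in the list above, here is a minimal sketch of writing a dataset to cloud storage and lazily reading it back. The bucket name `gs://example-scratch-bucket`, the per-user prefix layout, and the helper names `user_store` and `round_trip` are assumptions made for illustration; the technical docs describe the actual bucket locations. The round trip itself assumes `xarray`, `zarr`, and `gcsfs` are installed and that you have write access to the bucket.

```python
# Hypothetical bucket used for illustration only; replace with a real one.
SCRATCH_BUCKET = "gs://example-scratch-bucket"


def user_store(username: str, name: str, bucket: str = SCRATCH_BUCKET) -> str:
    """Build the URL of a Zarr store under a per-user bucket prefix."""
    return f"{bucket.rstrip('/')}/{username.strip('/')}/{name}.zarr"


def round_trip(ds, url: str):
    """Write an xarray Dataset to a Zarr store, then lazily reopen it."""
    import xarray as xr  # imported here so user_store works without xarray

    ds.to_zarr(url, mode="w")  # mode="w" overwrites an existing store at url
    # chunks={} asks xarray to return lazy, Dask-backed arrays
    return xr.open_dataset(url, engine="zarr", chunks={})


print(user_store("your-username", "demo"))
```

With credentials in place, a call might look like `round_trip(ds, user_store("your-username", "demo"))`. Sharing then amounts to sending a collaborator the store URL, provided they have read access to the same bucket.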

Navigate to https://github.com/jbusecke/leap-pangeo-demos to get started!