# Create a small sample RiOMar dataset on Datarmor

## Context

### Purpose

The goal is to create a smaller RiOMar dataset for testing regridding to HEALPix on Pangeo EOSC.

### Description

In this notebook, we will:
- Open a RiOMar data file
- Select a few times to reduce the amount of data
- Save the reduced data as a Zarr store and make it accessible on Datarmor

## Contributions

### Notebook


- Tina Odaka (author), IFREMER (France), @tinaok

## Bibliography and other interesting resources

- [RiOMar](https://coast.ifremer.fr/Laboratoires-Environnement-Ressources/LER-Pertuis-Charentais-La-Tremblade/Projets/RIOMAR-2024-2030)


## How to set up the Pangeo environment on Datarmor for the FAIR2Adapt RiOMar use case

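```{bash}
ssh datarmor

# Create the environment, then activate it so that pip and ipykernel target it
micromamba create -n riomar python=3.12 xarray zarr hdf5 ipykernel h5netcdf dask netCDF4 bottleneck scipy cftime numba healpy matplotlib hvplot
micromamba activate riomar
pip install git+https://github.com/IAOCEA/xarray-healpy.git
# Register the environment as a Jupyter kernel named "riomar"
python -m ipykernel install --user --name=riomar
```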

Then connect to `https://datarmor-jupyterhub.ifremer.fr/` and select the `riomar` kernel.
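
Before running the cells below, it can help to confirm that the kernel actually sees the packages installed above. The following is a minimal sketch using only the standard library. The helper name `missing_modules` is hypothetical, and the import name `xarray_healpy` for the `xarray-healpy` package is an assumption.

```python
import importlib.util

# Import names (not conda package names) of some packages installed above.
# Assumption: xarray-healpy is importable as `xarray_healpy`.
REQUIRED = ["xarray", "zarr", "h5netcdf", "netCDF4", "dask", "healpy", "xarray_healpy"]


def missing_modules(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(REQUIRED)
print("missing:", missing or "none")
```

If the list is not empty, the notebook is probably running on a different kernel than `riomar`.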



In [2]:
import xarray as xr

In [3]:
# Open the grid file, which provides the reference longitude/latitude arrays
url = '/home/lops-oh-fair2adapt/riomar/misc/croco_grd.nc'
grid = xr.open_dataset(url, engine='netcdf4')
#grid.to_netcdf('/home/lops-oh-fair2adapt/riomar/misc/croco_grd_hdf5.nc',format='NETCDF4')
grid

In [4]:
# Open one GAMAR output file and keep only the temperature variable
url = "/home/lops-oh-fair2adapt/riomar/GAMAR/GAMAR_1h_inst_Y2004M01.nc"
ds = xr.open_dataset(url, engine='h5netcdf')[["temp"]]
ds

In [5]:
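# Substitute the longitude/latitude values from the grid file;
# `* 0 + ...` keeps the dimensions of the existing coordinate variables
ds["nav_lon_rho"] = ds["nav_lon_rho"] * 0 + grid.lon_rho.data
ds["nav_lat_rho"] = ds["nav_lat_rho"] * 0 + grid.lat_rho.data
# Ocean mask: True where temperature is defined at the first time step and first level
ds["ocean_mask"] = ds.temp.isel(time_counter=0, s_rho=0).notnull()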
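
To illustrate how the `ocean_mask` expression above behaves, here is a minimal sketch on a synthetic dataset. The array shape, the horizontal dimension names `y_rho` and `x_rho`, and the use of NaN to mark land points are assumptions made for illustration; the real file may differ.

```python
import numpy as np
import xarray as xr

# Synthetic stand-in for the model output: NaN marks land points (assumption).
# Dimension order: (time_counter, s_rho, y_rho, x_rho); y_rho/x_rho are hypothetical names.
temp = np.full((2, 3, 4, 5), 10.0)
temp[:, :, 0, :] = np.nan  # pretend the first row of the grid is land
demo = xr.Dataset({"temp": (("time_counter", "s_rho", "y_rho", "x_rho"), temp)})

# Same expression as in the notebook cell: True where the first time step
# and first vertical level hold valid data.
ocean_mask = demo.temp.isel(time_counter=0, s_rho=0).notnull()

print(ocean_mask.dims)
print(int(ocean_mask.sum()), "ocean points out of", ocean_mask.size)
```

The mask is two-dimensional and boolean, so it can be stored alongside `temp` and reused later to exclude land points.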

In [7]:
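smallpath = "/home/lops-oh-fair2adapt/riomar/small.zarr"
# Keep the first five time steps, one time step per chunk
small = ds[['temp', 'ocean_mask']].isel(time_counter=slice(0, 5)).chunk({'time_counter': 1})
# mode='w' overwrites any existing store at this path
small.to_zarr(smallpath, mode='w')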

<xarray.backends.zarr.ZarrStore at 0x2aabd3465a20>

In [8]:
# Reopen the Zarr store to check what was written
ds = xr.open_dataset(smallpath, engine='zarr')
ds
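
To get a feel for how much the time selection used for `small.zarr` reduces the data, here is a minimal sketch on a synthetic dataset. It assumes, from the file name `GAMAR_1h_inst_Y2004M01.nc`, that the source file holds hourly output for January 2004 (744 time steps). The spatial shape and the names `y_rho` and `x_rho` are hypothetical.

```python
import numpy as np
import xarray as xr

# Synthetic stand-in for one month of hourly output (assumed 31 days * 24 h = 744 steps).
temp = np.zeros((744, 2, 4, 5), dtype="float32")
full = xr.Dataset({"temp": (("time_counter", "s_rho", "y_rho", "x_rho"), temp)})

# Keep the first five time steps, as in the notebook cell that writes small.zarr.
subset = full.isel(time_counter=slice(0, 5))

print("time steps kept:", subset.sizes["time_counter"])
print("size ratio (full / subset):", full.nbytes / subset.nbytes)
```

Under these assumptions the subset holds 5 of 744 time steps, so its in-memory size shrinks by the same ratio.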