# Processing the drifting buoy data

Drift tracks from instruments deployed during the MOSAiC campaign were obtained from the Arctic Data Center and saved in the folder `data/adc_dn_tracks`. "DN" stands for "Distributed Network", a set of autonomous stations deployed around a heavily instrumented, crewed observatory as it drifted across the Arctic from 2019 to 2020.

For this project, we need quality-controlled buoy positions at hourly resolution. The raw data come at a variety of time resolutions, from 5 minutes to 12 hours, and the resolution may change over the lifetime of a buoy. This is not accidental: high time resolution comes at the cost of battery life, so some buoys were switched to high time resolution only when it was needed. For example, when storms were forecast, the sampling rate was raised so that the rapid ice motion in response to the fast-moving storm system could be captured.
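To make the idea of a changing sampling interval concrete, here is a minimal sketch using a synthetic track rather than real MOSAiC data. It estimates the time step between fixes in minutes and then linearly interpolates the positions onto an hourly grid. The column names `latitude` and `longitude` are assumptions, and this is not the method used by `icedrift` or `process_mosaic.py`, just one simple way to do it with Pandas.

```python
import numpy as np
import pandas as pd

# Synthetic track (not real buoy data): 12 hours of 30-minute fixes,
# followed by 2 hours of 5-minute fixes.
slow = pd.date_range("2020-01-01 00:00", periods=24, freq="30min")
fast = pd.date_range("2020-01-01 12:00", periods=24, freq="5min")
times = slow.append(fast)

# Positions drift linearly in time so the interpolation is easy to check.
elapsed_hours = np.asarray((times - times[0]).total_seconds()) / 3600
track = pd.DataFrame(
    {
        "latitude": 85.0 + 0.01 * elapsed_hours,    # assumed column name
        "longitude": 120.0 - 0.05 * elapsed_hours,  # assumed column name
    },
    index=times,
)

# Time step between consecutive fixes, in minutes.
step_minutes = track.index.to_series().diff().dt.total_seconds() / 60
min_step = step_minutes.min()
max_step = step_minutes.max()
print(f"sampling interval ranges from {min_step:.0f} to {max_step:.0f} minutes")

# Hourly grid covering the whole hours spanned by the track.
hourly_index = pd.date_range(
    track.index[0].ceil("60min"), track.index[-1].floor("60min"), freq="60min"
)

# Add the hourly timestamps to the index, interpolate linearly in time,
# then keep only the hourly rows.
hourly = (
    track.reindex(track.index.union(hourly_index))
    .interpolate(method="time")
    .loc[hourly_index]
)
print(hourly.head())
```

Linear interpolation in latitude and longitude is a simplification. Near the pole, or across the 180° meridian, interpolating in projected coordinates is usually safer.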

The files in `scripts/icedrift` contain functions for buoy processing that are in the process of being added to an open-source library. Since the library has not yet been published on conda-forge or a similar repository, we'll make the package importable by adding the scripts folder to the Python path.

In [5]:
import sys
sys.path.append('../scripts')
from icedrift import cleaning as cln

In [None]:
# You can run the processing script with the %run line magic like this,
# or run it separately from the command line
%run ../scripts/process_mosaic.py

To get more familiar with the data, try the following tasks.

1. Choose a buoy with high temporal resolution (low f_min). Use Pandas to load the buoy data from the original folder, the qc_buoys folder, and the interp_buoys folder. How do the time series differ? Did the quality control step remove many observations? Does the lower-frequency interpolated data appear to represent the high-frequency data well?
2. Choose a buoy with lower temporal resolution (f_min of 240). Repeat the same process as in (1). From what you can tell, how does the data quality compare? Does the interpolation process appear at this stage to add unrealistic features? (You'll get more of a feel for what's realistic as we go, and as we calculate other quantities from the buoy trajectories.)
3. Choose two buoys. How similar are the paths that they take?
4. Find a buoy where the raw track has f_min close to 60 minutes and the qc_track is a different length. Can you find where the QC process flagged and dropped observations? Do you agree with the algorithm's choices?
5. Using Cartopy or ProPlot, make a geographic plot showing the buoy trajectories. Make the buoy tracks for DN1 and DN2 different colors. (I recommend a North Polar Stereographic map projection.)
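
As a starting point for task 4, here is a minimal sketch of one way to find which timestamps are present in a raw track but missing from its quality-controlled version. It uses synthetic data and assumes both tracks are DataFrames indexed by timestamp, and that the QC step drops rows rather than only flagging them. The helper `dropped_times` is hypothetical and not part of `icedrift`.

```python
import pandas as pd


def dropped_times(raw, qc):
    """Return timestamps in the raw track that are absent from the QC track."""
    return raw.index.difference(qc.index)


# Synthetic hourly track (not real buoy data) with one implausible jump.
times = pd.date_range("2020-01-01", periods=6, freq="60min")
raw = pd.DataFrame(
    {
        "latitude": [85.00, 85.01, 89.50, 85.03, 85.04, 85.05],
        "longitude": [120.0] * 6,
    },
    index=times,
)

# Stand-in for a QC result: the row with the jump has been removed.
qc = raw.drop(times[2])

removed = dropped_times(raw, qc)
print(f"{len(raw) - len(qc)} observation(s) removed")
print(raw.loc[removed])
```

Plotting the rows in `raw.loc[removed]` on top of the raw track is a quick way to judge whether you agree with each removal.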
