<div align="center"><img src="./images/DLI_Header.png"></div>

# Fundamentals of Accelerated Data Science with RAPIDS

## Presentation

Please execute the cell below to watch the instructor presentation before proceeding with the rest of the notebook. If you already watched this video while waiting for the GPU environment to load, you can skip the cell and continue with the rest of the notebook.

In [None]:
%%html
<video width="800" controls>
  <source src="https://dli-lms.s3.us-east-1.amazonaws.com/assets/s-ds-01-v1/01_01.mp4" type="video/mp4">
</video>

## Objectives

In this workshop you will use RAPIDS, a platform for end-to-end GPU-accelerated data science, to help stave off a simulated epidemic affecting the entire UK population, working with population-scale data. To that end, the workshop is divided into three sections:

- *GPU-Accelerated Data Manipulation*
- *GPU-Accelerated Machine Learning*
- *Project: Biodefense*

## GPU-Accelerated Data Manipulation

In this section you will work with several datasets, including some larger than a single GPU's memory, transforming UK population data, hospital data, and road network data in preparation for a variety of machine learning algorithms.

## GPU-Accelerated Machine Learning

In this section you will use a variety of machine learning and graph algorithms (K-means, DBSCAN, logistic regression, k-nearest neighbors, and single-source shortest path) to find ideal supply locations, clusters of infected people, probabilities of infection, nearest hospitals, and fastest routes to medical care.
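
To give a feel for one of the algorithms listed above, here is a minimal, CPU-only sketch of single-source shortest path using Dijkstra's algorithm and only the Python standard library. It is not the GPU-accelerated implementation used later in the workshop; the function name `single_source_shortest_path` and the toy `roads` network (with made-up travel times in minutes) are assumptions introduced purely for illustration.

```python
import heapq


def single_source_shortest_path(graph, source):
    """Return the shortest distance from `source` to every reachable node.

    `graph` maps each node to a list of (neighbor, weight) pairs.
    Weights must be non-negative (Dijkstra's algorithm).
    """
    distances = {source: 0.0}
    queue = [(0.0, source)]  # min-heap of (distance so far, node)
    while queue:
        dist, node = heapq.heappop(queue)
        if dist > distances.get(node, float("inf")):
            continue  # stale heap entry; a shorter route was already found
        for neighbor, weight in graph.get(node, []):
            candidate = dist + weight
            if candidate < distances.get(neighbor, float("inf")):
                distances[neighbor] = candidate
                heapq.heappush(queue, (candidate, neighbor))
    return distances


# Hypothetical toy road network; edge weights are travel times in minutes.
roads = {
    "home": [("junction", 4.0), ("hospital", 15.0)],
    "junction": [("hospital", 6.0), ("clinic", 3.0)],
    "clinic": [("hospital", 2.0)],
    "hospital": [],
}

travel_times = single_source_shortest_path(roads, "home")
print(travel_times["hospital"])  # 9.0, via junction and clinic
```

In this toy network the direct road to the hospital takes 15 minutes, while the route through the junction and the clinic takes 4 + 3 + 2 = 9 minutes, which is the kind of "fastest route to medical care" question the workshop answers at road-network scale on the GPU.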

## Project: Biodefense

In this section you will utilize everything you learned in the first two sections to stave off, over several simulated days, an epidemic affecting the entire UK population.

## Table of Contents for Section 1

**1-01_intro.ipynb**: This notebook.

**1-02_environment.ipynb**: Get familiar with your interactive coding environment.

**1-03_cudf_basics.ipynb**: Begin learning GPU-accelerated dataframe manipulation with cuDF.

**1-04_grouping_sorting.ipynb**: Learn more advanced cuDF operations.

**1-05_grid_converter.ipynb**: Advance your cuDF skills and use CuPy to apply fast user-defined functions, prepping datasets for machine learning.

**1-06_prep_graph.ipynb**: Use cuDF to prepare data for graph analytics.

**1-07_dask-cudf.ipynb**: Transform large datasets spread across multiple GPUs with Dask and cuDF.

**1-08_grid_converter_dask.ipynb**: Apply user-defined functions across Dask cuDF partitions using multiple GPUs.

## Next

Please proceed to the [next notebook](1-02_environment.ipynb).