This image is retrieved from Datashader organization.
The objective of this project is to create a tutorial on big data handling. The purpose is to learn how to deal with big data in an efficient and elegant manner using common data science techniques.
This project uses Bash and Python in the Jupyter Notebook interface to complete multiple tasks, including data acquisition, data cleaning, data analysis, and data visualization.
- Clone this repo to your local machine.
- Build the enviroment by following the instructions in the environmental.yml file.
- Open the index.ipynb and run the code.
Click the badge below. This badge will bring you to the virtual enviroment boosted through the BinderHub and you can run the code in the index.ipynb.