This repository contains all the homeworks and exercises for the Big Data Lectures imparted by the Colombian National University conducted by Alvaro Mauricio Montenegro Diaz
This repository contains all the homeworks and exercises for the Big Data Lectures in python, dask, spark and SQL.
- I put into practice the handling of python, numpy and pandas.
- I learned about conda, package installation and environment management.
- Manage, creation and design databases using SQL, mysql and DBeaver.
- Parallel computing using Dask.
- Data management and data processing using spark.