Skip to content
Data Science & Data Engineering Workshop
Jupyter Notebook
Branch: master
Clone or download
Type Name Latest commit message Commit time
Failed to load latest commit information.
DataEngineering Fix SQL Fiddle link Oct 2, 2019
DataScience Added pip import, thanks Jada\! Oct 2, 2019
images Add large portion of Intermediate SQL, Advanced SQL template Oct 1, 2019 Update Oct 2, 2019

HackBU - Data Science & Engineering

Getting Started

In this workshop, we'll be covering two very hot topics in tech - data science and data engineering. The difference between these two fields is complicated and oftentimes blurry, but the difference can be summarized as follows:

  • Data Engineering involves transporting and cleaning data so that it can be used for analysis.

  • Data Science involves analyzing data to establish trends/patterns and make decisions.

The Data Science portion of the workshop covers Data Analysis (Numpy and Pandas) and Data Visulization (Matplotlib and Plotly).

The Data Engineering portion of the workshop covers SQL, a relational database query language, and Apache Kafka, a data streaming platform.

To get started, choose the section that most interests you!

Bug Bounties

Starting today, if you find a bug or mistake in our workshop, let an organizer know or make a pull request fixing it! We'll be handing out special HackBU stickers to those who find bugs/mistakes, and as always, we'll be giving stickers away to some random respondents from our feedback form.


If you attended our workshop on 10/2/19, please leave feedback here.

You can’t perform that action at this time.