Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
246 changes: 40 additions & 206 deletions 00-before-we-begin.ipynb

Large diffs are not rendered by default.

1,493 changes: 0 additions & 1,493 deletions 01-drastically-different-time-series.ipynb

This file was deleted.

988 changes: 0 additions & 988 deletions 02-bar chart as an effective alternative to a pie chart.ipynb

This file was deleted.

1,911 changes: 0 additions & 1,911 deletions 03-pie charts as an alternative to bar charts-inclass.ipynb

This file was deleted.

845 changes: 0 additions & 845 deletions 04-data models and whats in between.ipynb

This file was deleted.

18 changes: 4 additions & 14 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,19 +6,19 @@

Code repository is here: https://github.com/bgbg/datascience_dataviz_workshop

During the workshop, I will share code snippets using [this shared directory](https://drive.google.com/open?id=1umiVMh5xXAxDHhXUkkNgVdDztnSjGCkx)
During the workshop, I might share code snippets using [this shared directory](https://drive.google.com/open?id=1umiVMh5xXAxDHhXUkkNgVdDztnSjGCkx)

1. **Make sure you meet all the requirements. We will not be able to support missing installations or non-working code**. The provided notebook `00-before-we-begin.ipynb` will take you through the verification process. It will make sure that you have all the required software installed, and that you have enough knowledge to proceed with the workshop. It is up to you to make sure that everything works.
1. **Make sure you meet all the requirements. I will not be able to support missing installations or non-working code**. The provided notebook `00-before-we-begin.ipynb` will take you through the verification process. It will make sure that you have all the required software installed, and that you have enough knowledge to proceed with the workshop. It is up to you to make sure that everything works.

2. Visit the [workshop repository](https://github.com/bgbg/datascience_dataviz_workshop) one day before the workshop. I will update that repository with code examples and data sets. You should have them on your computer before the workshop starts.

3. Follow [my blog](https://gorelik.net/blog)

## Workshop description
Data visualization is an indispensable tool for any data scientist. It serves as a means to convey a message or explain a concept. You would never settle for default settings of a machine learning algorithm. Instead, you would tweak them to obtain optimal results. Similarly, you should never stop with the default results you receive from a data visualization framework. Doing so leads to suboptimal results and makes you and your message less convincing.

After this workshop, you will be able to name three most common mistakes in data visualization, and learn how to apply them in your graphs.

During this workshop, a short theoretical introduction will be followed by three lab examples. We will use matplotlib in Jupyter notebooks to practice the knowledge. You are expected to have at least intermediate knowledge of Python, Jupyter notebook interface, and matplotlib object-oriented interface.
During this workshop, a short theoretical introduction will be followed by several lab examples. We will use matplotlib in Jupyter notebooks to practice the knowledge. You are expected to have at least intermediate knowledge of Python, Jupyter notebook interface, and matplotlib object-oriented interface.


## Setup
Expand All @@ -36,14 +36,4 @@ This will create an environment in your local directory. Next, activate the envi
source activate ./dataviz-env
jupyter notebook
```
## Code samples and data sets

I'll update this repository with code samples and data sets. Stay tuned!

## Timeline
1. Theoretical introduction: three most common mistakes in data visualization (45 minutes)
2. Test case (LAB): Plotting several radically different time series on a single graph (45 minutes)
3. Test case (LAB): Bar chart as an effective alternative to a pie chart (45 minutes)
4. Test case (LAB): Pie chart as an effective alternative to a bar chart (45 minutes)

The time line includes some time for recess.
11 changes: 0 additions & 11 deletions data/knesset_20.csv

This file was deleted.

1,600 changes: 0 additions & 1,600 deletions data/winequality-red.csv

This file was deleted.

72 changes: 0 additions & 72 deletions data/winequality.names.txt

This file was deleted.

Loading