# Data Visualisation with Tableau

As an analyst, you must be able to present your results clearly and concisely. Visualisation lets you both explore your data 🔍 and present your analyses 📊. We will use Tableau to build our visualisations.


## What you'll learn in this course 🧐🧐

*   Import any type of data into Tableau
*   Create insightful presentations with Tableau
*   Perform Data Mining tasks with Tableau



## Introduction to Tableau 📊

Tableau is one of the most popular Business Intelligence tools available today. One reason is that it is very easy to use.


### Setting up Tableau 🎡

There are several versions of Tableau, including Tableau Desktop, Tableau Online and Tableau Server. For this course we will use Tableau Public, the free version of Tableau Desktop. Here are the steps to install the tool:


* Go to [https://public.tableau.com](https://public.tableau.com)
* Enter your email and download Tableau.
* If the download did not start automatically, check your inbox for a welcome email from Tableau. It contains a download link.


![](https://drive.google.com/uc?export=view&id=11OJmlDWAOWh5iXSQF-1M2qjhrPuwSVFi)


**NB:** If you would like, you can request a license for Tableau Desktop: go to 👉👉 [https://www.tableau.com/academic/students](https://www.tableau.com/academic/students) and click on _Get Tableau for Free_. You will need to prove that you are registered with JEDHA by providing, for example, proof of payment.


### Importing data into Tableau 🗂️

With Tableau, you can import almost any type of file. We will mainly use Excel sheets, but other formats are available:

* _Text_ files, which correspond to `.txt` or `.csv` files.
* _JSON_ files, often used by web developers (`.json` extension).
* Spatial files, which contain geographic data (such as maps) 🌎
* Statistical files, such as those produced by SPSS (software owned by IBM).

Depending on your company, you will handle different types of files. However, most of the time, you will be using Excel or CSV.

**NB:** CSV (Comma Separated Values) files are text files in which the columns of each row are separated by commas. A single misplaced comma shifts every value after it into the wrong column, and this is a very common mistake. Make sure your data has been cleaned up, otherwise Tableau will not read your file properly.
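To make the misplaced-comma problem concrete, here is a minimal Python sketch that flags rows whose number of fields differs from the header's before you import the file. The function name `find_malformed_rows` and the sample data are hypothetical. A consistent field count does not guarantee clean data, it only catches the most common mistake.

```python
import csv
import io


def find_malformed_rows(csv_text, delimiter=","):
    """Return (line_number, field_count) for rows that do not match the header."""
    reader = csv.reader(io.StringIO(csv_text), delimiter=delimiter)
    header = next(reader, None)
    if header is None:
        return []  # empty input: nothing to check
    expected = len(header)
    problems = []
    for row in reader:
        if len(row) != expected:
            problems.append((reader.line_num, len(row)))
    return problems


# Hypothetical sample: the third line contains a stray comma
sample = "region,sales\nNorth,100\nSouth,2,50\nEast,75\n"
print(find_malformed_rows(sample))  # [(3, 3)]
```

For a real file, you would read its content with `open(path, newline="")` and pass the text to the same function.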

After opening Tableau, click on the type of file you want to import:

![](https://drive.google.com/uc?export=view&id=1OKsB3veuZtxL47GrzL1NQkdXKPrVIrfI)

Then select the file you want to import:

![](https://drive.google.com/uc?export=view&id=1vo9m8FV_Y1Vstd9VRjSl0LUhVQcts3_Y)

Once the data has been imported, you should see a table like this:

![](https://drive.google.com/uc?export=view&id=14bKUOTFjxoZHGJuWONZfHSta8jDo5lN4)

### Get started with Tableau 🎬


#### Accessing graphs

If you are familiar with Excel, you will find that Tableau is structured in a similar way.

* A graph is contained within a _Sheet_. 🪆
* Several _sheets_ are contained within a _Dashboard_. 🪆🪆
* And finally, a _dashboard_ can be inserted within a _Story_, whose purpose is to present an analysis. 🪆🪆🪆

To build a graph, click on a sheet. The graph-building interface will then appear:


![](https://drive.google.com/uc?export=view&id=1Gw7L5uiqrLkK-VrmHGxpgZ-uyIHY-qoa)


**NB:** You can rename the sheet by double-clicking on it.


#### Measures & Dimensions 📏

_Measures_ and _Dimensions_ represent different types of data in Tableau. _Measures_ correspond to quantitative data (numbers you can aggregate, such as sales), while _Dimensions_ correspond to qualitative data (categories, such as regions).

Tableau will always try to guess whether a given variable is a *measure* or a *dimension*, and it is usually right. If it does get it wrong, you can always change the classification in the tool. 👍

In Tableau, _measures_ and _dimensions_ are on the left side of the interface:

![](https://drive.google.com/uc?export=view&id=1SvEUmAq0oIEc10KZbb3dCQmUKMDovMb0)

**NB:** Sometimes it may be necessary to change a field from _measure_ to _dimension_ or vice versa. You can do this very simply by dragging the field from the _measures_ section and dropping it in the _dimensions_ section (or the other way round).
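As a rough illustration of why an automatic guess can go wrong, the Python sketch below classifies a column as a measure when all of its values parse as numbers. This is a deliberately naive, hypothetical rule (`guess_role` and the sample columns are made up for the example), not Tableau's actual logic.

```python
def guess_role(values):
    """Guess 'measure' if every value parses as a number, else 'dimension'."""
    try:
        for value in values:
            float(value)
    except (TypeError, ValueError):
        return "dimension"
    return "measure"


# Hypothetical columns from a sales table
columns = {
    "Region": ["North", "South", "East"],
    "Sales": ["100", "250.5", "75"],
    "Postal Code": ["75001", "69002", "13001"],
}
roles = {name: guess_role(values) for name, values in columns.items()}
print(roles)
# {'Region': 'dimension', 'Sales': 'measure', 'Postal Code': 'measure'}
```

The postal code looks numeric but is really a category: summing postal codes is meaningless. This is the kind of field you would move by hand to the _dimensions_ section.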

#### Creating your first graph 📈

Creating a graph is very simple with Tableau. Simply drag and drop a measure and a dimension into the _Columns_ and _Rows_ of the sheet.

![](https://drive.google.com/uc?export=view&id=1qdLSB9OdRwcvPbF-3raoOYnNq78eixqq)


**NB:** It is not mandatory to put dimensions in columns and measures in rows. It all depends on the visual purpose of the graph.


#### Calculated Fields 🧮

Often your raw data is not sufficient for what you want to create. For example, you might need to apply taxes to a gross margin variable 💰. This is where calculated fields come in.

In the _measures_ section, right-click and select "_Create Calculated Field..._". You will then have to give your field a name and write your calculation. The basic operators work the same way as in Excel ("_*_" for multiplication, "_/_" for division).

If you want to use pre-existing _measures_ in a _calculated field_, type the name of your measure in the formula editor. The first few letters will be enough for Tableau to suggest it to you; just click on it.

![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/calculated_field.gif)
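A calculated field is simply a new column derived from existing ones, row by row. The Python sketch below shows the same idea for the tax example. The column names, the 20% tax rate and the function `add_margin_after_tax` are assumptions made for illustration, not values from the course data.

```python
# Hypothetical tax rate and rows, for illustration only
TAX_RATE = 0.20
rows = [
    {"product": "A", "gross_margin": 1000.0},
    {"product": "B", "gross_margin": 250.0},
]


def add_margin_after_tax(rows, tax_rate):
    """Return new rows with a derived 'margin_after_tax' column."""
    return [
        {**row, "margin_after_tax": row["gross_margin"] * (1 - tax_rate)}
        for row in rows
    ]


result = add_margin_after_tax(rows, TAX_RATE)
for row in result:
    print(row["product"], round(row["margin_after_tax"], 2))
```

In Tableau, a comparable formula would reference the existing measure by name inside square brackets, for example `[Gross Margin] * (1 - 0.2)`, assuming your measure is called _Gross Margin_.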

#### Colors 🎨

It may seem trivial, and yet it is often forgotten: colors are important in Data Visualisation because they help your audience understand your results easily.

If you want to change colors in your graph, click on "_Color_" in the _Marks_ section and choose the one that suits you.

We advise you to keep color coding simple: 

* Red often signifies danger or things that you must be careful about. ⛔
* Green is often a sign of serenity or money, while blue is a neutral colour that most people like. 💸

If you want to assign colors according to a dimension (e.g. one color per region in your graph), drag your dimension and drop it on "_Color_" in the _Marks_ section.


![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/colors.gif)


You can change the colors by clicking on "_Color_" in the _Marks_ section and then on "_Edit Colors_".


![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/edit_colors.gif)


#### Labels 🏷️

You can add labels to your graphs. This is useful when you want to add clarity to your presentations. Remember that your audience did not take part in your analyses and will discover your results during the presentation, so be explicit and do not expect them to guess anything you do not show.

To add a label, go to the _Marks_ section and click on _Label_. You can customize it in any way you like. You can also drag a _dimension_ or a _measure_ onto _Label_ to display its values on your graph.


![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/Labels.gif)


Sometimes you may want to change the format of your labels. For example, you may want to show a monetary value rather than just a number. To do this, right-click and select "_Format..._". Then click on "_Axis_" in the pane on the left of the tool to change the format of your data.


### Export your work 🎁

If you use Tableau Public, your export options are limited. The simplest option is to take a screenshot *(shift + cmd + 4 on Mac)*.

You can also save your work to Tableau Public. Your graphs will be stored on the Tableau Public servers: everybody will be able to view your work, and you will be able to come back to it whenever you want.

If you are using Tableau Desktop, you will be able to go to Worksheet > Export and select how you want to export your graphs.

![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/Export_your_work.gif)


## Data Mining with Tableau 🚀🚀

### Visualisation of an A/B Test 💕

To visualise an A/B test quickly in Tableau, you only need to remember two things:

* First, use different colors so that your audience can tell the difference between your control group and your variation. To do so:
    * Simply drag your dimension onto "_Color_" in the _Marks_ section.
    * Then add labels to clarify the results of each group.

* Second, convert numbers into percentages. To do this:
    * Open the drop-down menu on your measure (look for the arrow on the right side of the measure) and select "_Add Table Calculation_".
    * In the table calculation, choose "_Percent of Total_" and then select "_Table (down)_" in the _Compute Using_ section. Each value is then shown as a percentage of the column total.

![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/AB_test.gif)
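To make the percent-of-total calculation explicit, here is a minimal Python sketch of the arithmetic: each value is divided by the sum of its column. The conversion counts and the function name `percent_of_total` are hypothetical.

```python
def percent_of_total(counts):
    """Map each category to its share of the column total, in percent."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("total is zero; percentages are undefined")
    return {label: 100 * value / total for label, value in counts.items()}


# Hypothetical A/B test: number of conversions observed in each group
conversions = {"control": 120, "variation": 180}
shares = percent_of_total(conversions)
print(shares)  # {'control': 40.0, 'variation': 60.0}
```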

These two techniques are a quick way to visualize an A/B test. Be careful, however: ⚠️ **this does not show the statistical significance of the A/B test** ⚠️. You will have to run a [Chi2 test](http://www.evanmiller.org/ab-testing/chi-squared.html) to know whether the difference you observe is statistically significant.
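As a sketch of what such a test computes, the Python function below runs a Pearson chi-squared test on a 2x2 table (converted / not converted, for each group) using only the standard library. It applies no continuity correction, and the visitor and conversion counts are hypothetical. In practice you would probably rely on a statistics library or on the calculator linked above.

```python
import math


def chi_squared_2x2(conv_a, total_a, conv_b, total_b):
    """Pearson chi-squared test (no continuity correction) on a 2x2 table.

    Returns (statistic, p_value) for 1 degree of freedom.
    Assumes every expected cell count is non-zero.
    """
    observed = [
        [conv_a, total_a - conv_a],
        [conv_b, total_b - conv_b],
    ]
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    statistic = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed[i][j] - expected) ** 2 / expected

    # With 1 degree of freedom, P(X > x) = erfc(sqrt(x / 2))
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


# Hypothetical results: 120 conversions out of 1000 visitors for the control,
# 180 out of 1000 for the variation
stat, p = chi_squared_2x2(120, 1000, 180, 1000)
print(f"chi2 = {stat:.2f}, p = {p:.5f}")
```

A small p-value (commonly below 0.05) suggests that the difference between the two groups is unlikely to be due to chance alone.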

### Working with Aliases 🤡

Often you will need to replace the default values of a field with clearer labels. For example, if a field holds binary values such as 0 and 1, it is best to display them as "clicked" / "didn't click" or "bought" / "didn't buy".

To add an alias, simply right-click on your dimension and select *Edit Aliases*. A window will appear and you can replace the values with whatever you want. Aliases apply to the values of a dimension, so if Tableau classified your field as a measure, convert it to a dimension first.

![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/Edit_aliases.gif)

Changing the values with an alias will not change the values in your database. It simply brings clarity to your graph.
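Conceptually, an alias is a lookup table applied at display time. The short Python sketch below illustrates this with hypothetical values (`clicked`, `aliases` and `display_values` are made-up names): the labels change, the underlying data does not.

```python
# Hypothetical raw values and alias table
clicked = [0, 1, 1, 0]
aliases = {0: "didn't click", 1: "clicked"}


def display_values(values, aliases):
    """Return display labels; values without an alias are shown unchanged."""
    return [aliases.get(value, value) for value in values]


labels = display_values(clicked, aliases)
print(labels)   # ["didn't click", 'clicked', 'clicked', "didn't click"]
print(clicked)  # [0, 1, 1, 0] (the raw data is untouched)
```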


### Reference Line 🎼

Adding a reference line is useful when you want to compare your results with those of your competitors or with your own past results. For example, if you want to analyze your revenue for the year, you may want to compare it to the average revenue generated by your competitors. This average revenue is your baseline.

To add a reference line, right-click on your Y-axis and select "_Add Reference Line_". You then have several options: a single line, a band, a distribution or a box plot. Choose the one that fits your needs.

![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/reference_line.gif)
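The reference line in the example above is just a constant computed from another series. As a minimal Python sketch with made-up figures (all names and numbers are hypothetical), the baseline is the average competitor revenue and each of your values is compared against it:

```python
from statistics import mean

# Hypothetical yearly revenues
competitor_revenue = [90_000, 110_000, 100_000]
our_revenue = {2021: 95_000, 2022: 100_000, 2023: 120_000}

# The baseline is what the reference line would display
baseline = mean(competitor_revenue)


def compare_to_baseline(values, baseline):
    """Label each value as 'above', 'below' or 'equal' relative to the baseline."""
    result = {}
    for key, value in values.items():
        if value > baseline:
            result[key] = "above"
        elif value < baseline:
            result[key] = "below"
        else:
            result[key] = "equal"
    return result


comparison = compare_to_baseline(our_revenue, baseline)
print(baseline)
print(comparison)  # {2021: 'below', 2022: 'equal', 2023: 'above'}
```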


### Combining two graphs 🏳️‍🌈

Sometimes it is useful to show two graphs in the same view to give your audience some perspective.

You can easily do this with Tableau. To add another graph on the same sheet, simply drag and drop a second measure into the _Rows_. Tableau will automatically create a new graph.



![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/combining_two_graphs.gif)

## Create a dashboard ✨

Once you have all your sheets created, you can combine them in a _dashboard_ to have a clear and condensed view of what you want to present.

To create a _dashboard_, click on the _dashboard_ tab next to the sheets at the bottom of the window and then add the graphs you want.

![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/Create_dashboard.gif)

## Create a story 🌷

A story is a sequence of worksheets and dashboards. Its goal is to let you present your analysis without necessarily using PowerPoint.

![](https://essentials-assets.s3.eu-west-3.amazonaws.com/M01-Data_visualisation/D01-Data_visualisation/story.gif)


## Resources 📚📚

* Data Science A-Z - [http://bit.ly/2tNwqVA](http://bit.ly/2tNwqVA)

* Learn Tableau - [http://tabsoft.co/2suDgLu](http://tabsoft.co/2suDgLu)

* Intro to Tableau - [http://bit.ly/2uzOgIT](http://bit.ly/2uzOgIT)

* Chi Squared Test - [http://bit.ly/1BXeqUk](http://bit.ly/1BXeqUk)

* Table Calculations - [https://help.tableau.com/current/pro/desktop/en-us/calculations_tablecalculations.htm](https://help.tableau.com/current/pro/desktop/en-us/calculations_tablecalculations.htm)