# <h1 style="text-align: center;" class="list-group-item list-group-item-action active" data-toggle="list" role="tab" aria-controls="home">Analyzing Data in Tableau</h1>

Take your Tableau skills up a notch with advanced analytics and visualizations. In this course, you’ll learn how to create detail-rich map visualizations, configure date and time fields to show trends over time, and extend your data using Calculated Fields. You’ll also apply your new skills to complete a customer analytics case study. Through hands-on activities, you’ll learn how to create bins, customize filters and interactions, and apply quick table calculations. Finally, you’ll learn power user techniques, including how to slice and dice data and apply dynamic sets and groups—bringing you one step closer to being Tableau Desktop Specialist certification-ready.

<a id="toc"></a>

<h3 class="list-group-item list-group-item-action active" data-toggle="list" role="tab" aria-controls="home">Table of Contents</h3>
    
* [1. Preparing for Analysis](#1)
    - Data preparation
    - Preparing the data
    - Calculated fields to extend data
    - Visualizations for exploratory analysis of trends
    - Discrete time analysis and Quick Table Calculations
    - Slicing and dicing

* [2. Exploring Visualizations](#2) 
    - Who are your customers
    - Exploring user data
    - Building a KPI dashboard
    - The distribution of users
    - Visualizing distributions
    - Working with bins
    
* [3. Mapping Analysis](#3)
    - Interactive mapping of customer activity over time
    - Customer activity in motion
    - Layering and total dock utilization
    - Quick table calculations for ranking
    - Combining characteristic and quick table calculations
    - KPI and time segments
    
* [4. Groups, Sets, and Parameters](#4)
    - Groups
    - Groups for regions
    - Creating seasonal groups
    - Parameters and sets
    - Parameters
    - Sets

## <a id="1"></a>
<font color="lightseagreen" size=+2.5><b>1. Preparing for Analysis</b></font>

<a href="#toc" class="btn btn-primary btn-sm" role="button" aria-pressed="true" style="color:white" data-toggle="popover">Table of Contents</a>

Learn best practices for organizing fields into dimensions and measures and how to configure date and time fields for trend analysis. All the while, you’ll use Calculated Fields, quick table calculations, and highlight actions to elevate your visualizations and reveal the hidden insights.

### 1 01 Data preparation

1. Data preparation

Hello and welcome! My name is Lis and I'll be one of your instructors in this Tableau course. In this first chapter, we'll leverage visual analytics to reveal insights and to show relationships not easily seen in traditional reports. We'll apply Tableau's built-in tools and calculations to extend insights provided by the source data. Let's get started!

2. Data preparation

![image.png](attachment:image.png)

Data preparation is a crucial step in the data analytics workflow. With any new dataset, we need to first examine it to see if any fields need refinement. We should also consider creating calculated fields from existing fields to more effectively tell our data story. And, it's important to take a close look at the fields and see which can be summarized and grouped at a higher level. Finally, we also want to identify categorical fields that can be used to slice and dice the data. Slice and dice means breaking down information into smaller parts with different perspectives. We'll see examples of each in the following exercises, but first let's talk about the dataset we'll use throughout the course.

3. Chicago's Divvy bike sharing system

![image-2.png](attachment:image-2.png)

Divvy is Chicago's bike sharing system. Together with the city of Chicago, Divvy publishes historical trip data and makes it available for public use. Trip data goes back as far as 2013. That's a lot of data, so we'll focus on trips from the first half of 2019. Our data is split into two tables.

4. Divvy dataset: stations table

![image-3.png](attachment:image-3.png)

First is the stations table, which describes all the stations throughout Chicago, including each station's unique id, its name (usually the street intersection), its location as coordinates, and the number of docks available.

5. Divvy dataset: trips table

![image-4.png](attachment:image-4.png)

Second, we have the trips table. Each row in the table represents a trip from the first half of 2019. Each trip has a unique trip id and the id of the bike used. The travel time for each trip is captured in seconds. We also have the exact times, in Central Standard Time, at which the bike was checked out and checked back in, along with the name and id of the starting and ending stations. Divvy riders are either subscribers or non-subscribers, who are referred to simply as customers in the data. More is known about subscribers because of the ongoing relationship, namely birth year and gender.

6. Dimension and measure recap

![image-5.png](attachment:image-5.png)

From a data structure perspective, we have both categorical and numeric values. Having a mixture is important for creating a variety of visuals. Remember that Tableau organizes data into two main groups: Dimensions and Measures. Dimensions represent categorical or qualitative data and are color-coded blue. Measures represent numerical data that can be aggregated and are color-coded green. We can move fields between these two types, but we need to do it strategically. As we load any new data and begin analysis, an excellent first step is to make sure fields are placed in the right section. All numeric values, by default, arrive in the Measures section. If a numeric field shouldn't be aggregated, we move it to the Dimensions section. IDs are a great example of this, because it's meaningless to add up or average IDs.
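To make the distinction concrete outside Tableau, here is a minimal Python sketch. It is only an analogy, not how Tableau works internally, and the rows and field names (`trip_id`, `bike_id`, `tripduration`) are invented for illustration.

```python
from collections import Counter
from statistics import mean

# Hypothetical rows; field names and values are invented for illustration.
trips = [
    {"trip_id": 1001, "bike_id": 17, "tripduration": 420},
    {"trip_id": 1002, "bike_id": 17, "tripduration": 900},
    {"trip_id": 1003, "bike_id": 42, "tripduration": 300},
]

# Measure: aggregating it answers a real question (average trip length in seconds).
avg_duration = mean(t["tripduration"] for t in trips)

# Dimension: an id is a label, so it is used to group rows rather than being summed.
trips_per_bike = Counter(t["bike_id"] for t in trips)

# Summing ids runs without error, but the result describes nothing.
meaningless_sum = sum(t["bike_id"] for t in trips)

print(avg_duration)
print(dict(trips_per_bike))
print(meaningless_sum)
```

The id column is numeric in both uses, but only grouping by it yields something interpretable, which is the reason for moving such fields to Dimensions.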

7. Let's practice!

Alright, let's get started with some exercises!

**Exercise**

**Non-aggregating numerical dimensions**

When a numeric value is brought into a Tableau workbook, it's placed by default with the Measures in the Data pane. All Measures are aggregated (e.g., sum, average, count) when they are brought onto the canvas. Rather than taking extra steps to overcome this default behavior, we want to move numbers that are used as labels or descriptions to the Dimensions section of the Data pane.

Which of the following is an example of a numeric field that should be moved to the Dimensions section of the Data pane?

![image.png](attachment:image.png)

Perfect! Organizing Tableau's Data pane is always step one! Common examples of these kinds of numerical values are unique identifiers, values to be used as categories, and GPS coordinates.

### 1 02 Preparing data

1. Preparing the data

Alright, we have our newly arrived data here. As you can see, the Trips and Stations tables have been loaded in Tableau. Loading data is outside the scope of this course, so we won’t expand on that here. Under each table, its dimensions and measures are displayed. First things first, we need to look at what has landed in the dimensions and measures sections. Let’s make sure that our numeric values are fields that we would want to aggregate when we bring them onto the canvas. We have four measures here that we don’t want to aggregate by default. So we control-click birth year, bike ID, longitude, and latitude, and then slide them across the dividing line between dimensions and measures. They will be read as dimensions by default from now on, but we can always switch them back if needed.

Next, we’ll create a table to size up our data. First, I’ll add “From Station Name” to the sheet to get a list of all the station names. The “Add to sheet” functionality is the same as dragging the fields to the sheet. I’ll add measure values to the sheet to get a table quickly. Right away, notice that “Docks” and “Tripduration” are summed up by default. I want to know the average number of docks, so I’ll change that in the dropdown. We’ll do the same with tripduration to know the average length of a trip started at each station.

Tripduration is measured in seconds, which is not a useful unit for us to communicate in. So we’ll create a calculated field that converts tripduration to minutes. The quickest way to do this is to right-click on the field and use the Create menu from there. A dialog box appears with the field already populated inside. This is going to be my trip duration in minutes, so I’ll give it that name, and I’m simply going to divide it by 60. Then I’ll drag the new field into this box to get it into our table and again change the aggregation to average.

I’m also going to change the fit of the view to “Fit Width” to have my data fill up the space, which will make things easier to inspect. Looking at our new field, it is clearly being rounded. Any time we create a new field, we want to check its default properties to make sure that every time it’s on the canvas, it’s presented the same way. So within Default Properties, we select “Number Format” and, for this number, use a custom configuration. We’ll display the field with one decimal place and add a suffix for the unit, which we’ll call mins for short. Note the space before the suffix. And now it’s presented nicely in the viz.

Looking at my column headers, this field isn’t clear. Tableau automatically generates it based on the dataset file name, and it represents the number of rows or, in our case, trips. To make that clearer, I’m going to edit its alias to be Number of Trips.

Our final task here is to add a highlighter, which can be found in the Analysis menu at the top, under Highlighters. We see the fields available for highlighting, and we’re going to choose “From Station Name”. And now we have this box on the side. So, if I’m looking for a station, I can select it from the list to have it highlighted. If several stations share a common element in their names, they will show up like this. What’s great is that it highlights the rows in the context of everything else. It doesn’t reduce the rows as filters do. Alright, it’s now your turn!
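The minutes conversion and number formatting described above can be sketched in Python for readers who want to check the arithmetic. This is an illustrative analogue rather than Tableau's implementation. The station names, durations, and helper names (`to_minutes`, `format_minutes`) are hypothetical.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical sample rows; station names and durations (seconds) are invented.
trips = [
    {"from_station_name": "Station A", "tripduration": 390},
    {"from_station_name": "Station A", "tripduration": 810},
    {"from_station_name": "Station B", "tripduration": 1234},
]

def to_minutes(seconds: float) -> float:
    """Mirror the calculated field: tripduration divided by 60."""
    return seconds / 60

def format_minutes(value: float) -> str:
    """Mirror the custom number format: one decimal place, a space, then 'mins'."""
    return f"{value:.1f} mins"

# Collect per-trip minutes under each starting station.
minutes_by_station = defaultdict(list)
for trip in trips:
    minutes_by_station[trip["from_station_name"]].append(to_minutes(trip["tripduration"]))

# Average per station, like switching the aggregation from sum to average.
avg_minutes = {name: mean(values) for name, values in minutes_by_station.items()}

for name, value in avg_minutes.items():
    print(f"{name}: {format_minutes(value)}")
```

Keeping the conversion and the display format as separate steps mirrors the split in Tableau between the calculated field and its default number format: the stored value stays unrounded, and only the presentation changes.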

2. Let's practice!

### 1 03 Calculated Fields to extend data

1. Calculated Fields to extend data

We're going to continue to expand the columns by adding two calculated fields. As practice, we’re going to assume that we weren’t given the field tripduration. Let's use the drop-down caret at the top of the Data pane as another point of entry for creating calculated fields. This one brings up an empty dialog box to work with. We're going to create tripduration in hours. Inside this dialog, I'm going to use the function DATEDIFF, and you'll want to tab into the suggestions as they pop up. The date part is the segment of the date and time stamp we want to use. It can take many different values, from ‘year’, ‘quarter’, and ‘week’ down to ‘minute’. We want to extract hours. Note that we put the date part in single quotes. Then we take start time and compare it with end time. I’m selecting them from the list through tabbing. This calculation is valid, so I'm going to save it.

And I’m going to bring it into my list of measure values. Let’s change that aggregation from sum to average. Now, notice that a lot of them are zeroes because they're fractions of hours. That speaks to what we always need to do anyway, which is to change the default properties. We’ll use Number Format and then Number (Custom). We'll leave two decimal places in place and give it a suffix for the hour.

Now, in our second calculated field, we want to be able to see weekend versus weekday. Let’s call it weekday or weekend. We’re going to use the IF THEN ELSE logic structure along with DATEPART to test whether a given date is a Saturday or Sunday. To get this done, we’ll use DATEPART and ask it to extract weekday. Weekday here means day of the week, and it returns a value from 1 to 7, where 1 represents Sunday and 7 represents Saturday. So if the date is 1 or 7, we’ll have the column display “weekend”. Otherwise, we'll want it to say “weekday”. There are other ways to structure this formula, but I chose this way because it's easier to test for two day values than for five.

The calculation is valid, so now I’ll save this. It has popped up here with the other dimensions that can be used to slice and dice the data. Let’s see if this works by right-clicking on it and telling it to show the filter. It pops into my Filters shelf and also creates a legend over here in the right-hand pane. The final touch here is to add a customization to it, which is the Show Apply Button option. It gives me more time to change what's here and then click Apply, without the content of the visualization disappearing just because nothing is selected. It's a smoother transition for your users, and it’s more professional. Ok, time for you to try it out!
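The logic of the two calculated fields can be approximated in Python as a rough sketch. It is an analogue, not Tableau code, and the function names are hypothetical. One assumption to note: Tableau's DATEDIFF returns whole units of the chosen date part, whereas `duration_hours` below keeps the fractional part. The weekday helper shifts Python's numbering (1 = Monday) onto the 1 = Sunday through 7 = Saturday scale described in the lesson.

```python
from datetime import datetime

def duration_hours(start: datetime, end: datetime) -> float:
    """Elapsed time between check-out and check-in, in fractional hours."""
    return (end - start).total_seconds() / 3600

def tableau_weekday(moment: datetime) -> int:
    """Day-of-week number on the 1 = Sunday ... 7 = Saturday scale from the lesson.

    Python's isoweekday() uses 1 = Monday ... 7 = Sunday, so it is shifted here.
    """
    return moment.isoweekday() % 7 + 1

def weekday_or_weekend(moment: datetime) -> str:
    """Label Saturday (7) and Sunday (1) as 'weekend', everything else as 'weekday'."""
    return "weekend" if tableau_weekday(moment) in (1, 7) else "weekday"

# Hypothetical trip: checked out and returned on Saturday, 5 January 2019.
start = datetime(2019, 1, 5, 10, 0)
end = datetime(2019, 1, 5, 10, 45)

print(f"{duration_hours(start, end):.2f} hr")
print(weekday_or_weekend(start))
```

As in the lesson, the test checks for the two weekend values rather than the five weekday values, which keeps the condition short.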

2. Let's practice!

## <a id="2"></a>
<font color="lightseagreen" size=+2.5><b>2. Exploring Visualizations</b></font>

<a href="#toc" class="btn btn-primary btn-sm" role="button" aria-pressed="true" style="color:white" data-toggle="popover">Table of Contents</a>

In this chapter, you’ll get to know more about Divvy users. Learning about them provides valuable information toward successful and continued engagement. To do this, you’ll build bar charts, KPI charts, and histograms with variable bin width. You’ll create additional insights by adding filters to your visualizations.

## <a id="3"></a>
<font color="lightseagreen" size=+2.5><b>3. Mapping Analysis</b></font>

<a href="#toc" class="btn btn-primary btn-sm" role="button" aria-pressed="true" style="color:white" data-toggle="popover">Table of Contents</a>

Dive deeper into maps and elevate geographic data with layers and density. You'll customize Tooltips and learn how to embed them with more information. You'll also learn more about Quick Table Calculations and apply them to your visualizations.

## <a id="4"></a>
<font color="lightseagreen" size=+2.5><b>4. Groups, Sets, and Parameters</b></font>

<a href="#toc" class="btn btn-primary btn-sm" role="button" aria-pressed="true" style="color:white" data-toggle="popover">Table of Contents</a>

Expand your Tableau toolbox with groups, sets, and parameters. You'll create groups using Lasso selection and Calculated Fields. You'll also use Parameters to enable users to dynamically input changes to your visualizations. Finally, you'll create Sets and compare your findings to an external weather data source.