# Introducing Tableau Prep Builder
* Have you ever been deep in analysis and then realized that your data is "dirty" with inconsistencies and needs to be cleaned before you can continue?
* Or are you trying to publish data for others and want to reshape it first so it will be easier for them to analyze?

Tableau Prep Builder is a visual data preparation tool you can use to clean and prepare data for these situations and more. This module
describes Tableau Prep Builder and explains why you'd use it and how best to use it with other Tableau products.

**Objectives**
At the end of this module, you will be able to:
* Define and describe the purpose of data preparation
* Describe how data preparation fits into the analytic process
* Identify the major benefits of Tableau Prep Builder
* Describe how Tableau Prep Builder is integrated with other Tableau products
* Summarize when to use Tableau Prep Builder as opposed to Tableau Desktop
* Identify some common roles/personas who would use Tableau Prep Builder
* Describe examples of how Tableau Prep Builder is used by people in these roles
* Describe typical scenarios where you could use Tableau Prep Builder


## Iterative and non-linear
The analytics cycle is not a linear progression from one stage to the next. It may occasionally work that way, but in
general the process is iterative. You can jump back and forth among the stages in the cycle. For example, your
exploration of the initial question may lead to follow-up questions, and your exploration of the data may lead you to other lines of questioning
within the same data set.

# The Data Preparation Process
See an example of when to use the three main process steps

## Connecting to Data

## Cleaning the Data
Examine and edit data values and save the flow

## Outputting the Cleaned Data
Add an output step to a flow and use it to create an output file

## Using the Output File as a Data Source
Connect to the output file and build a viz with the cleaned data

# Connecting to and Configuring Data
To use Tableau Prep Builder to clean and prepare your data, you first create a flow by connecting to your data. Tableau Prep Builder
supports connections to many popular data types, such as text and Microsoft Excel files, Microsoft SQL Server and Google BigQuery
databases, Tableau data extracts, and Tableau published data sources. In this module you'll learn how to connect to your data and then configure
the connection to maximize flow performance.

## Tableau published data sources
Starting with version 2019.3, Tableau Prep Builder can connect to data sources published to Tableau Server and Tableau Online.
The Data Management add-on, which requires a separate license, must be enabled on Tableau Server or Tableau Online.

You connect to Tableau published data sources using the Tableau Server connector, which lets you use them as input data
sources for your flows. Using data sources on your organization's Tableau Server or Tableau Online helps to ensure you're
using data sources that are secure, standardized, and centrally managed by your organization, such as by data stewards or
database administrators.

## Run an Initial SQL command when connecting to a data source
You can use Initial SQL to query your connections using Tableau Prep Builder version 2019.2.2 and later. This enables you to
specify an Initial SQL command to run when connecting to databases that support it.

For example, you can enter a SQL statement to apply a filter when connecting to the database. Tableau Prep Builder applies the SQL
command before it samples the data, reducing the size of the data set loaded into Tableau Prep Builder and thereby increasing
performance.
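The effect of filtering at connection time can be sketched outside Tableau Prep Builder. The following is a minimal Python analogy using the standard `sqlite3` module, not Tableau Prep Builder's own mechanism; the `flights` table, its columns, and the `recent_flights` name are hypothetical and chosen only for illustration.

```python
import sqlite3

# Hypothetical in-memory table standing in for a real database connection.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE flights (carrier TEXT, year INTEGER, delay_minutes INTEGER)"
)
conn.executemany(
    "INSERT INTO flights VALUES (?, ?, ?)",
    [("AA", 2018, 12), ("AA", 2019, 5), ("DL", 2019, 30), ("UA", 2017, 0)],
)

# This statement plays the role of a connection-time filter: it builds a
# smaller temporary table before any rows are sampled or loaded.
conn.execute(
    "CREATE TEMP TABLE recent_flights AS SELECT * FROM flights WHERE year >= 2019"
)

total_rows = conn.execute("SELECT COUNT(*) FROM flights").fetchone()[0]
loaded_rows = conn.execute("SELECT * FROM recent_flights").fetchall()

# Only the filtered rows are brought into the client for further work.
print(total_rows, len(loaded_rows))
```

Because the filter runs in the database, the client only ever receives the reduced set of rows, which is the same reason a filtering Initial SQL command can improve flow performance.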

Starting with Tableau Prep Builder version 2020.1.3, you can also include parameters to pass data such as application name,
version, and flow name. This helps you keep track of the data when you query your data source.

## Use the Data Interpreter on Excel files
You can use the Data Interpreter in Tableau Prep Builder on Microsoft Excel files that contain specific types of formatting
such as merged cells, empty columns or rows, and sub-tables, that are problematic for clean data input.

Data Interpreter detects these types of formatting and handles them correctly, which helps you start your flows with cleaner data in the
input steps, making it easier to complete them. The following is an Excel file with some of this formatting:

* Field Names
* Merged Cells

## Explore data input options
When you select an Input step in the Flow pane, such as **Navy** in the following image, you can configure what data from the step is
brought into the flow using the metadata profile and options in the **Input** pane.

For example, you may not need all of the fields from a table in the output of your flow. A smaller amount of data
is easier to work with and improves the performance of a flow. You may also want to change some of the field names
so they better describe the values in the fields.
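As a rough illustration of the idea (keep only the fields you need and give them clearer names), here is a minimal Python sketch. It is an analogy for what you configure in the Input pane, not Tableau Prep Builder code; the field names and the `configure_input` helper are hypothetical.

```python
# Hypothetical input rows; the field names are illustrative only.
rows = [
    {"cust_nm": "Ana", "rgn_cd": "W", "internal_id": 101, "sales": 250.0},
    {"cust_nm": "Ben", "rgn_cd": "E", "internal_id": 102, "sales": 90.5},
]

# Fields to keep, mapped to more descriptive names.
# "internal_id" is left out, so it is dropped from the result.
field_map = {"cust_nm": "Customer Name", "rgn_cd": "Region", "sales": "Sales"}


def configure_input(rows, field_map):
    """Keep only the mapped fields and rename them."""
    return [{new: row[old] for old, new in field_map.items()} for row in rows]


trimmed = configure_input(rows, field_map)
print(trimmed[0])
```

Dropping unneeded fields at the start means every later step handles less data, and the renamed fields carry their meaning through the rest of the flow.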


# Using the Profile Pane to Examine Data
Explore the data structure and study field values

## Diving Deeper: Distributions and Highlighting
Study distributions and highlight values to uncover data issues

## Diving Deeper: Searching and Sorting
Use search and sort options to investigate anomalies and uncover patterns


# Cleaning Data with One-Click Operations
Let's say you've examined your data in Prep Builder, using techniques such as studying data distributions to find data
issues. Now that you've found issues, you're ready to start cleaning data.

You may wonder what options Tableau Prep Builder offers to clean specific data issues. For example:
* Do you have a field that is not relevant to your analysis that you would like to remove?
* Would you like to rename a field so it makes sense within the context of your department?
* Do you have field values that can be split up to improve analysis?
* Do field values have data inconsistencies, such as extra spaces or punctuation, that you need to clean?
* How do you keep track of all the data cleaning changes that you make?

Tableau Prep Builder provides many one-click operations, including fixes for common field value inconsistencies, that help
you address issues such as these. It records all cleaning operations so you can keep track of the changes you're making to the data.

By cleaning up data issues, you make the data easier to work with and analyze. This also benefits others with whom you share data sets.

## Applying cleaning operations to fields
When examining your data using techniques such as exploring data distribution and data composition, you have likely uncovered cleaning
issues that will need to be addressed. You can apply cleaning operations to fields and their values to prepare the data for analysis.

In this lesson we will take a look at the following cleaning operations:
* Splitting field values: automatic and custom
* Removing fields
* Duplicating fields
* Renaming fields
* Keep Only fields
* Changing data types
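To make a few of these operations concrete, the sketch below shows what splitting a field value, changing a data type, and removing a field amount to when written out in plain Python. It is only an analogy for the one-click operations; the `route`, `flight_date`, and `notes` fields and the `clean_record` helper are hypothetical.

```python
from datetime import date

# Hypothetical records; field names and values are illustrative only.
records = [
    {"route": "SEA-LAX", "flight_date": "2019-03-14", "notes": "n/a"},
    {"route": "JFK-ORD", "flight_date": "2019-04-02", "notes": "n/a"},
]


def clean_record(record):
    """Apply a split, a data type change, and a field removal to one record."""
    # Split: one "route" value becomes two separate fields.
    origin, destination = record["route"].split("-", 1)
    return {
        "origin": origin,
        "destination": destination,
        # Change data type: string to date.
        "flight_date": date.fromisoformat(record["flight_date"]),
        # Remove field: "notes" is simply not carried over.
    }


cleaned = [clean_record(r) for r in records]
print(cleaned[0])
```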

## Performing quick cleaning string operations
Values stored in string fields often contain data inconsistencies that can affect your visualizations. Extra spaces, capitalization,
and punctuation are a few examples that can cause duplication in your data. Tableau Prep Builder provides one-click string cleaning
operations, so you can clean these issues up quickly.
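The following minimal Python sketch illustrates why these inconsistencies cause duplication and how trimming spaces, removing punctuation, and normalizing capitalization resolve it. It is an illustration of the concept, not how Tableau Prep Builder implements its operations; the sample values and the `quick_clean` helper are hypothetical.

```python
import string
from collections import Counter

# Hypothetical city values with extra spaces, mixed case, and punctuation.
raw_values = ["  Seattle", "seattle ", "SEATTLE.", "Portland", "portland,"]


def quick_clean(value):
    """Trim spaces, remove punctuation, and normalize capitalization."""
    value = value.strip()
    value = value.translate(str.maketrans("", "", string.punctuation))
    return value.title()


before = Counter(raw_values)
after = Counter(quick_clean(v) for v in raw_values)

# Number of distinct values before and after cleaning.
print(len(before), len(after))
```

Before cleaning, every spelling variant counts as its own value; after cleaning, the variants collapse into the two cities they actually represent.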

### Clean menu operations
## Review recorded changes
Every cleaning operation that you apply in Tableau Prep Builder is recorded in the Changes pane, to the left of the Profile pane.




# Validating Your Data
What does it mean to validate your data? Imagine that you have a data set and you want to know if it will answer your
analytic questions. Data validation will tell you.

Or, say that you have performed data preparation operations on that data, and you want to verify the success of those changes as well
as to investigate the results. That is also data validation.

Next you may wonder how to validate your data. For example:
* How do you incorporate data validation into an iterative analytic process?
* How do you use Tableau Prep Builder to verify operations applied during your data preparation?
* How can you compare your data in Tableau Desktop before and after applying changes?
* Now that you've applied changes to your data, how can you see if it's possible to create meaningful visualizations with it in
Tableau Desktop?

This module will demonstrate how to use Tableau Prep Builder to validate that your data meets your needs, as well as to
validate that the results are as you intended.

## Use the data grid to support data validation
Validating your data is a necessary part of the iterative analytic process because it allows you to see
if you have the needed fields to complete your analysis. It also allows you to verify that your data preparation operations
have been successfully applied.
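Both kinds of check (are the needed fields present, and did the cleaning operations take effect) can be expressed as a short Python sketch. This is an analogy for what you inspect visually in the data grid, assuming hypothetical flight rows and a hypothetical `validate` helper.

```python
def validate(rows, required_fields):
    """Return a list of problems found; an empty list means the checks passed."""
    problems = []
    for i, row in enumerate(rows):
        # Check 1: are the fields needed for the analysis present?
        missing = [f for f in required_fields if f not in row]
        if missing:
            problems.append(f"row {i}: missing fields {missing}")
        # Check 2: was the space-trimming operation applied everywhere?
        for field, value in row.items():
            if isinstance(value, str) and value != value.strip():
                problems.append(f"row {i}: {field!r} still has extra spaces")
    return problems


# Hypothetical flight rows after some cleaning steps.
flight_rows = [
    {"carrier": "AA", "origin": "SEA", "delay_minutes": 12},
    {"carrier": "DL ", "origin": "JFK"},
]

issues = validate(flight_rows, ["carrier", "origin", "delay_minutes"])
for issue in issues:
    print(issue)
```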

You'll use the data grid to perform these validation tasks as we take a look at some flight data to which some cleaning operations
have already been applied.

## Preview in Desktop to support data validation
Another way to validate your data is to preview it in Tableau Desktop. You can do this directly within Tableau Prep Builder.
When you select a point in your flow to preview in Tableau Desktop, the data preview will reflect all operations you applied
to the data up to and including that point in your data preparation.

## Try it! Preview your data
Now is your opportunity to apply what you've learned about how to validate your data by previewing it in Tableau Desktop.
The guided activity below will prompt you to take specific actions and will provide feedback and guidance along the way.
