---
title: Lesson 1. Getting Started
subject: Tutorial
format:
  html:
    toc: true
    toc-expand: 2
    toc-title: CONTENTS
bibliography:
  - references.bib
---

This lesson provides an introduction to the Tableau workspace and explores effective strategies for finding and critically assessing publicly available data. Readings also offer guidance on understanding audience needs prior to data visualization and outline key principles for creating more impactful visual representations.

## Data skills | concepts
- Tableau
- Finding data
- Reading data

## Learning objectives
1. Apply the __[DRAMA Framework](https://www.librarianyarns.com/drama-source-eval#:~:text=DRAMA%20stands%20for%20Date%2C%20Relevance,in%20a%20one%20shot%20session.)__ to critically evaluate a dataset.
2. Navigate the Tableau **start page**, **data source page**, and **workspace**.

This tutorial is designed to support a multi-session __[Tableau for Research](https://library.osu.edu/events?combine=&tid=All&field_location_code_value=10&sort_bef_combine=field_end_date_value_ASC)__ workshop hosted by The Ohio State University Libraries Research Commons. It is intended to help the **ABSOLUTE** beginner, or anyone who is relatively new to Tableau, build the skills and confidence to apply Tableau to research projects.


# LESSON 1

# Finding and/or obtaining data

::: {.p-4}
::: {.grid .g-4}
::: {.g-col-12 .g-col-md-6 .g-col-lg-4}
<div class="card d-flex flex-column h-100 text-center">
  <div class="card-body d-flex flex-column justify-content-center align-items-center"><img src="images/DataOhio-logo-hiRes.png" class="card-img-top p-3" alt="DataOhio logo"></div>
  <div class="card-footer text-muted"><a href="https://data.ohio.gov/">data.ohio.gov</a>
  </div>
</div>

:::

::: {.g-col-12 .g-col-md-6 .g-col-lg-4}
<div class="card d-flex flex-column h-100 text-center">
  <div class="card-body d-flex flex-column justify-content-center align-items-center"><img src="images/data_gov_logo.png" class="card-img-top p-3" alt="data.gov logo">
  </div>
  <div class="card-footer text-muted"><a href="https://data.gov/">data.gov</a>
  </div>
</div>

:::

::: {.g-col-12 .g-col-md-6 .g-col-lg-4}
<div class="card d-flex flex-column h-100 text-center">
  <div class="card-body d-flex flex-column justify-content-center align-items-center"><img src="images/Wikipedia-logo-v2-en.svg" class="card-img-top p-3" alt="Wikipedia logo"></div>
  <div class="card-footer text-muted"><a href="https://www.wikipedia.org/">Wikipedia</a>
  </div>
</div>

:::

:::
:::

Data is everywhere. Data for examples and activities in this tutorial was gathered from __[DataOhio](https://data.ohio.gov/)__, __[data.gov](https://data.gov/)__, and __[Wikipedia](https://www.wikipedia.org/)__. Open data, defined as any data that "can be freely used, re-used, and redistributed by anyone, subject only, at most, to the requirement to attribute and share alike," serves as a valuable resource for developing data literacy [@open_data_handbook]. While government agencies are primary producers of open data, it is also commonly shared by nonprofit organizations, independent researchers, and data enthusiasts. Engaging with datasets that hold personal relevance can significantly enhance the learning experience, particularly when developing skills in data analysis and visualization. You are encouraged to download a dataset that is personally meaningful to you to use with the practice activities provided in this tutorial.

University Libraries offers several guides to help you __[Find Data](https://guides.osu.edu/data/home)__. You can also talk with a __[librarian](https://library.osu.edu/unit/research-commons/meet-the-team)__ about your unique research data needs. Collecting research data on your own requires an understanding of acceptable methods for rigorously and ethically collecting data for your discipline. You can find current methodology for your discipline by talking to experts in your field, __[searching the literature](https://search.library.osu.edu/discovery/dbsearch?vid=01OHIOLINK_OSU:OSU&lang=en)__, and finding ebooks, journals, and more in the __[library catalog](https://library.ohio-state.edu/)__.

If you need data for a project, chances are it has already been collected by a federal, state, or local governmental entity. Major publishers of U.S. government data include:

- __[Bureau of Economic Analysis](https://www.bea.gov/)__
- __[Bureau of Justice Statistics](https://bjs.ojp.gov/)__
- __[Bureau of Labor Statistics](https://www.bls.gov/)__
- __[Bureau of Transportation Statistics](https://www.bts.gov/)__
- __[Census Bureau](https://www.census.gov/)__
- __[Economic Research Service of the Department of Agriculture](https://www.ers.usda.gov/)__
- __[Energy Information Administration](https://www.eia.gov/)__
- __[National Center for Education Statistics](https://nces.ed.gov/)__
- __[National Center for Health Statistics](https://www.cdc.gov/nchs/index.html?CDC_AA_refVal=https%3A%2F%2Fwww.cdc.gov%2Fnchs%2Findex.htm)__
- __[National Center for Science and Engineering Statistics](https://ncses.nsf.gov/)__
- __[Office of Research, Evaluation and Statistics of the Social Security Administration](https://www.ssa.gov/policy/about/ORES.html)__
- __[Statistics of Income Division of the Internal Revenue Service](https://www.irs.gov/statistics/soi-tax-stats-statistics-of-income)__


# Reading data
Reading data effectively requires us to slow down and critically evaluate a dataset to understand its origins, who the dataset was created by and why, how the data was collected, and more. Data is not inherently neutral and context matters. Data analysts have a responsibility to recognize and disclose potential data biases that may distort visualizations and lead to misleading interpretations or narratives. A thorough examination of the dataset's structure, including its dimensions (categorical variables) and measures (quantitative variables), together with a clear understanding of how these elements are defined, is fundamental to responsible and accurate data analysis.

## The DRAMA framework
The **DRAMA Framework** is a helpful tool for critically evaluating data sources. [@primeau]

<div class="list-group">
  <a href="https://www.librarianyarns.com/drama-source-eval" class="list-group-item list-group-item-action flex-column align-items-start active"><div class="d-flex w-100 justify-content-between"><h5 class="mb-1">DRAMA Framework</h5></div>
  </a>
  <a href="https://www.librarianyarns.com/drama-source-eval" class="list-group-item list-group-item-action flex-column align-items-start"><div class="d-flex w-100 justify-content-between"><p class="mb-1"><strong>D</strong>ate</p></div><small>When was the data last updated? Is it current? Does it reflect current trends?</small>
  </a>
  <a href="https://www.librarianyarns.com/drama-source-eval" class="list-group-item list-group-item-action flex-column align-items-start"><div class="d-flex w-100 justify-content-between"><p class="mb-1"><strong>R</strong>elevance</p></div><small>What procedures were used to collect the data? Is the data relevant to my research project? Did sampling procedures target the right audience or population? What was the context for collecting the data? Is there a description of the data set and what data it does and does not contain?</small>
  </a>
  <a href="https://www.librarianyarns.com/drama-source-eval" class="list-group-item list-group-item-action flex-column align-items-start"><div class="d-flex w-100 justify-content-between"><p class="mb-1"><strong>A</strong>ccuracy</p></div><small>Is the data reliable? Valid? Were procedures for gathering the data followed consistently?</small>
  </a>
  <a href="https://www.librarianyarns.com/drama-source-eval" class="list-group-item list-group-item-action flex-column align-items-start"><div class="d-flex w-100 justify-content-between"><p class="mb-1"><strong>M</strong>otivation</p></div><small>Why was the data collected? Are there any potential biases in the data? Was any relevant data not included in the dataset? If yes, was this disclosed?</small>
  </a>
  <a href="https://www.librarianyarns.com/drama-source-eval" class="list-group-item list-group-item-action flex-column align-items-start"><div class="d-flex w-100 justify-content-between"><p class="mb-1"><strong>A</strong>uthority</p></div><small>Who collected the data? An individual, a government agency, a business, or a political action committee? Are they credible?</small>
  </a>
</div>

# The Tableau Environment

<div class="alert alert-dismissible alert-info">
  <button type="button" class="btn-close" data-bs-dismiss="alert"></button>
  <h4 class="alert-heading"><img src="images/document_pencil_standard_icon.png" alt="" aria-hidden="true" style="height: 3rem; vertical-align: middle; margin-right: 0.5rem;">Note:</h4>
  <p>This tutorial is designed to support multi-session <a href="https://library.osu.edu/events?combine=&tid=All&field_location_code_value=10&sort_bef_combine=field_end_date_value_ASC">workshops</a> hosted by The Ohio State University Libraries Research Commons.</p>
  <p>Since the Tableau for Research workshop takes place in the Research Commons computer lab, the examples provided here use the <strong>Tableau Desktop</strong> interface. Please note that this interface may look slightly different from the <strong>Tableau Desktop Public Edition</strong>.</p>
</div>


## The Start page

When you open Tableau Desktop, you'll land on the blue **Start Page**. Here's what you'll find:

- **Connect (Left Pane):** Connect to your data sources. Connections to flat files, such as .xlsx, .csv, and .json documents are listed on the top. Direct connections to tables hosted on servers are listed below.
- **Center Pane:** Open recently used workbooks.
- **Discover (Right Pane):** Learn more about Tableau.

![](images/tableau_desktop_start.png "Tableau Desktop Start Page")

With each new release, Tableau introduces new features and improvements. The **Discover** section is a valuable resource for staying informed and seeing how these features may support your ability to work with, analyze, and argue with data.



### Make Learning Meaningful!!! 🌟

Let's take a moment here to emphasize something important:

When learning data analysis and visualization, it's incredibly helpful to work with **data that matters to you**. 

- 🎵 **Love music?** Try using a Wikipedia table listing albums or songs by your favorite artist.[^1]
- 🍽️ **Foodie at heart?** Explore recipes using the __[TheMealDB](https://www.themealdb.com/)__ API.
- 🏀 **Into sports?** Check out the curated __[Sports Data Sets](https://sportsandsociety.osu.edu/sports-data-sets)__ from The Ohio State University Sports and Society Initiative.

[^1]: Visit the __[Websites and APIs. Lesson 3. Wikipedia](https://osu-libraries-research-services.github.io/data_visualization/wikipedia)__ tutorial to learn how to extract tables from HTML using `pandas.read_html`. See the __[Websites and APIs. Lesson 4. iCite](https://osu-libraries-research-services.github.io/data_visualization/icite)__ tutorial and __[Websites and APIs. Lesson 7. Crossref](https://osu-libraries-research-services.github.io/data_visualization/crossref)__ tutorial to learn how to use APIs to gather data.

Working with familiar or interesting data makes the learning process more engaging and more fun!

<div class="alert alert-dismissible alert-primary">
  <button type="button" class="btn-close" data-bs-dismiss="alert"></button>
  <h4 class="alert-heading"><img src="images/star_standard_icon.png" alt="" aria-hidden="true" style="height: 3rem; vertical-align: middle; margin-right: 0.5rem;">Important!</h4><p><strong>Always review the copyright and terms of use</strong> before sourcing data from any website.</p><p>Limited use of copyrighted materials is allowed under certain conditions for journalism, scholarship, and teaching. <a href="https://library.osu.edu/copyright/fair-use">Use the Resources for determining fair use</a> to verify your project is within the scope of fair use. Contact University Libraries <a href="https://library.osu.edu/copyright">Copyright Services</a> if you have any questions.</p> 
</div>


## The Data Source page

To get started with the Tableau **Data Source** page, we'll use the **Performers** table from the Wikipedia page on __[Rock and Roll Hall of Fame inductees](https://en.wikipedia.org/wiki/List_of_Rock_and_Roll_Hall_of_Fame_inductees)__. The **Performers** category honors recording artists and bands who have had a significant and lasting impact on the development and legacy of rock and roll.

That said, you're encouraged to use a dataset that's personally meaningful to you! Feel free to substitute the example with your own data as you work through the practice activities in this tutorial.

To connect to `rock_n_roll_performers.csv` in your workshop materials, begin on Tableau's **Start Page**:

1. Go to the **Connect** pane on the left.
2. Select **Text file**.
3. Navigate to and open the CSV file.

This opens the **Data Source Page**. 

<video src="videos/tableau_data_source.mp4" 
    autoplay 
    muted 
    loop 
    playsinline 
    style="max-width: 100%; border-radius: 8px; padding: 1rem;">
</video>



On the **Data Source** page you'll find:

The **Connections** pane on the left. Tableau Desktop allows multiple connections, which can be joined or related using common fields. For this tutorial, we will keep things simple and use only the `rock_n_roll_performers.csv` file.

The **Canvas** in the top center. The canvas displays the `rock_n_roll_performers.csv` file in a rectangle. Right-click on this rectangle or use the ▼ caret to:

- [ ] Rename tables
- [ ] Join and relate tables
- [ ] Apply pre-filters to your data

The **Metadata Grid** in the bottom center. Here the data headers are displayed as rows. This feature is particularly helpful when you connect to a dataset with multiple tables and fields. The metadata grid allows you to:

- [ ] Understand your data structure
- [ ] Change data types
- [ ] Hide unnecessary fields not required for your analysis

The **Data Grid** on the bottom right. The data grid shows the first 1,000 rows of data in your data source. In the data grid you can:

- [ ] Pivot data
- [ ] Create groups and bins
- [ ] Build calculated fields

### Dimensions vs. measures - Part 1

#### What is a measure?

Think of a measure as a variable used for math. Measures represent our quantitative data or units of measure.

#### What is a dimension?

Dimensions represent the qualitative data used to segment the measures.

#### Dimension or measure?

Dimensions sometimes function as measures. Age, for example, can be used to categorize data, or as a measure in a calculation. Measures sometimes function as dimensions. An identification number, for example, can represent a person, yet consist of all numbers. If the identification number is added sequentially to a database, it may also be used in a calculation.

![Dimensions and Measures](images/dimensions_measures_fruit.png "example showing apple, banana, pear, and kiwi as a dimension in the fruit column and the number of apples, bananas, pears, and kiwi in the number of fruit column"){#fig-figure1}
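
The distinction can also be sketched outside Tableau. The short Python example below is only an illustration, not a description of how Tableau classifies fields: it assumes a simple rule (a column whose values all parse as numbers is treated as a measure, everything else as a dimension) and uses made-up fruit counts. The names `SAMPLE_CSV`, `is_number`, and `classify_fields` are hypothetical.

```python
import csv
import io

# Hypothetical sample rows modeled on the fruit example; the counts are made up.
SAMPLE_CSV = """fruit,number_of_fruit
apple,4
banana,6
pear,2
kiwi,3
"""


def is_number(text):
    """Return True when the text can be parsed as a number."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def classify_fields(csv_text):
    """Label each column 'measure' if all non-empty values are numeric, else 'dimension'.

    This rule is an assumption made for illustration only.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    roles = {}
    for field in reader.fieldnames or []:
        values = [row[field] for row in rows if row[field] != ""]
        all_numeric = bool(values) and all(is_number(v) for v in values)
        roles[field] = "measure" if all_numeric else "dimension"
    return roles


roles = classify_fields(SAMPLE_CSV)
print(roles)
```

A rule this simple would label an identification number as a measure, which is exactly the ambiguity described above; deciding how a field should be used remains a judgment call for the analyst.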


### Change data type - Option 1
The metadata grid for the `rock_n_roll_performers.csv` dataset primarily shows dimensions. The **Index** field represents an identification number and can serve as a dimension or measure. The **Year** field is currently recognized as a whole number. To change the data type of the **Year** field:

1. Locate the **Year** field in the metadata grid.
2. Right-click on the **number icon** before the field name.
3. Select **Date & Time** from the menu.

|                                                         |         **Type**         | **Field Name**                 | **Physical Table**         | **Remote Field Name**       |
|:---------------------------------------------------------|:--------------------------|:--------------------------------|:----------------------------|:-----------------------------|
| ![](images/numeric_icon.png "Decorative # icon")        | Number (whole)           | Index                          | rock_n_roll_performers.csv | index                       |
| ![](images/symbol_datetime.png "Decorative date and time icon")   | Date & Time              | Year                           | rock_n_roll_performers.csv | year                        |
| ![](images/abc_icon.png "Decorative abc icon")          | String                   | Image                          | rock_n_roll_performers.csv | image                       |
| ![](images/abc_icon.png "Decorative abc icon")          | String                   | Name                           | rock_n_roll_performers.csv | name                        |
| ![](images/abc_icon.png "Decorative abc icon")          | String                   | Inducted Members               | rock_n_roll_performers.csv | inducted_members            |
| ![](images/abc_icon.png "Decorative abc icon")          | String                   | Prior Nominations              | rock_n_roll_performers.csv | prior_nominations           |
| ![](images/abc_icon.png "Decorative abc icon")          | String                   | Induction Presenter            | rock_n_roll_performers.csv | induction_presenter         |
| ![](images/abc_icon.png "Decorative abc icon")          | String                   | Artist                         | rock_n_roll_performers.csv | artist                      |
| ![](images/abc_icon.png "Decorative abc icon")          | String                   | Image Url                      | rock_n_roll_performers.csv | image_url                   |
| ![](images/abc_icon.png "Decorative abc icon")          | String                   | Artist Url                     | rock_n_roll_performers.csv | artist_url                  |
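
To make the idea of a data type change concrete, here is a minimal Python sketch of treating a whole-number year as a date. It does not describe how Tableau converts the field internally; it assumes one common convention (a year maps to January 1 of that year), and both the `year_to_date` helper and the sample year values are hypothetical.

```python
from datetime import date


def year_to_date(year_value):
    """Interpret a whole-number year as January 1 of that year (an assumed convention)."""
    return date(int(year_value), 1, 1)


# Hypothetical year values standing in for the Year column of a CSV file,
# where every value is initially read as text or as a whole number.
years = ["1986", "1987", "2024"]
dates = [year_to_date(y) for y in years]

print([d.isoformat() for d in dates])
```

Once the values are dates rather than plain numbers, they can be sorted and grouped as points in time instead of being summed or averaged.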



### Wide vs. tall data

To analyze or visualize data efficiently and effectively, we sometimes must restructure it from **wide** to **tall** format. In @fig-figure1 above, each individual is represented by a single row, with separate columns for name, age, and number of visits. To transform this data into **tall** format, each individual would have a row for each measure. This structure includes:

- A column for the **measure name** (e.g., "Age", "Visits")
- A column for the **measure values** (e.g., 34, 5)

![Tall data](images/tall_format.png "example showing data from figure 1 reconfigured in tall format"){#fig-figure2}

For many Tableau projects, converting data to a **tall** format can enhance analysis and visualization.
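
The wide-to-tall reshaping described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not Tableau's pivot implementation: the two sample individuals are invented, and the `wide_to_tall` helper and the output column names `measure_name` and `measure_value` are hypothetical.

```python
# Hypothetical wide-format rows: one row per individual, one column per measure.
wide_rows = [
    {"name": "Ana", "Age": 34, "Visits": 5},
    {"name": "Ben", "Age": 27, "Visits": 2},
]


def wide_to_tall(rows, id_field, measure_fields):
    """Return one output row per (individual, measure) pair."""
    tall = []
    for row in rows:
        for measure in measure_fields:
            tall.append(
                {
                    id_field: row[id_field],
                    "measure_name": measure,        # e.g. "Age" or "Visits"
                    "measure_value": row[measure],  # e.g. 34 or 5
                }
            )
    return tall


tall_rows = wide_to_tall(wide_rows, "name", ["Age", "Visits"])
for tall_row in tall_rows:
    print(tall_row)
```

Two wide rows with two measures each become four tall rows, which mirrors the trade described in this section: fewer columns, more rows.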

<div class="accordion" id="accordionExample">

  <div class="accordion-item"><h2 class="accordion-header" id="headingOne"><button class="accordion-button fs-3" type="button" data-bs-toggle="collapse" data-bs-target="#collapseOne" aria-expanded="true" aria-controls="collapseOne"><img src="images/guidepost_standard_icon.png" alt="" aria-hidden="true" style="height: 3rem; vertical-align: middle; margin-right: 0.5rem;">Exercise 1. Transform data to a tall format</button></h2><div id="collapseOne" class="accordion-collapse collapse show fs-4" aria-labelledby="headingOne" data-bs-parent="#accordionExample"><div class="accordion-body fs-4"><p>Try transforming the <strong>rock_n_roll_performers.csv</strong> dataset from <strong>wide</strong> to <strong>tall</strong> format using Tableau's pivot feature.</p><ol><li>In the <strong>data grid</strong>, click to highlight the <strong>Year</strong> column.</li><li>Hold the <strong>Shift</strong> key, scroll to the right, and click to highlight the <strong>Artist Url</strong> column.</li><li>Click the ▼ caret on the right side of the <strong>Artist Url</strong> column and select <strong>Pivot</strong>.</li><li>Tableau will transform the selected columns from a <strong>wide</strong> format (many columns) to <strong>tall</strong> format (fewer columns, more rows). ✅ This is useful for reshaping data to make it easier to analyze or visualize in Tableau.</li><li>To undo the pivot, press <strong>Ctrl+Z</strong> (Windows) or <strong>Command+Z</strong> (Mac).</li></ol>
  </div></div>
  </div>

  <div class="accordion-item"><h2 class="accordion-header" id="headingTwo"><button class="accordion-button fs-3 collapsed" type="button" data-bs-toggle="collapse" data-bs-target="#collapseTwo" aria-expanded="false" aria-controls="collapseTwo"><img src="images/magnifying_glass_standard_icon.png" alt="" aria-hidden="true" style="height: 3rem; vertical-align: middle; margin-right: 0.5rem;">Solution:</button></h2><div id="collapseTwo" class="accordion-collapse collapse" aria-labelledby="headingTwo" data-bs-parent="#accordionExample"> <div class="accordion-body"><img src="images/tableau_desktop_data_source_window_pivot.png" alt="data source window showing all fields except index pivoted." style="max-width: 100%; border-radius: 8px; padding: 1rem;"></div> 
  </div>
  </div>

</div>

## The Tableau Workspace

To access the Tableau workspace, select the **Sheet 1** tab at the bottom of the workbook.

<p class="visually-hidden">
    Video showing the location of elements listed below.
</p>

<video src="videos/tableau_workspace.mp4" 
    autoplay 
    muted 
    loop 
    playsinline 
    style="max-width: 100%; border-radius: 8px; padding: 1rem;">
</video>

- The **Data Pane** and the **Analytics Pane** are located in the **Side Bar** on the left.
- The **Marks Card** is to the right of the **Side Bar**.
- The **Filters Shelf** is above the **Marks Card**.
- Dimensions and measures are placed on the columns and rows **Shelves**.
- The data visualization is designed in the **View**.

<div class="card border-primary mb-3 p-1" style="max-width: 100%;">
  <div class="card-header" style="font-size: 1.8rem;"><img src="images/idea_standard_icon.png" alt="" aria-hidden="true" style="height: 3rem; vertical-align: middle; margin-right: 0.5rem;">Multiple ways to accomplish a task!!!</div>
  <div class="card-body"><img src="images/tableau_logo.svg" class="mx-auto d-block" alt="tableau logo" style="vertical-align: middle; margin-right: 0.5rem; padding: 1rem;"><p>The more you use Tableau, the more you'll notice there's often more than one way to accomplish the same task!!!</p>
  </div>
</div>

### Dimensions vs. measures - Part 2


When you connect to a data source, Tableau automatically categorizes each field as either a *dimension* or a *measure*.  In the **Data Pane**, dimensions appear above the gray line, while measures are listed below it.  If Tableau classifies a dimension as a measure, you can easily correct it by dragging the field above the gray line into the dimension area.

<div class="accordion" id="accordionExample">

  <div class="accordion-item"><h2 class="accordion-header" id="headingOne"><button class="accordion-button fs-3" type="button" data-bs-toggle="collapse" data-bs-target="#collapseOne" aria-expanded="true" aria-controls="collapseOne"><img src="images/guidepost_standard_icon.png" alt="" aria-hidden="true" style="height: 3rem; vertical-align: middle; margin-right: 0.5rem;">Exercise 2. Change measure to dimension</button></h2><div id="collapseOne" class="accordion-collapse collapse show fs-4" aria-labelledby="headingOne" data-bs-parent="#accordionExample"><div class="accordion-body fs-4">Drag the <strong>Index</strong> field above the gray line to categorize <strong>Index</strong> as a dimension.
  </div></div>
  </div>
</div>

### Change data type - Option 2

To change the data type of a field in the Tableau Workspace:

1. Locate the field in the data pane.
2. Right-click on the data type icon before the field name.
3. Select the desired data type from the menu.

### Toggle between workspace, data source and start page
  
<video src="videos/tableau_toggle.mp4" 
    autoplay 
    muted 
    loop 
    playsinline 
    style="max-width: 100%; border-radius: 8px; padding: 1rem;">
</video>

|                                                                          | **Location** | **Toggles between**               |
|:-------------------------------------------------------------------------|:-------------|:----------------------------------|
| ![Data Source tab](images/tableau_data_source.png "Decorative")          | bottom left  | Tableau Workspace and Data Source |
| ![Small Tableau logo white](images/tableau_logo_small.png "Decorative")  | top left     | Data Source and Start Page        |
| ![Small Tableau logo](images/tableau_logo_small_white.png "Decorative")  | top left     | Start Page and Data Source        |
| `Sheet 1`                                                                | bottom left  | Data Source and Tableau Workspace |


# Supplemental readings

::: {.grid .g-4}
::: {.g-col-12 .g-col-md-6 .g-col-lg-4}
<div class="card bg-light mb-3" style="max-width: 20rem;">
  <div class="card-header">BETTER DATA VISUALIZATIONS</div>
  <div class="card-body"><img src="images/cover_schwabish_data_visualization.png" alt="better data visualizations book cover" class="d-block mx-auto"><h4 class="card-title"><a href="https://search.library.osu.edu/permalink/01OHIOLINK_OSU/rr4vai/alma991085487354008507">Better Data Visualizations: A Guide for Scholars, Researchers, and Wonks</a></h4>
  </div>
  <ul class="list-group list-group-flush">
    <li class="list-group-item">by Jonathan Schwabish</li>
    <li class="list-group-item">New York : Columbia University Press, 2021.</li>
  </ul>
</div>

:::

::: {.g-col-12 .g-col-md-6 .g-col-lg-4}
<div class="card bg-light mb-3" style="max-width: 20rem;">
  <div class="card-header">STORYTELLING WITH DATA</div>
  <div class="card-body"><img src="images/cover_knaflic.png" alt="storytelling with data book cover" class="d-block mx-auto" style="max-width: 100%; height: auto;"><h4 class="card-title"><a href="https://search.library.osu.edu/permalink/01OHIOLINK_OSU/1n38col/cdi_proquest_ebookcentral_EBC4187267">Storytelling with Data: A Data Visualization Guide for Business Professionals</a></h4>
  </div>
  <ul class="list-group list-group-flush">
    <li class="list-group-item">by Cole Nussbaumer Knaflic</li>
    <li class="list-group-item">Hoboken, New Jersey: Wiley, 2015.</li>
  </ul>
</div>

:::

:::