### What is a visual argument?

All design choices impact how a viewer will understand a data visualization. Even the simplest visualizations have an argument, a thesis, or a central point — and the design choices we make (or ignore) can have a positive or negative effect on getting that point across.

To create more readable and understandable visualizations, we have some simple, effective tools at our disposal in matplotlib. Here are six strategies we’ll learn for making a strong, clear visual argument:

-   choose the right chart
-   use subplots to compare multiple graphs
-   remove distracting lines (i.e., chartjunk)
-   use color for emphasis
-   add annotations to the graph
-   present the graph with context

In this lesson, we’ll work with a dataset that catalogs trees around the Tapajós River, a tributary of the Amazon River that runs through the Amazon Rainforest. Some preliminary data manipulation has been done for you to aggregate and organize the data for our purposes. (This is a crucial step in most data visualization processes, and a great reason to become familiar with the `pandas` library! You can check out the other notebook in this folder if you want to see how we organized the data using `pandas`.) Use the Jupyter notebook to the right to explore the data, and then we’ll dive into making some visualizations in the next exercise!


### Choose the right chart for the data

Charts and data visualizations help us to make sense of the world around us. They are most helpful when we can connect what we see on the page or screen to our everyday understanding of the world.

We can make that connection easier by picking a chart that, whenever possible, leans on the audience’s real-world interaction with the data. For example, consider whether your data is better suited to a horizontal or vertical presentation. Simply changing the orientation of a chart can have a big impact on its readability. Consider these two box plots:

A box plot of finishing times for a half-marathon makes the most sense drawn horizontally, since linear time is often visualized from left to right (and distance is a “horizontal” measure).

A box plot of women’s heights, on the other hand, makes the most sense drawn vertically, since we measure standing height from the ground up.

No matter the type of chart, this simple choice can help to start the viewer off on the right foot.
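The two box plots described above can be sketched in matplotlib as follows. This is a minimal illustration, not the lesson’s own code: the values are randomly generated placeholders (not real race or height measurements), and the variable names are made up for this example. It uses the `vert=False` argument of `boxplot` to draw the horizontal version; recent matplotlib releases also offer an `orientation` argument for the same purpose.

```python
import numpy as np
from matplotlib import pyplot as plt

# Synthetic placeholder data for illustration only (NOT real measurements)
rng = np.random.default_rng(seed=0)
finish_minutes = rng.normal(loc=125, scale=20, size=200)  # hypothetical race times
heights_cm = rng.normal(loc=162, scale=7, size=200)       # hypothetical heights

fig, (ax_time, ax_height) = plt.subplots(1, 2, figsize=(8, 3))

# Time reads naturally from left to right, so draw this box plot horizontally
time_box = ax_time.boxplot(finish_minutes, vert=False)
ax_time.set_xlabel("Finishing time (minutes)")
ax_time.set_yticks([])  # a single box needs no category tick
ax_time.set_title("Half-marathon finishing times")

# Standing height is a vertical measure, so keep the default vertical layout
height_box = ax_height.boxplot(heights_cm)
ax_height.set_ylabel("Height (cm)")
ax_height.set_xticks([])
ax_height.set_title("Heights")

fig.tight_layout()
```

In a notebook with inline plotting enabled, the figure appears below the cell; in a plain script, add `plt.show()` at the end.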

In the Jupyter notebook, we’re going to visualize the average tree heights of the five most common plant families in our dataset. We’ll visualize both the `crown_base` height (the trunk) and the `depth_crown` (the leafy part). What do you think will be the clearest way to communicate that visually?

```python
from matplotlib import pyplot as plt
import pandas as pd

# display charts correctly
%matplotlib inline
plt.rcParams['figure.figsize'] = (5, 3)
plt.rcParams['figure.dpi'] = 75
```