### Visualization in Python

Visualization in Python is usually done by plotting data as graphs.
Common graph types include:
- line graph (used for understanding the trend)
- area graph (used for understanding the trend)
- bar graph (comparing the discrete values)
- pie graph (comparing percentage values)
- boxplot graph (for understanding the outliers and quartiles in statistical analysis)
- histogram (for understanding the distribution of data in statistical analysis)
- scatter graph (for understanding the relationship between two or more variables)
- heatmap graph (shows the values in a grid as colors; a 2D histogram is one example)
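
The first four graph types in the list can be sketched with matplotlib alone. The block below is a minimal illustration, not a recommended layout: the monthly figures and category counts are invented sample data, and the 2x2 grid is just one convenient arrangement.

```python
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical sample data, invented purely for illustration.
months = np.arange(1, 7)
sales = np.array([12, 15, 14, 18, 21, 19])
categories = ['A', 'B', 'C', 'D']
counts = np.array([5, 9, 3, 7])

fig, axes = plt.subplots(2, 2, figsize=(8, 6))

# Line graph: follows a trend across ordered x values.
axes[0, 0].plot(months, sales, marker='o')
axes[0, 0].set_title('Line')

# Area graph: the same trend with the region under the line filled in.
axes[0, 1].fill_between(months, sales, alpha=0.4)
axes[0, 1].set_title('Area')

# Bar graph: compares discrete categories.
axes[1, 0].bar(categories, counts)
axes[1, 0].set_title('Bar')

# Pie graph: shows each category as a share of the total.
axes[1, 1].pie(counts, labels=categories, autopct='%1.0f%%')
axes[1, 1].set_title('Pie')

fig.tight_layout()
```

Calling `plt.show()` afterwards displays the figure, as in the cell that follows.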

In [None]:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# The boxplot is a quick way of visually summarizing
# one or more groups of numerical data through their quartiles.
data = np.random.randint(1, 11, 10)
# adding an outlier in the data
data[9] = 20
# Points drawn beyond the whiskers are outliers, which means
# that the value is far from the rest of the data.
# The median is shown by the line inside the box.
# The bottom and the top of the box are the first and third quartiles.
# By default, the whiskers extend to the most extreme values that lie
# within 1.5 times the interquartile range of the box.
# showmeans=True adds a marker at the mean.
plt.boxplot(data, notch=True, showmeans=True)
plt.text(1.1, 19.75, 'Outlier')
plt.show()

# Histogram is a graphical representation of the distribution of numerical data.
# It is an estimate of the probability distribution of a continuous variable.
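# Ticks are the labeled positions marked on the x-axis and y-axis.
# Bins are the intervals into which the data is grouped;
# here they are passed as a sequence of bin edges, each bin 2 wide.
# The stop value is max + 2 so the last edge is never below the maximum;
# otherwise the largest value could fall outside the final bin and be dropped.
gap = np.arange(data.min(), data.max() + 2, 2)
plt.hist(data, bins=gap, edgecolor='black')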
plt.xticks(gap)
plt.show()

# The scatter plot is a graph in which the values of two variables are plotted
# along two axes, the pattern of the resulting points revealing any correlation present.
# Correlation describes how strongly two variables move together.
# The correlation coefficient is a measure
# of how closely the points in a scatter plot lie on a straight line.
# Here x and y are drawn independently at random, so no clear pattern is expected.
x = np.random.randint(0, 101, 100)
y = np.random.randint(0, 101, 100)
# Marker size (s) is x * sqrt(y); color (c) is x * y, mapped through the 'Blues' colormap.
plt.scatter(x, y, s=x*y**0.5, alpha=0.5, c=x*y, cmap='Blues')
plt.colorbar()
plt.show()

# The heatmap is a graphical representation of data in which data values
# are represented as colors. That is, it uses color in order to communicate a value.
# This makes it easy to visualize large amounts of data.
# The heatmap is also known as a colored matrix.
# Seaborn is a Python data visualization library based on matplotlib.
# It provides a high-level interface for drawing attractive and informative statistical graphics.
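# load_dataset fetches the example 'tips' dataset from seaborn's online repository.
tips = sns.load_dataset('tips')
# Passing both x and y to displot draws a bivariate (2D) histogram,
# which is rendered as a heatmap-style grid of colored cells.
# col and row split the data into a grid of subplots; hue is also available.
sns.displot(x='tip', y='day', col='sex', row='time', data=tips)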
plt.show()
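
The comments above describe both the correlation coefficient and the heatmap as a colored matrix. The two ideas can be combined in one sketch: compute pairwise Pearson coefficients with pandas and draw them with `sns.heatmap`. The data here is synthetic and the column names `a`, `b`, `c` are hypothetical, chosen only so that one pair is strongly related and another is not.

```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Hypothetical data: three columns invented for illustration.
rng = np.random.default_rng(0)
a = rng.normal(size=200)
df = pd.DataFrame({
    'a': a,
    'b': 2 * a + rng.normal(scale=0.5, size=200),  # strongly related to a
    'c': rng.normal(size=200),                      # independent noise
})

# Pearson correlation coefficients between every pair of columns.
corr = df.corr()

fig, ax = plt.subplots()
# Each cell's color encodes a coefficient; annot=True prints the number in the cell.
sns.heatmap(corr, annot=True, vmin=-1, vmax=1, cmap='coolwarm', ax=ax)
ax.set_title('Correlation matrix')
```

Fixing `vmin=-1` and `vmax=1` keeps the color scale centered on zero, so positive and negative correlations are visually distinct. Call `plt.show()` to display the figure.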