Q1. Load the "titanic" dataset using seaborn's load_dataset function. Use Plotly Express to create a
scatter plot of the age and fare columns in the titanic dataset.

In [2]:
import seaborn as sns
import plotly.express as px

# Load the titanic dataset
titanic_data = sns.load_dataset("titanic")

# Create a scatter plot for age and fare columns (Titanic fares are in pounds sterling, not dollars)
fig = px.scatter(titanic_data, x="age", y="fare", title="Age vs Fare Scatter Plot - Titanic Dataset",
                 labels={"age": "Age", "fare": "Fare (£)"})

# Show plot
fig.show()


Q2. Using the tips dataset in the Plotly library, create a box plot using Plotly Express.

In [3]:
import plotly.express as px

# Load the tips dataset available in Plotly
tips_data = px.data.tips()

# Create a box plot using Plotly Express
fig = px.box(tips_data, x="day", y="total_bill", title="Box Plot of Total Bill by Day",
             labels={"day": "Day of the Week", "total_bill": "Total Bill ($)"})

# Show the plot
fig.show()


Q3. Using the tips dataset in the Plotly library, plot a histogram with x="sex" and y="total_bill" from
the tips dataset. Also, use the "smoker" column with the pattern_shape parameter and the "day"
column with the color parameter.

In [4]:
import plotly.express as px

# Load the tips dataset from Plotly
tips_data = px.data.tips()

# Plot a histogram; because y is given, each bar shows the sum of total_bill (histfunc defaults to "sum")
fig = px.histogram(
    tips_data,
    x="sex",
    y="total_bill",
    color="day",                 # Color by the 'day' column
    pattern_shape="smoker",       # Pattern by the 'smoker' column
    title="Histogram of Total Bill by Gender with Day and Smoker Pattern",
    labels={"sex": "Gender", "total_bill": "Total Bill ($)"}
)

# Show the plot
fig.show()
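
As a rough check on what the bars in this histogram represent, the aggregation can be reproduced with pandas. This is only a sketch: the small `toy` frame below is a hypothetical stand-in with made-up values (not the real tips data), and the smoker split is left out for brevity.

```python
import pandas as pd

# Hypothetical miniature stand-in for the tips data (values are made up)
toy = pd.DataFrame({
    "sex": ["Female", "Male", "Male", "Female", "Male"],
    "day": ["Sun", "Sun", "Sat", "Sat", "Sun"],
    "total_bill": [10.0, 20.0, 15.0, 5.0, 30.0],
})

# Sum of total_bill per (sex, day): the quantity each colored bar segment shows
segment_totals = toy.groupby(["sex", "day"])["total_bill"].sum()

# Sum of total_bill per sex: the full height of each stacked bar
bar_totals = toy.groupby("sex")["total_bill"].sum()

print(segment_totals)
print(bar_totals)
```

Running the same two groupby calls on tips_data should give the segment and bar heights of the figure above, assuming the default histfunc is left unchanged.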


Q4. Using the iris dataset in the Plotly library, plot a scatter matrix, using the "species" column for
the color parameter.
Note: Use only the "sepal_length", "sepal_width", "petal_length", and "petal_width" columns with the
dimensions parameter.

In [7]:
import plotly.express as px

# Load the iris dataset from Plotly
iris_data = px.data.iris()

# Create a scatter matrix plot with specified dimensions and color by species
fig = px.scatter_matrix(
    iris_data,
    dimensions=["sepal_length", "sepal_width", "petal_length", "petal_width"],  # Only the specified columns
    color="species",  # Color by species column
    title="Scatter Matrix Plot of Iris Dataset",
    labels={"species": "Species"}
)

# Show the plot
fig.show()


Q5. What is a distplot? Using Plotly Express, plot a distplot.

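A5. A distplot, short for "distribution plot," shows the distribution of a numeric dataset. It typically combines a histogram with a kernel density estimate (KDE) curve, and often a rug plot marking the individual observations. The histogram shows how many observations fall in each bin, while the KDE curve smooths the data into a continuous estimate of the probability density. Plotly Express has no dedicated distplot function (Plotly's own version is create_distplot in plotly.figure_factory), so the cell below approximates one with px.histogram, a marginal plot, and a KDE curve added through Plotly Graph Objects.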
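
To make the smoothing step concrete, here is a minimal NumPy sketch of a Gaussian KDE. The function name gaussian_kde_curve and the Scott's-rule default bandwidth are assumptions made for illustration; they are not part of Plotly.

```python
import numpy as np

def gaussian_kde_curve(samples, grid, bandwidth=None):
    """Evaluate a Gaussian kernel density estimate of `samples` on `grid`."""
    samples = np.asarray(samples, dtype=float)
    grid = np.asarray(grid, dtype=float)
    n = samples.size
    if bandwidth is None:
        # Scott's rule of thumb for one-dimensional data (an assumed default)
        bandwidth = samples.std(ddof=1) * n ** (-1 / 5)
    # Standardized distance between every grid point and every sample
    z = (grid[:, None] - samples[None, :]) / bandwidth
    kernels = np.exp(-0.5 * z**2) / np.sqrt(2 * np.pi)
    # Sum the kernels and rescale so the curve integrates to roughly 1
    return kernels.sum(axis=1) / (n * bandwidth)

rng = np.random.default_rng(0)
samples = rng.normal(0, 1, 1000)
grid = np.linspace(-5, 5, 401)
density = gaussian_kde_curve(samples, grid)

# Riemann-sum check that the estimate behaves like a probability density
area = float(np.sum(density) * (grid[1] - grid[0]))
print(f"area under the KDE curve: {area:.3f}")
```

Each observation contributes one small bell curve, and the KDE is their average. A larger bandwidth gives a smoother curve; a smaller one follows the histogram more closely.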

In [1]:
import plotly.express as px
import plotly.graph_objects as go
import numpy as np
from scipy.stats import gaussian_kde  # SciPy is assumed to be installed

# Generate some random data (seeded so the figure is reproducible)
rng = np.random.default_rng(0)
data = rng.normal(0, 1, 1000)

# Plot a density-normalized histogram so its y-axis matches the KDE curve
fig = px.histogram(x=data, nbins=30, histnorm="probability density", marginal="violin",
                   opacity=0.7, title="Distplot Example with Plotly Express")

# Add a KDE curve estimated from the data, using Plotly Graph Objects
x_grid = np.linspace(data.min(), data.max(), 200)
fig.add_trace(go.Scatter(
    x=x_grid,
    y=gaussian_kde(data)(x_grid),
    mode='lines',
    name='KDE Curve',
    line=dict(color='blue')
))

# Show plot
fig.show()
