# Introduction

I've created a simple example to show how to display a visualization publicly using GitHub and [Binder](https://notebooks.gesis.org/binder/).

# Import Libraries

In [1]:
import pandas as pd
import plotly.express as px

# Import Data

SOURCE DATA: Walsh, M. and Keyes, O. (2024). [U.S. National Park Visit Data (1979-2023)](https://www.responsible-datasets-in-context.com/posts/np-data/?tab=data-essay).

In [2]:
# public file specification
file_url = "https://raw.githubusercontent.com/melaniewalsh/responsible-datasets-in-context/main/datasets/national-parks/US-National-Parks_RecreationVisits_1979-2023.csv"

# load dataframe
df = (pd.read_csv(file_url,
                 dtype={'ParkName' : str,
                        'Region' : str,
                        'State' : str,
                        'Year' : int,
                        'RecreationVisits' : int})
     .query("Year == 2019")
     .sort_values(by=['RecreationVisits'], ascending=False)
     .head(25)
     )
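
The cell above chains three steps: filter to a single year, sort by visits in descending order, and keep the first 25 rows. A minimal sketch of the same pattern on a tiny in-memory table may make the logic easier to check by eye. The rows below, along with the names `sample_csv`, `top_n`, and `sample_top`, are invented for illustration and are not part of the real dataset.

```python
import io

import pandas as pd

# Hypothetical sample rows, invented for illustration (not from the real dataset)
sample_csv = io.StringIO(
    "ParkName,Region,State,Year,RecreationVisits\n"
    "Park A,West,CA,2018,100\n"
    "Park A,West,CA,2019,300\n"
    "Park B,East,TN,2019,500\n"
    "Park C,West,UT,2019,200\n"
)

top_n = 2  # the notebook keeps 25 rows; 2 is enough for this small sample

sample_top = (
    pd.read_csv(sample_csv, dtype={"Year": int, "RecreationVisits": int})
    .query("Year == 2019")                                # keep a single year
    .sort_values(by="RecreationVisits", ascending=False)  # most visited first
    .head(top_n)                                          # keep the top rows
)

print(sample_top["ParkName"].tolist())  # ['Park B', 'Park A']
```

The 2018 row is dropped by the filter, and the least-visited 2019 park is dropped by `head`.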

# Inspect Data

In [3]:
df.head()

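|      | ParkName                 | Region        | State | Year | RecreationVisits |
|------|--------------------------|---------------|-------|------|------------------|
| 1201 | Great Smoky Mountains NP | Southeast     | TN    | 2019 | 12547743         |
| 1021 | Grand Canyon NP          | Intermountain | AZ    | 2019 | 5974411          |
| 2241 | Rocky Mountain NP        | Intermountain | CO    | 2019 | 4670053          |
| 2778 | Zion NP                  | Intermountain | UT    | 2019 | 4488268          |
| 2733 | Yosemite NP              | Pacific West  | CA    | 2019 | 4422861          |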


# Top 25 Parks by Total Recreational Visits in 2019

In [4]:
# horizontal bar chart, one bar per park
fig = px.bar(
    df,
    x="RecreationVisits",
    y="ParkName",
    color="ParkName",
    orientation="h",
    title="National Park Visits in 2019",
    labels={"RecreationVisits": "Visits", "ParkName": "Park"},
    height=800,
)

# hide the y-axis title (the park names label themselves)
fig.update_layout(yaxis={'title': ''})

fig.show()