# Carkeek Park Salmon Survey Visualization

To start, create a personal copy of this notebook by clicking File -> Save a Copy in Drive! Then, read the text below and follow the instructions to create a visualization! If you missed the first day in class, make sure to check in with a labmate.

There are several species of salmon, and most are anadromous, which means they are born in rivers and creeks, go out to sea to live, and return to breed (often in the same river or creek where they were born). Most salmon spend up to a couple of years at sea before returning in autumn in a seasonal migration and spawning event called a salmon run.

Around Seattle, far fewer locations support the reproductive cycle of salmon than there were in recent centuries. However, Northwest Seattle's Carkeek Park has been maintained by the Carkeek Watershed Community Action Project ([CWCAP](https://www.carkeekwatershed.org/mission/)) as an educational site with creeks that partly support a live salmon run. After decades of work to restore the wetlands around the park, salmon now return in autumn to spawn. As it turns out, the conditions at the park do not allow many of the eggs laid by the salmon to survive long enough to become fry (the name for baby salmon). However, the population is supported by the additional release of fry that are born and temporarily raised at the Grover's Creek Salmon Hatchery, which is run by the Suquamish Tribe in Poulsbo.

In autumn, you can visit the creeks in the park and see salmon swimming upstream to spawn, as well as the carcasses of salmon, which die after spawning. You can check out [this video by the Seattle Channel](https://www.youtube.com/watch?v=S5JAcvXAr44) to get a sense of what this looks like.

Every year for several decades, volunteers with CWCAP have conducted a survey to try to count all the living and dead salmon that come in the salmon run, as well as the redds (nests) that salmon build there. In 2024, a record of more than 3000 salmon was observed; the previous record in recent decades was less than 1000. The volunteers also collect other information about the salmon and carcasses that they observe and make it publicly available. While the data from the 2024 record year has not been released yet, we can check out the 2023 data! Let's use that data to answer a question:

**How many male and female chum salmon that were found dead in Carkeek Park were determined to have successfully spawned before perishing in 2023?**

## Objectives

*  Create a data visualization that lets you answer this question

## Setup Extra Tools

Run the code below. It sets up some extra tools and specifications in R. No need to change anything!

In [None]:
library(tidyverse) # This gets some extra graphing tools
options(scipen = 100) # This makes R show numbers in full instead of in scientific notation (like 1e+05)
options(ggplot2.continuous.colour="viridis") # This chooses some color settings
options(ggplot2.continuous.fill = "viridis") # This chooses some color settings
options(warn=-1) # This stops some warning messages popping up
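
If you are curious what the `scipen` setting does, here is a small optional sketch. It is not part of the survey analysis; it just shows how R writes the same number with the default setting (`scipen = 0`) and with the setting used in this notebook.

```r
options(scipen = 0)                # R's default setting
default_style <- format(100000)    # "1e+05" (scientific notation)

options(scipen = 100)              # the setting used in this notebook
notebook_style <- format(100000)   # "100000" (written out in full)

default_style
notebook_style
```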

Run the code below to load the 2023 salmon data (it's saved in the "CWCAP_Salmon_2023.csv" file) and save it with the name carkeek_2023.

In [None]:
carkeek_2023 <- read_csv("CWCAP_Salmon_2023.csv") # read_csv() reads comma-separated (.csv) files

We can check out the variables for this survey data:

In [None]:
glimpse(carkeek_2023)

A lot of these variable names are probably sensible. Some others are weird looking. Some weird variables are used for technical reasons. For instance, uuid stands for Universally Unique Identifier, and is a code to identify this specific data record. Other variables may not make sense if you don't know about the salmon lifecycle or salmon fisheries management.

*   A redd is a "nest" that salmon make on the bed of a river or creek to deposit eggs in
*   The adipose fin is a small fin found on salmon (and other salmonid species). Some salmon are found with this fin missing because fisheries workers remove it on purpose. The fin is removed from fish that are raised by humans in hatcheries before being released as part of programs to manage and sustain salmon population levels. The removal does not seem to impair the salmon's movement to a great degree, and it is used to identify whether a salmon was raised in a hatchery or spawned "naturally".
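
If you want to know which values a variable can take, `count()` is one way to find out. The sketch below uses a tiny made-up table called `toy_survey` (a hypothetical name) instead of the real data. The column names match ones used in this notebook, but every value other than "Dead" is an assumption for illustration, so run `count()` on `carkeek_2023` yourself to see the real values.

```r
library(dplyr)

# Made-up example rows; the real survey has many more rows and columns,
# and its actual values may be spelled differently
toy_survey <- tibble(
  Type    = c("Dead", "Dead", "Live", "Redd", "Dead"),
  Species = c("Chum", "Coho", "Chum", NA, "Chum"),
  Sex     = c("Male", "Female", "Female", NA, "Female")
)

# count() lists each value found in a column and how many rows have it
type_counts <- toy_survey |>
  count(Type)

type_counts
```

The same pattern works for any column: swap `Type` for another variable name to see the values recorded for it.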



## Example Visualization

The code below will create a visualization to look at observations of dead salmon. You don't need to change this code but run it to have a look at what is shown. Then, inspect each line of code and try to figure out what it is doing. You can delete or modify lines one by one and try to rerun the code to figure out what's going on in each. What do you think filter(Type == "Dead") does? Try swapping out "Dead" for something else. You can always undo your deletions with CTRL+Z (you need to click the code box first). Once you feel like you have an idea of what's going on, you can progress to make your own graph in the next section!

In [None]:
options(repr.plot.width=10, repr.plot.height=8) # This sets up the size of graphs

carkeek_2023 |>
  filter(Type == "Dead") |>
  ggplot(aes(x = Species, fill = Sex)) +
  geom_bar(alpha = 0.75) +
  theme_minimal() +
  theme(text = element_text(size = 24)) +
  labs(title = "Carkeek Park Salmon Corpses 2023",
       subtitle = "Mostly Chum Salmon in 2023",
       caption = "Data from Carkeek Watershed Community Action Project",
       x = "Salmon Species")

## Create Your Graph!

We want to answer: **How many male and female chum salmon that were found dead in Carkeek Park were determined to have successfully spawned before perishing in 2023?**

Create a new visualization that lets you see how many male and female **chum** salmon carcasses were found in Carkeek Park in 2023 *and* the proportion of those chum salmon that surveyors were able to determine had spawned (mated) or not.

Some notes:

*   You can copy and adapt code from above
*   You will need to add a *new line* to the code. This new line of code will make sure that you are only looking at **chum** salmon and not other species of salmon. In other words, we want to filter through our data to make sure that we only look at chum salmon.
*   You will need to swap out variable names
*   You should change the title and labels of this graph.
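
If you are unsure what these notes look like in practice, here is a sketch of the same kind of changes made to a different dataset. It uses `mpg`, a practice dataset about cars that comes with ggplot2, so it is not the answer for the salmon data. The names `suv_2008` and `suv_plot` are made up for this example.

```r
library(ggplot2)
library(dplyr)

suv_2008 <- mpg |>
  filter(year == 2008) |>   # first filter: keep only cars from 2008
  filter(class == "suv")    # a new line with a second filter: keep only SUVs

suv_plot <- suv_2008 |>
  ggplot(aes(x = drv, fill = factor(cyl))) +  # x and fill use different variables
  geom_bar(alpha = 0.75) +
  theme_minimal() +
  labs(title = "SUVs from 2008 by Drive Type",  # new title and labels
       x = "Drive Type",
       fill = "Cylinders")

suv_plot
```

Each `filter()` line narrows the data a little more before it reaches `ggplot()`, and the variables inside `aes()` decide what goes on the x-axis and what the bar colors mean.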



## Upload your graph!

Once you're done, take a screenshot of your graph or save it by right-clicking and selecting save-as. In Lab Check-In one, submit your graph!
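
If right-clicking does not work for you, another option is to save the graph with code using `ggsave()`. This is a minimal sketch that assumes you have stored your graph in a variable; `example_plot` and the file name "my_graph.png" are made-up names, and the stand-in graph uses the `mpg` practice dataset rather than the salmon data.

```r
library(ggplot2)

# A small stand-in graph; in the notebook you would store your own graph instead
example_plot <- ggplot(mpg, aes(x = class)) +
  geom_bar()

# Save the graph as a PNG file; width and height are in inches by default
ggsave("my_graph.png", plot = example_plot, width = 10, height = 8)
```

The file is saved in the notebook's working folder, and you will probably need to download it from there before you can submit it.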