# [Main Concept Question]? An Introduction to [Topic]

<div class="alert alert-info">

Welcome back! This activity is part of an introduction to computational notebooks, designed specifically for K-12 educators.

In this notebook, we'll dive into a specific, powerful type of analysis: **[Describe the notebook's main goal]**.

</div>

[Add 1-2 introductory sentences about the topic. e.g., "Data that unfolds over time is everywhere..."]

## Key Ideas in [Topic]
<br/>

To explore the key ideas behind the code, math, and science of [Topic], this notebook works with **[Content Area, e.g., "air quality data" or "housing prices"]**.

**Our Learning Goals:**

* **[Code/Data Skill, e.g., Handling Dates or Managing Lists]:** [Description of the skill, e.g., "Learn to create and use `datetime` objects..."]
* **[Conceptual Skill, e.g., Visualizing Trends or Exploring Center]:** [Description of the skill, e.g., "Use `lineplot` to reveal patterns..." or "Understand how `mean` and `median`..."]
* **[Content Connection, e.g., Content]:** [Description of the content, e.g., "Explore how air quality is measured..." or "Apply these statistical concepts to explore classroom test scores..."]

<hr style="border: 1px solid #fdb515;" />

We'll start with a quick intro to [Topic] by looking at a simple example: **[Describe the simple scenario, e.g., "a set of student test scores" or "a week of hourly AQI data"]**.

### Part I: Loading the Data

We are going to load a simple dataset. [Describe the data source, e.g., "This is a simple list of hypothetical test scores..." or "This data comes from..."].

In [None]:
# [e.g., Import pandas]
import pandas as pd
import numpy as np

# [Load the simple data, either as a list or from a simple CSV]
# Example 1: List
simple_data = [85, 92, 78, 88, 95, 81, 75, 89, 90, 85]
print(simple_data)

# Example 2: Simple CSV
# simple_data = pd.read_csv('[URL_to_simple_data.csv]')
# simple_data

<div class="alert alert-success">

**Considering the data above…**

* What do you notice and wonder about this dataset?
* What questions could you explore further?

</div>

### Part II: [Core Skill, e.g., "Describing the data" or "Working with `datetime`"]

[Explain the skill you are about to introduce. e.g., "You may have noticed that the month, day, year, and hour... To fix this, we’ll combine them..."]

In [None]:
# [Add code that demonstrates the core skill]
# Example 1: Calculating statistics
# mean_score = np.mean(simple_data)
# median_score = np.median(simple_data)
# print("Mean:", mean_score)
# print("Median:", median_score)

# Example 2: Converting to datetime
# simple_data['datetime'] = pd.to_datetime(simple_data['date_column'])
# simple_data

<div class="alert alert-success">

**Considering the results above…**

* [Add a question about the output. e.g., "Would you say the mean or the median is more 'typical'...? Why?"]
* [Add another question. e.g., "How might this new `datetime` column help with analysis?"]

</div>
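If the core skill is combining separate date parts into a single `datetime` column, a minimal sketch might look like the one below. The `readings` table, its column names, and its AQI values are made up for illustration and are not from any real dataset. It assumes the date parts live in columns named `year`, `month`, `day`, and `hour`, which `pd.to_datetime` can assemble into one timestamp per row.

```python
import pandas as pd

# Hypothetical readings with the date split across separate columns
# (all values are invented for illustration)
readings = pd.DataFrame({
    "year": [2020, 2020, 2020],
    "month": [9, 9, 9],
    "day": [9, 9, 9],
    "hour": [8, 9, 10],
    "AQI": [152, 160, 171],
})

# Build one timestamp per row from the year, month, day, and hour columns
readings["datetime"] = pd.to_datetime(readings[["year", "month", "day", "hour"]])

print(readings[["datetime", "AQI"]])
```

With a single `datetime` column in place, the rows can be sorted, filtered, and plotted in time order.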

### Part III: Visualizing the Data

Now that we have [prepared our data/calculated statistics], we can make a plot that shows the "shape" of the data.

[Optional: Add context for the visualization, like the EPA AQI chart image.]

In [None]:
import seaborn as sns
import matplotlib.pyplot as plt

# [Add code for a simple visualization]
# Example 1: Histogram
# sns.histplot(simple_data, bins=5)

# Example 2: Lineplot
# sns.lineplot(data=simple_data, x='datetime', y='AQI')

<div class="alert alert-success">

**Looking at the visualization above…**

* What do you notice and wonder?
* [Add a question about the plot. e.g., "What could have happened to create the pattern you see above?"]
* [Add another question. e.g., "What is easier, and harder, to understand... when you are using the plot versus when you are using the values?"]

</div>
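To connect a histogram back to the values behind it, the sketch below counts how many of the Part I example scores fall into each of 5 equal-width bins. It assumes the bins run from the lowest score to the highest, which is how `np.histogram` divides the range by default; a plotting library may choose its bins differently.

```python
import numpy as np

# The example scores from Part I
simple_data = [85, 92, 78, 88, 95, 81, 75, 89, 90, 85]

# Split the range from the lowest to the highest score into 5 equal-width bins
counts, edges = np.histogram(simple_data, bins=5)

# Print each bin's range next to the number of scores that fall inside it
for count, left, right in zip(counts, edges[:-1], edges[1:]):
    print(f"{left:.0f} to {right:.0f}: {count} scores")
```

Each printed count corresponds to the height of one bar, so the list of counts and the plot describe the same "shape" in two different ways.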

### Part IV: Exploring [e.g., Different Distributions or Adding to the Plot]

[This section is for prompting the user to modify code or explore "what if" scenarios.]

**Scenario 1: Prompting code adaptation**
You can add extra lines to the graph to help you home in on what's going on. Use the code below to get comfortable adding lines to the plot above.

[Provide a code snippet for the user to copy/adapt]
plt.axvline(x=pd.to_datetime("..."), color='red')
plt.axhline(y=20, color='red')
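As a self-contained sketch of the same idea, the example below draws a small line plot and then adds one vertical and one horizontal reference line. The timestamps and values are hypothetical, invented only so the code runs on its own; swap in your own data and reference points.

```python
import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical measurements taken every three hours (invented for illustration)
times = pd.to_datetime([
    "2020-09-09 06:00",
    "2020-09-09 09:00",
    "2020-09-09 12:00",
    "2020-09-09 15:00",
    "2020-09-09 18:00",
])
values = [12, 18, 25, 22, 15]

fig, ax = plt.subplots()
ax.plot(times, values)

# A vertical line marks a moment in time on the x-axis
ax.axvline(x=pd.to_datetime("2020-09-09 12:00"), color="red")

# A horizontal line marks a threshold value on the y-axis
ax.axhline(y=20, color="red")
```

In a notebook the figure appears when the cell finishes running. The vertical line is useful for marking an event, and the horizontal line for marking a cutoff.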

**Scenario 2: Prompting data changes**
Now, let's see how interactivity can help you explore these ideas. What would happen to the [mean, median, etc.] if [scenario, e.g., "a test was particularly difficult"]?

Go back to [Part I: Loading the Data](#set-scores) and change the list of scores to test your predictions.
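One way to check a prediction is to compare the statistics before and after a change. The sketch below uses the example scores from Part I and a hypothetical change in which the lowest score drops from 75 to 35; that change is an assumption chosen for illustration, not part of the dataset.

```python
import numpy as np

# The example scores from Part I
original_scores = [85, 92, 78, 88, 95, 81, 75, 89, 90, 85]

# Hypothetical change: the lowest score (75) drops to 35
changed_scores = [85, 92, 78, 88, 95, 81, 35, 89, 90, 85]

original_mean = np.mean(original_scores)
original_median = np.median(original_scores)
changed_mean = np.mean(changed_scores)
changed_median = np.median(changed_scores)

print("Original mean:", original_mean, "median:", original_median)
print("Changed mean:", changed_mean, "median:", changed_median)
```

In this sketch the mean falls from 85.8 to 81.8 while the median stays at 86.5, because only one value at the low end of the list changed.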

**Summary:** You now know how to [skill 1, e.g., "work with `datetime`"], [skill 2, e.g., "plot line graphs"], and [skill 3, e.g., "explore trends over time"].

Now let’s apply these skills to a more interactive data investigation.

<hr style="border: 5px solid #003262;" />

# [Engaging Title for Investigation, e.g., "Keeping it Real (Estate)" or "Getting Clear on Air Quality"]

[Add 1-2 paragraphs of introduction and context for the real-world scenario. Add images if helpful.]

## Part I: Loading the Data

First, let’s preview the dataset we’ll work with. [Describe the real-world dataset].

In [None]:
# [Load the real-world data, usually from a URL]
real_data = pd.read_csv('[URL_to_real_data.csv]')

# [Optional: Filter the data to a subset]
# my_data = real_data[real_data['city'] == 'South Lake Tahoe']
# my_data

real_data

## Part II: [e.g., Understanding the Pollutants or Describing the Data]

[This is a good place to provide more context (like the pollutant table) or to repeat the analysis from the simple example, but on the new data.]

In [None]:
# [Add analysis code for the real-world data]
# Example: Calculating stats
# mean_price = np.mean(my_data['price'])
# median_price = np.median(my_data['price'])
# print("Mean Price:", mean_price)
# print("Median Price:", median_price)

<div class="alert alert-success">

**Considering the values above…**

* [Add a question about the real-world data output. e.g., "Do these values seem reasonable, given what you know...?"]
* [Add another question. e.g., "Do you expect the mean or median home price to be higher? Why?"]

</div>

## Part III: Visualizing the Data

Now let's visualize this new data to look for patterns. [Add any specific instructions.]

In [None]:
# [Add code for visualizing the real-world data]
# [This might be a more complex plot, like a FacetGrid]

# Example 1: Histogram
# plt.hist(my_data['price'], bins=20, edgecolor='black')
# plt.axvline(mean_price, color='red', label='Mean Price')
# plt.axvline(median_price, color='yellow', label='Median Price')
# plt.legend()
# plt.show()

# Example 2: FacetGrid
# g = sns.FacetGrid(real_data, row="pollutant", aspect=4, sharey=False)
# g.map(sns.lineplot, "datetime", "value")

<div class="alert alert-success">

**Looking at the visualization above…**

* [Add a question about the plot. e.g., "Has your understanding of what a 'typical' home price... changed?"]
* [Add another question. e.g., "Which patterns among the graphs are worth taking seriously?"]

</div>

## In the Classroom

The skills you just practiced—[e.g., "calculating mean vs. median" or "analyzing time series data"]—are critical in many fields.

**Where Else Do We See [Concept, e.g., "Skewed Data" or "Patterns Over Time"]?**

* **[Subject 1, e.g., Economics]:** [Example, e.g., "Individual income..."]
* **[Subject 2, e.g., Biology]:** [Example, e.g., "Number of offspring..."]
* **[Subject 3, e.g., Social Media]:** [Example, e.g., "Number of 'likes'..."]

**Connecting to Standards**

### Common Core State Standards (CCSS) - Mathematics
* **[Standard, e.g., 6.SP.B.5c]:** [Description, e.g., "Giving quantitative measures of center (median and/or mean) and variability..."]

### Next Generation Science Standards (NGSS)
* **[Standard, e.g., Practice 4: Analyzing and Interpreting Data]:** [Description, e.g., "This notebook provides a direct application of this practice..."]
* **[Standard, e.g., Crosscutting Concept 3: Scale, Proportion, and Quantity]:** [Description, e.g., "Understanding measures of center and spread is fundamental..."]

# Credits

This notebook was developed as part of "Show Your Work" (SyW), a research and development project at UC Berkeley to introduce computational notebooks to K-12 educators.

[Acknowledge team members, specific contributions, and any LLM assistance.]

[Acknowledge data sources, e.g., "Data from [xxx BAAQMD and AQS]."]

[Acknowledge image sources, e.g., "Observing the Orange Sky... by Semiautonomous / Wikipedia / CC-BY-SA-4.0."]