# Homework 01: Surveys, Syllabus, and Reading

### <p style="text-align: right;"> &#9989; Joshua Belieu


Welcome to your first homework assignment! This assignment will help you set up the necessary tools and familiarize yourself with course materials essential for your success in this Data Science course.

## Objectives
By completing this homework, you will:

* Familiarize yourself with course expectations and policies
* Set up essential tools for the course
* Assess your current Python knowledge
* Begin engaging with course content

## Part 1. Read Course Materials (10 points)

There are three important documents for this course. In this problem you will find them, read them and summarize them.

🗒️ **Task:** Read the course syllabus (available in Teams and D2L as a PDF). In a new markdown cell, summarize the portions of the syllabus regarding attendance and late assignments.

✏️ **Answer**

Attendance is required for both weekly sessions and is graded based on presence and participation. The late-assignment policy is summarized as follows:

1] ICAs - no late submissions

2] Homeworks - 10% penalty for every 24 hours past the submission deadline, up to 48 hours. After that, late submissions are not accepted.
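As a worked example of the homework late policy above, here is a minimal sketch. It assumes that the 10% is taken off the earned score and that any started 24-hour period counts as a full period; the summary does not settle either point. The helper names `late_penalty_fraction` and `apply_late_policy` are hypothetical.

```python
import math
from typing import Optional

PENALTY_PER_PERIOD = 0.10  # 10% per 24 hours late (from the summary above)
MAX_LATE_HOURS = 48        # later submissions are not accepted


def late_penalty_fraction(hours_late: float) -> Optional[float]:
    """Return the fraction of credit lost, or None if the work is too late.

    Assumption: a partially elapsed 24-hour period counts as a full one.
    """
    if hours_late <= 0:
        return 0.0
    if hours_late > MAX_LATE_HOURS:
        return None
    periods = math.ceil(hours_late / 24)
    return PENALTY_PER_PERIOD * periods


def apply_late_policy(score: float, hours_late: float) -> float:
    """Apply the penalty to an earned score; too-late work earns zero."""
    penalty = late_penalty_fraction(hours_late)
    if penalty is None:
        return 0.0
    return score * (1 - penalty)


# Example: a homework that earned 50 points, submitted 30 hours late.
print(apply_late_policy(50, 30))
```

Under these assumptions, a submission 30 hours late falls in the second 24-hour period and loses 20% of its score.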

🗒️ **Task:** Read the AI policy (PDF). In a new markdown cell, summarize what you need to do when you use AI for an assignment.



✏️ **Answer**

Responsibilities when using an AI tool:

1] Document and report its use.

2] Understand the code it generates and be prepared to defend or augment the code.

3] Inform collaborators of its use.

4] Explicitly mark passages of code generated by the tool.

5] Do not let AI replace learning.

🗒️ **Task:** Read the AI guide to best practices (PDF). Also in a new markdown cell, give us some feedback on your experiences using AI in the past. How well did it work? What strategies have you developed for yourself, if any? 

✏️ **Answer**

A few versions back, I tried to have ChatGPT output some code to solve the Schrödinger equation in order to get the eigenfunctions and eigenenergies of a system with a Coulombic and screened potential. The best that ChatGPT could do was to place the system in a box and treat it as some strange finite square well problem (without using perturbation theory). I do not remember the results, but the method was not to my liking, so I went back to Stack Exchange. My typical workflow is as follows:

1] Attempt to write my own code and get a solution.

   1a) If successful, proceed to examine results and refine methods.

   1b) If unsuccessful, give the problem as a prompt to ChatGPT and provide my code if it is usable.

2] Inspect solution and consider results.

   2a) If ChatGPT was used, I go back and read every line to try to understand what it is doing. I then add comments reflecting my understanding and confer with ChatGPT to confirm that understanding or to expose any faults in it or in the code.
   
3] Refine methods, if needed.

## Part 2. GitHub Setup (10 points)

You will use GitHub for your two projects. In addition to teaching you an important data science tool, this will allow you to retain all of your work beyond this course and share it with people around the world.

🗒️ **Task:**
* Create a GitHub account (if you don't already have one)
* If you are not already familiar with GitHub:
  * [this](https://www.youtube.com/watch?v=tRZGeaHPoaw) is a great video - watch it slowly over the next week or so
  * and, you can complete the GitHub Hello World guide: https://guides.github.com/activities/hello-world/
* Create a new repository for this course and call it `cmse830_fds`

In the markdown cell below, add your GitHub username and repo name.

✏️ **Answer**

username : jabelieu

repo name : cmse830_fds

## Part 3. Initial and Python Surveys (10 points)

Complete the [initial survey](https://forms.office.com/r/GNHk4SgyNg) _if you haven't already_.

Complete the Python coding knowledge survey: [click here](https://forms.office.com/r/c9exRxM4aV)

Put a statement in a markdown cell that you have completed both of these.


✏️ **Answer**

I have done both of these!


## Part 4. Textbook Reading (10 points)

🗒️ **Task:** Read Chapter 1 of "_Thinking Clearly with Data_"
* In a markdown cell, give a very high-level overview of the book
   * what are its goals and how are those goals organized? 
   * what can you hope to get out of this book?
* Write a brief summary (200-300 words) of the key points from Chapter 1

✏️ **Answer**

The goal of this book is to teach its readers to be mindful and thoughtful about conclusions drawn from data. The book is organized into four parts, which can be understood as: defining correlation and causation, what is required to establish a correlation, what is required to establish causality, and how to take the data and analysis into account when considering next steps. I hope to gain a better foundation and intuition for consuming analyses and their conclusions. I also hope to be able to perform my own analysis on a dataset using the tools and skills promised in this book.

Chapter 1 is composed of three sections: an introduction, a set of cautionary tales, and the relationship between thinking and data. The introduction establishes the notion that the modern day is inundated with data and that, with the vast amounts generated each day, it is even more critical to be able to engage with data to extract conclusions. This section also sets the tone for the rest of the book, emphasizing conceptual understanding over mathematical rigor. The cautionary tales section takes us through a medical diagnosis, governmental resistance schemes, and “broken window” policing. Each story has a moral linked to topics in the text; these are summarized below:

* Be aware of all statistics, what they mean explicitly, and how they inform conclusions. This was seen in the diagnosis story, where the validity of the test's conclusions came into question because only some of the relevant statistics were considered.

* Data classification is essential in analyzing and drawing conclusions from the data. This seemed to be an element in the protesting story. There may have been misinformed results based on how one classifies the protests.

* Establishing causal relationships requires supporting analysis to set them apart from correlational relationships. This was seen in the protest and policing stories. Both stories featured thesis statements that claimed a causal relationship with no analysis to support the claim until after the fact.

* One must keep in mind the “fluid” nature of data about the mean when drawing conclusions from an analysis. This was seen in the policing story when considering the change in reported crime activity in consecutive years. The activity changed in both years, but the values seemingly fluctuated about a mean.
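The last point is commonly called regression to the mean (a term assumed here; the summary above does not name it). The following is a minimal simulation sketch with made-up numbers and hypothetical names such as `simulate_precincts`. It illustrates how precincts with unusually high reported crime in one year tend to look closer to average the next year, even when nothing about them changes.

```python
import random
import statistics


def simulate_precincts(n_precincts=1000, true_rate=100.0, noise_sd=15.0, seed=0):
    """Simulate two years of reported crime counts for identical precincts.

    Every precinct has the same underlying rate; the yearly values differ
    only by independent random noise. All numbers are illustrative.
    """
    rng = random.Random(seed)
    year1 = [rng.gauss(true_rate, noise_sd) for _ in range(n_precincts)]
    year2 = [rng.gauss(true_rate, noise_sd) for _ in range(n_precincts)]
    return year1, year2


year1, year2 = simulate_precincts()

# Select the 10% of precincts with the highest year-1 values.
cutoff = sorted(year1)[int(0.9 * len(year1))]
worst = [i for i, value in enumerate(year1) if value >= cutoff]

# Compare the same precincts across the two years.
worst_year1_mean = statistics.mean(year1[i] for i in worst)
worst_year2_mean = statistics.mean(year2[i] for i in worst)

print(f"Top-10% precincts, year 1 mean: {worst_year1_mean:.1f}")
print(f"Same precincts, year 2 mean:    {worst_year2_mean:.1f}")
```

In this sketch, the selected group drops back toward the overall average without any intervention, which is why a before/after comparison on units chosen for being extreme can be misleading.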


---
## Part 5. Streamlit Setup and Test (10 points)

🗒️ **Task:** We will develop web apps using [Streamlit](https://streamlit.io) throughout the semester and your projects will use this tool. 
 
Install Streamlit on your computer, follow the installation guide: https://docs.streamlit.io/library/get-started/installation

Note that Streamlit is run from the command line. [Read this.](https://docs.streamlit.io/get-started/fundamentals/main-concepts) Bookmark this documentation for later in the semester. 

Let us know if you have any issues with this.

Next, create two .py files containing the code below and confirm that each one runs on your computer, for example with `streamlit run <your_file>.py` from the command line. It is not necessary that you understand what the code does; the goals here are:

1. to get Streamlit operational on your computer, and let us know where the problems are,
2. and to see what Streamlit does and how it functions.

```python
# Streamlit Code 1

import streamlit as st

# Display a line of text in the app
st.write("Hello World!")
```

```python
# Streamlit Code 2

import streamlit as st
import pandas as pd
import numpy as np

# Build a table of 20 rows of random values in three columns
chart_data = pd.DataFrame(
   np.random.randn(20, 3),
   columns=['a', 'b', 'c'])

# Plot each column as a line in the app
st.line_chart(chart_data)
```

Include screenshots to show that you got this to work.


✏️ **Answer:** Put your screenshots here

![Streamlit screenshot](fds_hw1_st1.png)

![Streamlit screenshot](fds_hw1_st.png)

---
## Congratulations, you're done!

Submit this assignment by uploading your notebook to the course Desire2Learn web page.  Go to the "Homework" folder, find the appropriate submission link, and upload everything there. Make sure your name is on it!

&#169; Copyright 2023, Department of Computational Mathematics, Science and Engineering at Michigan State University.