**First things first** - please go to 'File' and select 'Save a copy in Drive' so that you have your own version of this activity set up and ready to use.
Remember to add the links to your own work once completed!

# Welcome to Course 101: Applying statistics and core data science techniques in business.

Throughout the course, you will learn to:

1. Systematically analyse complex organisational challenges to enhance problem-solving skills and propose strategic solutions.
2. Develop advanced critical and statistical thinking skills by identifying meaningful patterns in complex data.
3. Evaluate and select statistical techniques appropriate to a solutions design approach.
4. Synthesise and systematically implement techniques to enhance data representation and improve model performance.
5. Critically select, evaluate and implement effective unsupervised learning techniques to design innovative solutions.

This will be the index page for your portfolio. For each activity in the course, you will need to save a copy of the provided template (available on the relevant Canvas page), complete it, and add the link to this index to direct your tutors, peers, and potential employers to your own work. We have added activity titles as an example in Week 1. You will then link these to your own work.

You will also complete reflections relating to each demonstration and activity throughout the course in this section of your portfolio.

Here is a link to the [Orientation content](https://colab.research.google.com/drive/1bAE7vviDHiU1bQipsB5hFvF-IyCiPMdd?usp=sharing), if you need a recap on how to use Colab.

**Reflection**

Think about a data project you’ve worked on or are aware of that went down the wrong pathway. You may research a case study online if you do not have a specific example to draw on. Then answer the following questions:

1. What was the cause of the incorrect focus of the analysis? For example, were the participants using the wrong data or asking the wrong questions?
2. What would you do differently if you could perform the analysis again or if you had been involved?

# [Week 1: Developing commercial awareness through statistical thinking](https://www.notion.so/Study-Guide-Week-1-Foundations-CRISP-DM-Maturity-Big-Data-Stats-Basics-3cf72335ea184688937aaafd726b95d8?source=copy_link)

**Throughout the week, you will learn to:**
1. Systematically analyse a range of organisational scenarios to identify key strategic issues.
2. Develop advanced problem-solving skills by applying statistical thinking.
3. Design effective solution paths through systematic critical thinking and problem-solving techniques.
4. Critically compare and evaluate a range of solutions to design effective outcomes.


[**Activity 1.1.5** Using statistical thinking to analyse and define a business problem from a given scenario](https://colab.research.google.com/drive/1pTH0aOluJLr3YRuW56MHMOFwKlxJMtMy?usp=sharing)

[**Activity 1.2.3** Developing a hypothesis and presenting a proposed solution](https://colab.research.google.com/drive/1PJCm4m1KXCf3S1ZBBJ9p0OiGq2YsXghl?usp=sharing)

[**Activity 1.2.5** Using feature engineering as a problem-solving tool](https://colab.research.google.com/drive/1N1-L5JYeC7Ne7rt6ZWpzxVJp1RDdUfrh?usp=sharing)


## Week 1 Reflection
Now that you've completed Week 1, it's time to reflect on your learning.

**Reflection prompts:**
1. What was the most interesting part of this week's learning? Why?
2. What did you find the most challenging? Why?
3. What, so far, will be the most applicable to your current or future career? Why?

> Type your reflection here:


The most interesting part was using the 5 Whys and CRISP-DM framework to transform "messy" qualitative data—like angry customer comments about wait times—into a structured SMART research question. It was fascinating to see how raw human emotion in feedback can be decoded into measurable data points like Velocity (response times) and Veracity (billing accuracy) to reveal systemic business failures.


The biggest challenge was distinguishing between symptoms and root causes. It is easy to look at a complaint about a "rude staff member" and think the solution is more training. However, the data suggested the "rudeness" was a symptom of Support Scalability issues—staff were overwhelmed by a high volume of queries caused by systemic billing errors. Learning to look past the surface-level emotion to find the underlying data-driven problem required a significant shift in perspective.

The concept of Feature Engineering as a problem-solving tool is immediately applicable. In any business role, we are often overwhelmed with metrics, but this week taught me to prioritize features that mirror the Customer Journey. Being able to select 5–7 key variables that explain why a customer leaves, rather than just that they left, is a skill that turns a data analyst into a strategic partner who provides actionable wisdom rather than just numbers.

# Week 2: Applying advanced statistical techniques for data science

**Throughout the week, you will learn to:**
1. Develop proficiency in hypothesis testing using Python for data-driven decision-making.
2. Investigate data systematically to identify causal links, account for biases, and assess confounding variables.
3. Interpret statistical test results rigorously, identifying and addressing assumptions and limitations.
4. Critically analyse outputs to evaluate predictive models, accurately interpreting relevant metrics to ensure generalisability.


### 2.1 Hypothesis Testing
* [Demonstration 2.1.2: Testing hypotheses with t-tests, ANOVA, and chi-square](https://colab.research.google.com/drive/1ycWjElyZNje23jB86L3Kf6MKVsI8I2qq?usp=sharing)

---

### 2.2 Correlation
* [Demonstration 2.2.1: Measuring correlation](https://colab.research.google.com/drive/1AOXRSRWaBFndVpezc1b2rNbLDmOBihP8?usp=sharing)
* [Demonstration 2.2.2: Pitfalls and best practices](https://colab.research.google.com/drive/1aWvvk9gi92DdBOUHVEBwTxbg2s5oa_8F?usp=sharing)
* [Activity 2.2.3: Interpreting correlation](https://colab.research.google.com/drive/1exFgkWDb2ToylLp10WQp5K26eQ27JggV?usp=sharing)

---

### 2.3 Building Models
* [Activity 2.3.5: Building models and interpreting results](https://colab.research.google.com/drive/1DRbt6phCwXuOfKkA0ai_6a7YvtgjPu0c?usp=sharing)


## Week 2 Reflection
Now that you have completed Week 2, it's time to reflect on your learning.

**Reflection prompts:**
1. What was the most interesting part of this week's learning? Why?
2. What did you find the most challenging? Why?
3. What, so far, will be the most applicable to your current or future career? Why?

> Type your reflection here:

# Week 3: Taking a critical approach to selecting statistical techniques

**Throughout the week, you will learn to:**
1. Analyse ethical issues in data science environments to advocate for appropriate solutions.
2. Analyse diverse data characteristics to accurately apply parametric and non-parametric methods.
3. Critically compare and select appropriate statistical tests, ensuring robustness and accuracy in analysis.

Insert your activity links here:

## Week 3 Reflection
Now that you have completed Week 3, it's time to reflect on your learning.

**Reflection prompts:**
1. What was the most interesting part of this week's learning? Why?
2. What did you find the most challenging? Why?
3. What, so far, will be the most applicable to your current or future career? Why?

> Type your reflection here:

# Week 4: Engineering features and reducing dimensions

**Throughout the week, you will learn to:**
1. Systematically apply specialist methodological approaches to extract relevant features from diverse data sets.
2. Tailor feature engineering methods to address specific data challenges, optimising feature representation for accurate analysis of complex problems.
3. Synthesise and perform dimensionality reduction to effectively reduce data dimensions while retaining meaningful patterns.
4. Critically identify and define the strengths and limitations of applying principal component analysis and t-SNE.

Insert your activity links here:

## Week 4 Reflection
Now that you've completed Week 4, it's time to reflect on your learning.

**Reflection prompts:**
1. What was the most interesting part of this week's learning? Why?
2. What did you find the most challenging? Why?
3. What, so far, will be the most applicable to your current or future career? Why?

> Type your reflection here:

# Week 5: Detecting anomalies with unsupervised learning

**Throughout the week, you will learn to:**
1. Synthesise the concept of anomaly detection and define its importance in data science and organisational applications.
2. Systematically select and apply statistical methods such as z-score and IQR to detect anomalies in complex data sets.
3. Critically evaluate the effectiveness of ML methods to detect anomalies in complex data sets.
4. Critically select and apply advanced anomaly detection principles, concepts, and approaches to create innovative solutions to a real-world organisational scenario.

Insert your activity and mini-project links here:

## Week 5 Reflection
Now that you have completed Week 5, it's time to reflect on your learning.

**Reflection prompts:**
1. What was the most interesting part of this week's learning? Why?
2. What did you find the most challenging? Why?
3. What, so far, will be the most applicable to your current or future career? Why?

> Type your reflection here:

# Week 6: Performing clustering with unsupervised learning

**Throughout the week, you will learn to:**
1. Synthesise and apply K-means clustering, including logic, partitioning methods, and the optimal number of clusters.
2. Synthesise and apply the principles of Hierarchical Clustering, including agglomerative approaches, distance metrics, and cluster evaluation.
3. Critically select and apply advanced clustering principles, concepts, and approaches to create innovative solutions to a real-world organisational scenario.


Insert your activity and mini-project links here:

## Week 6 Reflection
Now that you have completed Week 6, it's time to reflect on your learning.

**Reflection prompts:**
1. What was the most interesting part of this week's learning? Why?
2. What did you find the most challenging? Why?
3. What, so far, will be the most applicable to your current or future career? Why?

> Type your reflection here:

# Course 101: Reflection
Now that you have completed Course 101, it's time to reflect on your learning.

**Reflection prompts:**
1. What are the key concepts and skills you have learned in this course? Why?
2. How do you foresee applying the concepts and skills you acquired in real-world situations or future endeavors? Why?
3. Can you identify any challenges you faced during this course and explain how you overcame them?
4. How has this course influenced your perspective or understanding of applying statistics and core data science techniques in business? Why?

> Type your reflection here:


