#### Module 5: Prescriptive Analytics  

>- Creating data for analytics through **designed experiments**  
>- Creating data for analytics through **active learning**  
>- Creating data for analytics through **reinforcement learning**

---

# **Module 5: Prescriptive Analytics**

## **1. Introduction to Prescriptive Analytics**

### **What is Prescriptive Analytics?**

Prescriptive analytics is the most advanced stage of analytics. It combines data insights, optimization techniques, and machine learning to produce actionable recommendations: it not only predicts future outcomes but also suggests which course of action is most likely to achieve the desired result.

It answers the question: "What should be done next?" to achieve the desired outcome.

### **Types of Analytics: A Quick Overview**

| **Type**     | **Question Answered**       | **Techniques Used**              | **Example**                            |
| ------------ | --------------------------- | -------------------------------- | -------------------------------------- |
| Descriptive  | What happened?              | Data Aggregation, Visualization  | Website traffic reports                |
| Diagnostic   | Why did it happen?          | Root Cause Analysis, Correlation | Analyzing reasons for sales drop       |
| Predictive   | What will happen?           | Machine Learning, Time Series    | Stock price forecasting                |
| Prescriptive | What should we do about it? | Optimization, Simulation, RL     | AI-driven treatment plans for patients |

💡 **Prescriptive analytics uses advanced techniques like decision optimization, reinforcement learning, and experimental design to recommend the best actions.**

---

## **2. Creating Data for Analytics**

For effective prescriptive analytics, properly generated data is essential. Three primary approaches to generating data for prescriptive analytics are:

1. **Designed Experiments:** Systematic control of variables to test outcomes.
2. **Active Learning:** Selectively collecting data based on model uncertainties.
3. **Reinforcement Learning:** Learning optimal actions through trial and error.

---

## **2.1. Designed Experiments**

### **What Are Designed Experiments?**

A **designed experiment** is a systematic way of testing how one or more variables (called independent variables) affect an outcome (called the dependent variable). It is used to uncover **cause-and-effect relationships** in various fields.

### **Key Features:**

✔ **Controlled Environment:** Minimizes the impact of external factors.  
✔ **Randomization:** Assigns participants or units to groups by chance, which guards against selection bias.  
✔ **Replication:** Repeats the experiment on multiple units so that random variation can be estimated and results are more reliable.  
✔ **Optimization:** Helps find the best settings in complex environments.  

### **Real-Life Example:**

📌 **Clinical Drug Trials:**

* A pharmaceutical company tests a new diabetes drug.
* They divide participants into two groups:

  * **Group A (Treatment Group):** Gets the new drug.
  * **Group B (Control Group):** Gets a placebo (fake pill).
* After a period, they compare health results between the two groups to see if the drug works.

### **Steps in Designed Experiments:**

1. **Define the Problem:** Clearly state what you want to test.
2. **Identify Variables:**

   * **Independent Variables:** The factors you manipulate (e.g., drug dosage).
   * **Dependent Variable:** The outcome you measure (e.g., blood sugar level).
3. **Establish Groups:** Create control and treatment groups (randomized).
4. **Apply Interventions:** Conduct the experiment and collect data.
5. **Analyze Results:** Use statistical methods (e.g., ANOVA, regression).
6. **Draw Conclusions:** Interpret results and optimize the process.
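
The steps above can be sketched as a small simulation. This is a minimal illustration under stated assumptions, not a real trial analysis: the participant count, the blood sugar distribution, and the true effect of -10 mg/dL are made-up values, and `run_trial` and `permutation_p_value` are hypothetical helper names. A permutation test is used here in place of ANOVA or regression because it needs only the standard library.

```python
import random
import statistics

def run_trial(n_participants=200, true_effect=-10.0, seed=0):
    """Simulate a two-group randomized experiment (all numbers are made up)."""
    rng = random.Random(seed)
    ids = list(range(n_participants))
    rng.shuffle(ids)                                  # randomization step
    treatment_ids = set(ids[: n_participants // 2])
    treatment, control = [], []
    for pid in range(n_participants):
        baseline = rng.gauss(150.0, 15.0)             # hypothetical blood sugar, mg/dL
        if pid in treatment_ids:
            treatment.append(baseline + true_effect)  # intervention applied
        else:
            control.append(baseline)                  # placebo group
    return treatment, control

def permutation_p_value(a, b, n_permutations=2000, seed=1):
    """Two-sided permutation test on the difference in group means."""
    rng = random.Random(seed)
    observed = abs(statistics.mean(a) - statistics.mean(b))
    pooled = a + b                                    # new list; a and b stay intact
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)                           # pretend group labels are arbitrary
        diff = abs(statistics.mean(pooled[:len(a)]) - statistics.mean(pooled[len(a):]))
        if diff >= observed:
            hits += 1
    return (hits + 1) / (n_permutations + 1)

treatment, control = run_trial()
effect = statistics.mean(treatment) - statistics.mean(control)
p_value = permutation_p_value(treatment, control)
print(f"estimated effect: {effect:.2f} mg/dL, p-value: {p_value:.4f}")
```

A small p-value suggests that the difference between the groups is unlikely to be due to chance alone, which is the evidence used in the "Draw Conclusions" step.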

### **Advantages:**

✔ Establishes clear cause-and-effect relationships.  
✔ Minimizes the influence of external factors (controlled setting).  
✔ Useful in optimizing processes in fields like medicine, manufacturing, etc.  

### **Challenges:**

❌ Can be expensive and time-consuming.  
❌ Ethical concerns (especially in medical trials).  
❌ May require complex statistical analysis.  

---

## **2.2. Active Learning**

### ✅ **What is Active Learning?**

**Active Learning** is a type of **machine learning** where the model doesn't just passively learn from data; it **actively chooses the most useful data to learn from**.

💡 Think of it as a smart student who asks questions instead of reading the whole book.

### 🧠 **Why Use Active Learning?**

* Labeling data is often **expensive and time-consuming** (e.g., medical images).
* Active Learning helps you learn effectively with **fewer labeled examples**.
* Best when you have **lots of unlabeled data**, but labeling is costly.

### 🔄 **How Active Learning Works (Step-by-Step)**

1. Start with a **small set of labeled data**.
2. Train an initial model using this data.
3. Use the model to **make predictions on unlabeled data**.
4. Identify which data the model is **most uncertain about**.
5. Request labels for these uncertain examples from an **expert (oracle)**.
6. Add the new labeled data to the training set.
7. Repeat until the model performs well.
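
The loop above can be sketched with a toy problem. This is a minimal illustration, not a production setup: the data are random numbers between 0 and 1, the "model" is a single cut-off value, and `oracle`, `fit_threshold`, `active_learning`, and `TRUE_THRESHOLD` are hypothetical names introduced for the example. A fixed query budget stands in for "repeat until the model performs well".

```python
import random

TRUE_THRESHOLD = 0.62  # hidden from the model; only the oracle knows it

def oracle(x):
    """Stand-in for the human expert: returns the correct label."""
    return int(x > TRUE_THRESHOLD)

def fit_threshold(labeled):
    """Train a 1-D classifier: cut-off halfway between the two classes."""
    negatives = [x for x, y in labeled if y == 0]
    positives = [x for x, y in labeled if y == 1]
    return (max(negatives) + min(positives)) / 2

def active_learning(pool, n_queries=8):
    # Step 1: start with a small labeled set (the two extremes of the pool).
    lo, hi = min(pool), max(pool)
    labeled = [(lo, oracle(lo)), (hi, oracle(hi))]
    unlabeled = [x for x in pool if x not in (lo, hi)]
    for _ in range(n_queries):
        threshold = fit_threshold(labeled)        # Step 2: train the model
        # Steps 3-4: the point closest to the cut-off is the least certain.
        query = min(unlabeled, key=lambda x: abs(x - threshold))
        labeled.append((query, oracle(query)))    # Steps 5-6: ask the oracle, add label
        unlabeled.remove(query)
    return fit_threshold(labeled), len(labeled)   # Step 7 uses a fixed budget here

rng = random.Random(42)
pool = [0.0, 1.0] + [rng.random() for _ in range(500)]
threshold, n_labels = active_learning(pool)
print(f"learned threshold {threshold:.3f} using {n_labels} of {len(pool)} possible labels")
```

Because each query lands near the current decision boundary, the model narrows in on the hidden cut-off after labeling only a handful of points instead of the whole pool.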

### 🔑 **Key Concepts:**

* **Labeled Data:** Data with answers (e.g., an image of a cat with the label "Cat").
* **Unlabeled Data:** Data without answers.
* **Oracle:** The expert who provides correct labels (e.g., a doctor).
* **Query Strategy:** The method the model uses to choose which data to ask labels for.

### 🧪 **Types of Query Strategies:**

| **Strategy**              | **How It Works**                                                  |
| ------------------------- | ----------------------------------------------------------------- |
| **Uncertainty Sampling**  | Selects data where the model is most unsure.                      |
| **Query by Committee**    | Uses multiple models to choose data where they disagree the most. |
| **Expected Model Change** | Picks data that would most change the model if labeled.           |
| **Diversity Sampling**    | Chooses a wide variety of data to avoid bias.                     |
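
To make "most unsure" and "disagree the most" concrete, here is a sketch of scores commonly used for uncertainty sampling and query by committee. The function names and the example probabilities are assumptions made for illustration; real libraries may define these measures slightly differently.

```python
import math

def least_confidence(probs):
    """1 minus the top class probability; higher means less certain."""
    return 1.0 - max(probs)

def margin(probs):
    """Gap between the two most likely classes; smaller means less certain."""
    top, second = sorted(probs, reverse=True)[:2]
    return top - second

def entropy(probs):
    """Shannon entropy in bits; higher means less certain."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def vote_entropy(votes):
    """Query-by-committee disagreement: entropy of the committee's label votes."""
    counts = {label: votes.count(label) for label in set(votes)}
    return entropy([c / len(votes) for c in counts.values()])

# Hypothetical predicted probabilities [spam, not spam] for three emails.
predictions = {
    "email_1": [0.98, 0.02],
    "email_2": [0.55, 0.45],
    "email_3": [0.80, 0.20],
}
most_uncertain = max(predictions, key=lambda k: entropy(predictions[k]))
print(most_uncertain)  # the email whose prediction is closest to a coin flip
```

With these numbers, `email_2` is selected because its two probabilities are nearly equal, so a label for it is expected to teach the model the most.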

### 🌍 **Real-Life Applications:**

| **Field**                | **Example**                                  |
| ------------------------ | -------------------------------------------- |
| **Healthcare**           | Label only unclear medical images.           |
| **Spam Detection**       | Focus on emails that are hard to classify.   |
| **Autonomous Vehicles**  | Request labels for tricky driving scenarios. |
| **Image Classification** | Label images the model is unsure about.      |

### ✅ **Benefits:**

* Saves time and labeling costs.
* Learns faster by focusing only on important data.
* Improves model accuracy with fewer examples.

### ⚠️ **Challenges:**

* Requires a reliable oracle (expert).
* Results depend on choosing a query strategy that suits the data and the model.
* Can be slow if the model is retrained after every single query.

### 💡 **Simple Example: Learning Fruits**

Imagine you are teaching a model to recognize fruits:

* Start with 5 labeled fruit images (apple, banana, etc.).
* The model tries to classify others but is unsure about a peach.
* It asks you (the oracle) to label the peach.
* With your answer, it learns faster without labeling everything.

### 📝 **Summary:**

| **Aspect**         | **Details**                                   |
| ------------------ | --------------------------------------------- |
| **Learning style** | Learns by choosing what to learn.             |
| **Goal**           | Label less, learn more.                       |
| **Best for**       | When labeling is expensive or time-consuming. |
| **Smart move**     | Focus on uncertain or confusing data points.  |

---

## **2.3. Reinforcement Learning (RL)**

### ✅ **What is Reinforcement Learning?**

Reinforcement Learning (RL) is a type of **machine learning** where an **agent (learner)** interacts with an **environment**, takes **actions**, and learns from **rewards (good)** or **penalties (bad)** it receives.

💡 Think of it like training a dog: you give it treats (rewards) for good behavior and say "No!" (penalty) for bad behavior.

### 🧠 **How Reinforcement Learning Works:**

1. The **Agent** observes the current **State** of the environment.
2. It takes an **Action**.
3. The **Environment** responds with:

   * A **new State**
   * A **Reward** (positive or negative)
4. The agent learns from this feedback to make better decisions next time.
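
The four steps above map directly onto a training loop. The sketch below is a minimal illustration on a made-up environment: a corridor of five cells where the agent starts in cell 0 and earns a reward for reaching cell 4. The environment, the reward values, and the settings `alpha`, `gamma`, and `epsilon` are all assumptions chosen for the example; the learning rule in step 4 is tabular Q-learning.

```python
import random

N_STATES, GOAL = 5, 4        # corridor cells 0..4; reaching cell 4 ends the episode
ACTIONS = (-1, +1)           # move left or move right

def step(state, action):
    """Environment: returns (new_state, reward, done)."""
    new_state = min(max(state + action, 0), N_STATES - 1)
    if new_state == GOAL:
        return new_state, 1.0, True      # positive reward at the goal
    return new_state, -0.01, False       # small penalty for every other move

def train(episodes=300, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Steps 1-2: observe the state and choose an action (mostly greedy).
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])
            # Step 3: the environment responds with a new state and a reward.
            new_state, reward, done = step(state, action)
            # Step 4: learn from the feedback (tabular Q-learning update).
            best_next = 0.0 if done else max(q[(new_state, a)] for a in ACTIONS)
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state = new_state
    return q

q = train()
policy = {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(GOAL)}
print(policy)  # learned action for each non-goal cell
```

After training, the learned policy moves right from every cell, because that is the shortest route to the reward.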

### 🔑 **Key Terminologies:**

| **Term**             | **Meaning**                                                       |
| -------------------- | ----------------------------------------------------------------- |
| **Agent**            | The learner or decision-maker (e.g., robot, AI model).            |
| **Environment**      | The world where the agent operates (e.g., game, real-world task). |
| **State (S)**        | The current situation of the agent.                               |
| **Action (A)**       | What the agent chooses to do.                                     |
| **Reward (R)**       | Feedback from the environment (positive = good, negative = bad).  |
| **Policy (π)**       | The strategy the agent follows to choose actions.                 |
| **Value Function**   | The total future reward the agent can expect starting from a state. |
| **Q-Value (Q(s,a))** | The total future reward expected after taking action 'a' in state 's'. |
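
In standard RL notation, the value function and the Q-value are usually written in terms of the discounted return. The return $G_t$ and the discount factor $\gamma$ are not named in this module; they are introduced here as assumptions to make the two definitions precise.

```latex
\begin{aligned}
G_t &= \sum_{k=0}^{\infty} \gamma^{k} R_{t+k+1}, \qquad 0 \le \gamma < 1 \\
V^{\pi}(s) &= \mathbb{E}_{\pi}\left[\, G_t \mid S_t = s \,\right] \\
Q^{\pi}(s, a) &= \mathbb{E}_{\pi}\left[\, G_t \mid S_t = s,\ A_t = a \,\right]
\end{aligned}
```

Both quantities measure long-run reward rather than the immediate reward alone: $V^{\pi}$ scores a state, while $Q^{\pi}$ scores a specific action taken in that state.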

### 🔁 **Types of Reinforcement Learning:**

| **Type**           | **Explanation**                                              |
| ------------------ | ------------------------------------------------------------ |
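| **Positive RL**    | A reward is added after a desired action, making that action more likely. |
| **Negative RL**    | An unpleasant condition is removed after a desired action, making that action more likely. (A penalty for a bad action is punishment, not negative reinforcement.) |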
| **Model-Based RL** | The agent builds a model of the environment to plan actions. |
| **Model-Free RL**  | The agent learns directly from experience without a model.   |

### 🧮 **Popular RL Algorithms:**

| **Algorithm**            | **How It Works**                                                |
| ------------------------ | --------------------------------------------------------------- |
| **Q-Learning**           | Off-policy: updates each Q-value toward the reward plus the best estimated value of the next state. |
| **SARSA**                | On-policy: similar to Q-Learning, but updates toward the value of the action actually taken next. |
| **Deep Q Network (DQN)** | Uses neural networks with Q-Learning for complex problems.      |
| **Policy Gradient**      | Directly learns the best policy (not values).                   |
| **Actor-Critic**         | Combines policy-based and value-based methods.                  |
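
The difference between Q-Learning and SARSA comes down to which next-step value is used as the learning target. The sketch below illustrates this with made-up numbers; the function names, the discount factor `gamma`, the learning rate `alpha`, and the Q-values are all assumptions for the example.

```python
def q_learning_target(reward, next_q_values, gamma=0.9):
    """Off-policy target: assumes the best next action will be taken."""
    return reward + gamma * max(next_q_values.values())

def sarsa_target(reward, next_q_values, next_action, gamma=0.9):
    """On-policy target: uses the action the agent actually takes next."""
    return reward + gamma * next_q_values[next_action]

def td_update(old_q, target, alpha=0.1):
    """Move the old estimate a fraction alpha toward the target."""
    return old_q + alpha * (target - old_q)

next_q = {"left": 0.2, "right": 1.0}   # hypothetical Q-values in the next state
reward = 0.5

# Q-Learning looks at the best next action ("right"), whatever the agent does.
print(q_learning_target(reward, next_q))
# SARSA looks at the action the agent really chose, here an exploratory "left".
print(sarsa_target(reward, next_q, next_action="left"))
```

When the agent explores and picks a weaker action, SARSA's target is lower than Q-Learning's, which is why SARSA tends to learn more cautious behavior while exploration is still happening.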

### 🌍 **Real-Life Applications:**

| **Field**             | **Example**                                    |
| --------------------- | ---------------------------------------------- |
| **Gaming**            | Training AI to play chess, Go, or video games. |
| **Robotics**          | Teaching robots to walk, grab, or clean.       |
| **Self-Driving Cars** | Learning to drive safely by trial and error.   |
| **Healthcare**        | Optimizing treatment plans for patients.       |
| **Finance**           | Smart stock trading and investment bots.       |

### ✅ **Benefits:**

* Can learn without labeled data.
* Great for complex, dynamic environments.
* Solves problems with a sequence of decisions.

### ⚠️ **Challenges:**

* Needs a lot of interactions (time-consuming).
* May get stuck in bad strategies (local optimum).
* Balancing exploration (try new actions) vs exploitation (use what works) is tough.
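
One common way to handle the exploration vs exploitation trade-off is the epsilon-greedy rule: with a small probability the agent tries a random action, and otherwise it uses the action that currently looks best. The sketch below applies it to a made-up three-option problem (a "multi-armed bandit"); the average payoffs, `epsilon`, and the function name are assumptions for illustration.

```python
import random

def epsilon_greedy_bandit(true_means, epsilon=0.1, steps=5000, seed=0):
    """Repeatedly choose among options with unknown average payoffs."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    estimates = [0.0] * n_arms     # current guess of each option's payoff
    counts = [0] * n_arms          # how often each option was tried
    total_reward = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)                           # explore
        else:
            arm = max(range(n_arms), key=lambda i: estimates[i])  # exploit
        reward = rng.gauss(true_means[arm], 1.0)                  # noisy payoff
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]  # running mean
        total_reward += reward
    return estimates, counts, total_reward / steps

estimates, counts, avg_reward = epsilon_greedy_bandit([0.2, 0.5, 1.5])
print(counts)  # how many times each option was chosen
```

Without the exploration branch, the agent could settle on the first option that happens to pay off and never discover the better one, which is the "stuck in a bad strategy" problem listed above.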

### 🧠 **Simple Example: Training a Dog**

* **Agent:** Dog.
* **Environment:** Home.
* **Action:** Sit, Jump, Bark.
* **Reward:** Treat (for sitting), No treat (for barking).
* Over time, the dog learns that sitting = treat, barking too much = no treat.

### 📝 **Summary:**

| **Aspect**         | **Details**                                         |
| ------------------ | --------------------------------------------------- |
| **Learning style** | Trial-and-error based.                              |
| **Feedback**       | Rewards (positive) and Penalties (negative).        |
| **Goal**           | Maximize total reward over time.                    |
| **Best for**       | Games, robotics, automation, self-learning systems. |

---

### 🔚 **Summary Table**

| **Concept**                | **Meaning & Main Point**                                                                                                | **Real-Life Examples**                                                                  |
| -------------------------- | ----------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------- |
| **Designed Experiments**   | - Change chosen factors under controlled conditions and measure the effect. <br> - Plan, test, and see what happens.    | - Trying new fertilizers <br> - Testing new medicines <br> - A/B testing on websites    |
| **Active Learning**        | - Smart learning by asking questions. <br> - The model asks for the most useful examples to learn.                      | - Detecting spam emails <br> - Training a chatbot <br> - Diagnosing medical images      |
| **Reinforcement Learning** | - Learning by doing, with rewards/punishments. <br> - Learn from experience, like training a pet.                       | - Self-driving cars <br> - Playing chess or video games <br> - Training a robot to walk |

---

### **1. Designed Experiments**

**What It Means:**
In designed experiments, you deliberately change one or more factors under controlled conditions and observe the effect on the outcome. The main goal is to plan a controlled test, vary only the chosen factors, and measure the result so that any change can be attributed to them.

**Examples:**

* **Trying New Fertilizers:**
  Farmers might use different fertilizers on the same type of crop to observe how each one affects growth. For example, they could test fertilizer A against fertilizer B to see which one helps the plants grow taller or produce more fruit.

* **Testing New Medicines:**
  When a pharmaceutical company creates a new medicine, they first test it on a small group of patients to see how effective it is and if there are any side effects. This helps to determine whether it should be approved for widespread use.

* **A/B Testing on Websites:**
  Websites often test two versions (A and B) of a page to see which one works better. For example, an e-commerce website may change the color of a "Buy Now" button and show version A to half of the visitors and version B to the other half. By comparing how many purchases are made in each case, they can determine which button color is more effective.
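
Comparing the two button colors is usually done with a statistical test rather than by eye. The sketch below uses a two-proportion z-test, one common choice for A/B tests; the visitor and purchase counts are hypothetical, and `two_proportion_z_test` is a helper name introduced for this example.

```python
import math

def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Two-sided z-test for a difference in conversion rates."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    pooled = (conv_a + conv_b) / (n_a + n_b)           # rate if A and B were identical
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))         # two-sided normal tail area
    return z, p_value

# Hypothetical counts: purchases out of visitors for each button color.
z, p_value = two_proportion_z_test(conv_a=120, n_a=2400, conv_b=156, n_b=2400)
print(f"z = {z:.2f}, p-value = {p_value:.4f}")
```

A small p-value suggests that the difference in purchase rates is unlikely to be random noise, so the better-performing version can be rolled out with more confidence.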

---

### **2. Active Learning**

**What It Means:**
Active learning is a smart way for a model to learn by asking questions. Instead of learning from everything, it focuses on asking for the most useful examples to improve its performance faster.

**Examples:**

* **Detecting Spam Emails:**
  A spam filter might look at emails and identify which ones are likely spam based on patterns. When it's uncertain (e.g., a new type of spam), the filter may ask for feedback. A user can tell it whether an email is spam or not, helping the filter improve faster by learning from only the most relevant examples.

* **Training a Chatbot:**
  A chatbot learns to improve its answers by asking questions. If it receives an unclear or complex query, it might ask the user to rephrase or clarify their question. By learning from these interactions, the chatbot gets better at responding in the future.

* **Diagnosing Medical Images:**
  A machine learning model used to analyze medical images (like X-rays or MRIs) can be uncertain about certain images. In those cases, it might ask a doctor to label or classify the image. By getting this human feedback, the model improves its ability to diagnose images more accurately in the future.

---

### **3. Reinforcement Learning**

**What It Means:**
Reinforcement Learning is a type of learning where an agent (like a robot or AI) learns by interacting with its environment. The agent takes actions, receives rewards or punishments based on those actions, and uses the feedback to improve its decisions.

**Examples:**

* **Self-driving Cars:**
  A self-driving car learns to navigate by trial and error. When it stops at a red light, it receives a "reward" (it stays on track). If it runs a red light, it gets a "punishment" (an accident or traffic violation). Over time, the car learns the best actions to take in different traffic scenarios to ensure safety.

* **Playing Chess or Video Games:**
  An AI that plays chess or a video game learns by playing many games. It receives rewards when it wins and penalties when it loses. With each game, the AI refines its strategies to increase its chances of winning in the future.

* **Training a Robot to Walk:**
  A robot learns how to walk by attempting different movements. Every time it takes a step without falling, it receives a reward (it progresses). If it falls, it gets a penalty (it has to start over). Over time, the robot improves its walking skills by learning from these successes and failures.

---