## 1. Fine-Tuning vs. RAG for Improving LLM Performance

### Question
We considered an LLM for this purpose, but according to subject matter experts, the treatments it prescribes are not aligned with the precise treatments required in emergency medicine. We have a static dataset of emergency cases with the correct diagnoses (labels). Explain which approach, fine-tuning or RAG, is suitable for improving the performance of the LLM.

### Answer
**Recommendation: Fine-Tuning**
- Fine-tuning updates the LLM’s internal weights using the static dataset, enabling the model to better align with precise emergency medicine treatments.
- Because the dataset is static and labeled with correct diagnoses, it can be used directly as supervised training examples to improve the model's performance.
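
As a rough illustration of how a labeled, static dataset could be prepared for supervised fine-tuning, the sketch below converts cases into prompt/completion records in JSONL form. The field names (`description`, `diagnosis`, `prompt`, `completion`) and the case contents are placeholders chosen for this example, not a format required by any particular training framework.

```python
import json

def to_training_records(cases):
    """Turn labeled cases into prompt/completion pairs (hypothetical schema)."""
    return [
        {
            "prompt": f"Emergency case: {case['description']}\nDiagnosis:",
            "completion": " " + case["diagnosis"],
        }
        for case in cases
    ]

# Placeholder cases; a real dataset would hold the expert-labeled emergency cases.
cases = [
    {"description": "<case description 1>", "diagnosis": "<correct diagnosis 1>"},
    {"description": "<case description 2>", "diagnosis": "<correct diagnosis 2>"},
]

# One JSON object per line, a common layout for fine-tuning data.
jsonl = "\n".join(json.dumps(record) for record in to_training_records(cases))
```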

#### Advantages over RAG:
- Fine-tuning lets the model internalize the domain knowledge, so its answers do not depend on an external retrieval step at inference time.
- RAG adds complexity because it requires maintaining and querying an external knowledge base, and retrieving relevant cases does not by itself guarantee that the generated treatments improve.


## 2. Storing Real-Time Treatment Procedures: Fine-Tuning vs. RAG

### Question
Suppose that we receive real-time treatment procedures (sometimes containing images) for different diseases from health authorities on the internet. When the LLM diagnoses a disease, it should also provide doctors with the relevant treatment procedures for that disease. Explain which approach, fine-tuning or RAG, is the best way to store these procedures and what advantages the proposed solution has over the other.

### Answer
**Recommendation: RAG (Retrieval-Augmented Generation)**
- RAG allows the integration of dynamic, real-time treatment procedures from health authorities by storing them in an external, easily updatable knowledge base.
- This approach supports the inclusion of multimedia content, such as images, without requiring changes to the LLM.

#### Advantages over Fine-Tuning:
- Fine-tuning is time-intensive and requires retraining the model each time new treatment data is received.
- RAG enables quick updates to the knowledge base and ensures the LLM provides the most up-to-date information without retraining.
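
The update-then-retrieve pattern can be sketched as follows. This is a toy example under stated assumptions: `Procedure`, `ProcedureStore`, and `build_prompt` are hypothetical names, the procedure text and image path are placeholders, and an exact-match lookup by disease name stands in for the similarity search a real RAG system would typically use.

```python
from dataclasses import dataclass, field

@dataclass
class Procedure:
    disease: str
    text: str
    image_paths: list = field(default_factory=list)

class ProcedureStore:
    """In-memory knowledge base keyed by disease name (hypothetical)."""

    def __init__(self):
        self._by_disease = {}

    def upsert(self, procedure):
        # A newer procedure replaces the old one; the LLM itself is untouched.
        self._by_disease[procedure.disease.lower()] = procedure

    def retrieve(self, diagnosis):
        return self._by_disease.get(diagnosis.lower())

def build_prompt(diagnosis, store):
    """Attach the currently stored procedure to the context given to the LLM."""
    procedure = store.retrieve(diagnosis)
    if procedure is None:
        return f"Diagnosis: {diagnosis}\nNo stored procedure found."
    images = ", ".join(procedure.image_paths) or "none"
    return (
        f"Diagnosis: {diagnosis}\n"
        f"Current procedure: {procedure.text}\n"
        f"Attached images: {images}"
    )

store = ProcedureStore()
store.upsert(Procedure("Sepsis", "placeholder procedure text v1"))
# A later update from the health authority overwrites the earlier entry.
store.upsert(Procedure("Sepsis", "placeholder procedure text v2", ["sepsis_flowchart.png"]))
prompt = build_prompt("sepsis", store)
```

Only the store changes when a new procedure arrives, which is the property that makes RAG attractive for frequently updated content.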

## 3. Prompt and Decoding Analysis

### Question
Prompt: A patient is experiencing a high fever, low blood pressure, and a headache. What is the most probable cause of these symptoms?

Suppose that the probabilities of the LLM's possible responses are as follows:

P(Migraine)=45%, P(Sepsis)=25%, P(Diabetes)=15%, P(Heart Attack)=10%, P(Diarrhea)=5%

### a) Question
What are probable responses of the LLM if K=4 is used in Top-K approach?

### Answer
With **K=4**, the LLM samples only from the 4 most probable responses, so Diarrhea (5%) is excluded and any of the following can be returned:
- Migraine (45%)
- Sepsis (25%)
- Diabetes (15%)
- Heart Attack (10%)
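
The Top-K selection above can be sketched in a few lines. The function name `top_k_filter` is illustrative; the renormalization step reflects how sampling libraries commonly rescale the surviving probabilities so they sum to 1, which the question itself does not ask for.

```python
def top_k_filter(probs, k):
    """Keep the k most probable responses and renormalize them to sum to 1."""
    ranked = sorted(probs.items(), key=lambda item: item[1], reverse=True)[:k]
    total = sum(p for _, p in ranked)
    return {name: p / total for name, p in ranked}

probs = {
    "Migraine": 0.45,
    "Sepsis": 0.25,
    "Diabetes": 0.15,
    "Heart Attack": 0.10,
    "Diarrhea": 0.05,
}

candidates = top_k_filter(probs, 4)
# Diarrhea is dropped; the remaining four are the possible responses.
```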

### b) Question
What are probable responses of the LLM if P=80% is used in Top-P approach?

### Answer
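With **P=80%**, the LLM adds responses in descending order of probability until the cumulative probability reaches or exceeds 80%:
- Migraine (45%), cumulative 45%
- Sepsis (25%), cumulative 70%
- Diabetes (15%), cumulative 85%

Migraine and Sepsis together reach only 70%, so Diabetes is also included. Probable responses: Migraine, Sepsis, Diabetes.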
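
A minimal sketch of the same Top-P (nucleus) selection, assuming the threshold is treated as "reaches or exceeds". The function name `top_p_filter` is illustrative, and the small tolerance only guards against floating-point rounding.

```python
def top_p_filter(probs, p):
    """Keep the smallest set of top responses whose cumulative probability >= p."""
    ranked = sorted(probs.items(), key=lambda item: item[1], reverse=True)
    kept, cumulative = [], 0.0
    for name, prob in ranked:
        kept.append((name, prob))
        cumulative += prob
        if cumulative >= p - 1e-12:  # tolerance for floating-point error
            break
    total = sum(prob for _, prob in kept)
    return {name: prob / total for name, prob in kept}

probs = {
    "Migraine": 0.45,
    "Sepsis": 0.25,
    "Diabetes": 0.15,
    "Heart Attack": 0.10,
    "Diarrhea": 0.05,
}

nucleus = top_p_filter(probs, 0.80)
# The loop stops after Diabetes, when the cumulative probability reaches 85%.
```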

### c) Question
Suppose that we tested four combinations of decoding parameters (Temperature and K) using the prompt. For each combination of parameters, we ran the prompt three times (iterations) as follows:

Regarding the responses, which of the above combinations would be most suitable for a medical decision support system that needs stable results? Explain why such a setting produces more stable results.

### Answer
The combination with **Temperature = 0.3** and **K = 2** is most suitable for a medical decision support system requiring stability. 

#### Explanation:
- **Lower temperature (0.3):** Sharpens the probability distribution toward the most likely response, which reduces randomness in sampling.
- **Smaller K (K=2):** Restricts sampling to the two most probable responses, which limits variability across iterations.

This setting tends to produce the same or highly similar responses across runs, which makes it the more reliable choice for medical decision support.
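
To make the effect concrete, the sketch below applies temperature to the probabilities from the prompt and then keeps the top 2. Raising each probability to the power 1/T and renormalizing is equivalent to dividing the logits by T before the softmax. The comparison value T=1.0 is an assumption chosen for illustration, and the function names are hypothetical.

```python
def apply_temperature(probs, temperature):
    """Rescale probabilities as p ** (1 / T), then renormalize."""
    scaled = {name: p ** (1.0 / temperature) for name, p in probs.items()}
    total = sum(scaled.values())
    return {name: s / total for name, s in scaled.items()}

def top_k_filter(probs, k):
    """Keep the k most probable responses and renormalize them to sum to 1."""
    ranked = sorted(probs.items(), key=lambda item: item[1], reverse=True)[:k]
    total = sum(p for _, p in ranked)
    return {name: p / total for name, p in ranked}

probs = {
    "Migraine": 0.45,
    "Sepsis": 0.25,
    "Diabetes": 0.15,
    "Heart Attack": 0.10,
    "Diarrhea": 0.05,
}

baseline = top_k_filter(apply_temperature(probs, 1.0), 2)   # T = 1.0, K = 2
sharpened = top_k_filter(apply_temperature(probs, 0.3), 2)  # T = 0.3, K = 2
```

With K=2 alone, Migraine is sampled with probability of roughly 0.64. Lowering the temperature to 0.3 raises that to roughly 0.88, so repeated runs are more likely to agree.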

### d) Question
If a self-consistency approach is applied to all responses in the table to produce a unified diagnosis, what would that diagnosis be?

### Answer
Self-consistency takes a majority vote across all sampled responses. Based on the responses in the table, the most frequent response is **Migraine**.

**Unified Diagnosis:** Migraine
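
The majority vote behind self-consistency can be sketched as below. The `responses` list is a made-up placeholder used only to show the mechanics; it is not the content of the table referenced in the question.

```python
from collections import Counter

def self_consistent_answer(responses):
    """Return the most frequent response and its vote count."""
    counts = Counter(responses)
    answer, votes = counts.most_common(1)[0]
    return answer, votes

# Hypothetical sampled responses, not the actual table data.
responses = ["Migraine", "Sepsis", "Migraine", "Migraine", "Sepsis", "Migraine"]

answer, votes = self_consistent_answer(responses)
```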