Exercise 1: Open Source Levels Reflection

1. Gather Definitions

First, locate the descriptions of the three openness levels.
Copy or highlight each definition in your notes.

Fully Open
    "This is the gold standard of openness. Everything is available:
        The model’s architecture (how it’s built)
        The weights (what it learned during training)
        The training code (how it was trained)
        The training data (what it learned from)
    This level of transparency means:
        Anyone can reproduce the model from scratch.
        Researchers can fully understand and audit the model.
        Developers can fine-tune or retrain it for their own tasks."

Weights Released
    "This is a common middle ground. You get the model’s learned knowledge (its weights), but you don’t get the full picture of how it was trained or on what data.
    This still lets you:
        Run the model in your own applications.
        Fine-tune it on your own dataset.
        Deploy it for specific use cases (within licensing limits).
    However:
        You can’t easily trace back how it was trained.
        There may be hidden biases or issues you can’t fully investigate."

Architecture Only
    "In some cases, only the blueprint of the model is released—its structure, layers, and logic—but no weights or training data.
    This means:
        You can understand the model design.
        You could train it yourself—but you’d need enormous computing power and a large dataset."
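
The three definitions above can be encoded as a small capability table. This is only a sketch: the class, method, and constant names are made up for illustration, and it assumes that a "Weights Released" model also ships enough of its architecture to load the weights, which the quoted definition does not state explicitly.

```python
from dataclasses import dataclass
from enum import Enum


class Artifact(Enum):
    ARCHITECTURE = "architecture"
    WEIGHTS = "weights"
    TRAINING_CODE = "training code"
    TRAINING_DATA = "training data"


@dataclass(frozen=True)
class OpennessLevel:
    name: str
    released: frozenset

    def can_run_as_is(self) -> bool:
        # Running a model needs both its structure and its trained weights.
        return {Artifact.ARCHITECTURE, Artifact.WEIGHTS} <= self.released

    def can_fine_tune(self) -> bool:
        # Fine-tuning starts from trained weights, so it has the same needs.
        return self.can_run_as_is()

    def can_reproduce_training(self) -> bool:
        # Reproducing from scratch needs the training code and the data.
        return {Artifact.TRAINING_CODE, Artifact.TRAINING_DATA} <= self.released


LEVELS = {
    "Fully Open": OpennessLevel("Fully Open", frozenset(Artifact)),
    # Assumption: the architecture is available alongside the weights.
    "Weights Released": OpennessLevel(
        "Weights Released",
        frozenset({Artifact.ARCHITECTURE, Artifact.WEIGHTS}),
    ),
    "Architecture Only": OpennessLevel(
        "Architecture Only", frozenset({Artifact.ARCHITECTURE})
    ),
}

for level in LEVELS.values():
    print(
        f"{level.name}: run={level.can_run_as_is()}, "
        f"fine-tune={level.can_fine_tune()}, "
        f"reproduce={level.can_reproduce_training()}"
    )
```

Under these assumptions, only "Architecture Only" fails the run and fine-tune checks, and only "Fully Open" passes the reproduce check.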


2. Identify Key Characteristics

For each level, write down two bullet points:
What is open? (e.g., code, weights, both)
What you can and cannot do (e.g., retrain, inspect internals).

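Fully Open
    What is open? Everything: architecture, weights, training code, and training data.
    What you can do? Run, fine-tune, retrain from scratch, inspect and audit how it was trained, modify, deploy.
    What you cannot do? Nothing is withheld; the only limits are the license terms and your own compute.

Weights Released
    What is open? Weights (the training code and training data are not released).
    What you can do? Run, fine-tune, deploy (within licensing limits).
    What you cannot do? Reproduce the original training or fully investigate the training data for hidden biases.

Architecture Only
    What is open? The model design (structure, layers, logic); no weights or training data.
    What you can do? Study the design and train it yourself, given enormous computing power and a large dataset.
    What you cannot do? Run, fine-tune, or deploy it as released, because there are no trained weights.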

3. Compare Them Side by Side

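| Level             | What’s Open?                                          | Impact on Retraining/Modifying                                                                |
| ----------------- | ----------------------------------------------------- | --------------------------------------------------------------------------------------------- |
| Fully Open        | Architecture, weights, training code, training data   | You can run, fine-tune, retrain from scratch, audit the training, modify, and deploy.         |
| Weights Released  | Weights                                               | You can run, fine-tune, and deploy. You can't reproduce or audit the original training.       |
| Architecture Only | Model design only                                     | You can study the design and train it yourself (with massive data and compute). You can't run or fine-tune it as released. |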

4. Draft Your Comparative Paragraph

Architecture Only lets you examine the model's design, but without trained weights you cannot use it until you train it yourself, which takes massive data and computing power. Weights Released, on the other hand, does not show you how the model was trained, but having the trained weights lets you run and fine-tune it with far less effort. And of course, Fully Open provides the best of both worlds.

5. Answer the Healthcare Prompt
Since we need to fine-tune the model for a specific topic, we need at least a Weights Released model and ideally a Fully Open one. An Architecture Only release provides no trained weights, so fine-tuning is not possible and we would have to train the model from scratch.

Exercise 2: License Check For SaaS Use

✅ mistralai/Mistral‑7B‑Instruct

Model page: Hugging Face model card (e.g. mistralai/Mistral‑7B‑Instruct‑v0.3)
License section: Apache 2.0 
Commercial use explicitly allowed?
[x] Yes – Apache 2.0 allows commercial use with attribution and no royalties. 
Restrictions:
[ ] Must include Apache 2.0 license text and notices when redistributing
[ ] Attribution required per Apache 2.0 terms
[ ] No further commercial/geo limitations listed on Hugging Face

✅ meta‑llama/Llama‑2‑7b‑chat‑hf

Model page: Hugging Face model card meta‑llama/Llama‑2‑7b‑chat‑hf
License section: Llama 2 Community License (source‑available), not Apache or OSI‑approved.
Commercial use explicitly allowed?
[x] Conditional – permitted for most users, but restricted for certain high‑scale entities.
Restrictions:
[ ] You may not use the model materials or outputs to improve any other LLM (except Llama 2 and its derivatives)
[ ] If your product or service has >700 million monthly active users, you must request a license from Meta; commercial rights not granted until approved. 
[ ] Must retain attribution notice (“Llama 2 is licensed under the Llama 2 Community License…”) when distributing, and provide the license agreement to recipients. 
[ ] Must comply with Meta’s Acceptable Use Policy and export/trade regulations

📝 Markdown Checklist Summary
- [ ] **Mistral‑7B‑Instruct (mistralai/Mistral‑7B‑Instruct‑v0.3)**  
  - Type of license:  
    - [ ] Apache 2.0  
  - Commercial use allowed:  
    - [ ] Yes  
  - Restrictions:  
    - [ ] Must include license text and notices  
    - [ ] Required attribution per Apache 2.0

- [ ] **Llama 2‑7B‑chat (meta‑llama/Llama‑2‑7b‑chat‑hf)**  
  - Type of license:  
    - [ ] Llama 2 Community License (source‑available)  
  - Commercial use allowed:  
    - [ ] Conditional  
  - Restrictions:  
    - [ ] Cannot use outputs to train/improve other LLMs (outside Llama 2 derivatives)  
    - [ ] Entities with >700M MAU must request a separate Meta license  
    - [ ] Must retain attribution notice and include license text in distribution  
    - [ ] Must abide by Meta Acceptable Use Policy and export/trade rules
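
The checklist above can also be written as a small lookup for a SaaS go/no-go check. This is a rough sketch, not legal advice: the record layout, the `saas_status` helper, and the returned strings are hypothetical, and the only numeric rule encoded is the 700 million monthly-active-user threshold noted above for Llama 2.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class LicenseSummary:
    license_name: str
    # None means no user-count condition was noted in the checklist.
    mau_limit: Optional[int]
    obligations: tuple


SUMMARIES = {
    "mistralai/Mistral-7B-Instruct-v0.3": LicenseSummary(
        license_name="Apache 2.0",
        mau_limit=None,
        obligations=(
            "include license text and notices",
            "attribution per Apache 2.0",
        ),
    ),
    "meta-llama/Llama-2-7b-chat-hf": LicenseSummary(
        license_name="Llama 2 Community License",
        mau_limit=700_000_000,
        obligations=(
            "do not use outputs to improve other LLMs",
            "retain attribution notice and include license text",
            "follow Meta Acceptable Use Policy and export/trade rules",
        ),
    ),
}


def saas_status(model_id: str, monthly_active_users: int) -> str:
    """Return a coarse status for commercial SaaS use of a model."""
    summary = SUMMARIES[model_id]
    if summary.mau_limit is not None and monthly_active_users > summary.mau_limit:
        return "needs separate license"
    return "allowed with obligations"


print(saas_status("mistralai/Mistral-7B-Instruct-v0.3", 50_000))
print(saas_status("meta-llama/Llama-2-7b-chat-hf", 50_000))
```

For a typical small SaaS product both calls return "allowed with obligations"; the obligations tuple is what still has to be checked by hand.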

🌟 Exercise 3: LLM Matchmaker Challenge

🧠 1. Analyze Team Needs

LegalTech: CPU-only inference, logic-heavy chatbot → emphasize logical reasoning, fast CPU inference, compact size.
EdTech: Math & logic focus on low-end laptops → need strong math benchmarks, low memory usage, <= 7B params.
Global NGO: Must support 5+ languages well → multilingual models with high FLORES‑200 or MMLU multilingual scores.

2. Hugging Face Search Filters

I used searches like:

“Hugging Face logic reasoning model 7B CPU inference”
“Hugging Face math benchmark GSM8K MATH 7B”
“Hugging Face multilingual model FLORES‑200 under 7B”
I focused on CPU‑compatible or quantized models, tags like logic/math/multilingual, and model size ≤ 7 B.

3 & 4. Candidate Models & Evaluation

🔍 LegalTech (Logic-heavy, CPU-friendly):
Mistral‑7B Instruct: compact, GQA architecture for fast CPU inference, strong reasoning & BoolQ/MMLU performance 
SynLogic‑7B (MiniMax): logic‑specialist fine‑tuned on puzzles/Sudoku, +9.5 points over Qwen‑7B on a logic benchmark 
SmolLM3‑1.7B: very small (≈1.7 B), extremely fast on CPU, BoolQ ≈ 85.7% and strong multilingual scores 
📚 EdTech (Math/logic, low‑end laptop):
Mathstral‑7B: expert math model, GSM8K ≈ 77.1%, MATH ≈ 56.6%, quantized GGUF available
InternLM2‑Math‑7B‑Plus: outperforms DeepSeek‑Math‑7B on informal/formal logic benchmarks
SmolLM3‑1.7B: strong math results (MGSM ~ 70.8), tiny footprint for laptops 
🌍 Global NGO (Multilingual):
SmolLM3‑1.7B: supports many languages, FLORES‑200 ≈ 82.1%, global MMLU ≈ 68.9% 
Gemma 3‑1B: extremely small, supports 140+ languages, up to 128 k context, runs on minimal RAM (~0.5–2 GB) 
Qwen2.5‑Coder‑7B‑Instruct: huge context, excellent math/code reasoning (HumanEval 88%), but its ~92 supported languages are programming languages rather than human languages, so it is a weak fit for this team

5. Final Picks (Top 2 vs final decision)

LegalTech candidates:
Mistral‑7B Instruct: fast CPU, strong overall reasoning & architecture optimized for inference.
SynLogic‑7B: extremely logic-focused but slightly niche.
Pick: Mistral‑7B Instruct – best balance of speed, general reasoning benchmarks, and CPU efficiency.

EdTech candidates:
Mathstral‑7B: specialized math reasoning with high GSM8K/MATH scores and quantization support.
SmolLM3‑1.7B: very light and fast, decent math performance but less math‑optimized.
Pick: Mathstral‑7B – stronger in raw math performance, still runs quantized on laptops.
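
To sanity-check the claim that a quantized 7B model still fits on a laptop, here is a back-of-the-envelope estimate of weight memory. It is a rough sketch: the helper name is made up, it counts the weights only (no KV cache, activations, or runtime overhead), and real quantized files often use somewhat more than the nominal bits per weight.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: int) -> float:
    """Approximate size of the weights alone, in GB (10^9 bytes)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


for bits in (16, 8, 4):
    print(f"7B at {bits}-bit: ~{weight_memory_gb(7, bits):.1f} GB")
# 7B at 16-bit: ~14.0 GB
# 7B at 8-bit: ~7.0 GB
# 7B at 4-bit: ~3.5 GB
```

At 16-bit precision the weights alone would take most of a 16 GB machine, which is why the quantized build matters for the low-end laptop requirement.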

Global NGO candidates:
SmolLM3‑1.7B: outstanding multilingual support and efficient on low‑end devices.
Gemma 3‑1B: tiniest footprint, excellent language coverage (140+), but weaker at reasoning.
Pick: SmolLM3‑1.7B – best blend of multilingual coverage and reasonable reasoning benchmarks.

6. Deliverables

🔍 Search Filter Summary
LegalTech: filters on Hugging Face: CPU‑compatible, logic, math, ≤ 7B.
EdTech: math, logic, CPU‑compatible, GSM8K/MATH tags, ≤ 7B.
Global NGO: multilingual, FLORES‑200, CPU‑compatible, ≤ 7B.
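
The filters above can be mimicked locally on a hand-written candidate list. This is only an illustration of the filtering logic, not a call to the Hugging Face Hub: the `Candidate` record, the `shortlist` helper, and the tags assigned to each model are assumptions taken from how the models are grouped in this write-up.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    params_billion: float
    tags: frozenset


# Tags are hand-assigned from the team sections above, not read from the Hub.
CANDIDATES = [
    Candidate("Mistral-7B Instruct", 7.0, frozenset({"logic"})),
    Candidate("SynLogic-7B", 7.0, frozenset({"logic"})),
    Candidate("Mathstral-7B", 7.0, frozenset({"math", "logic"})),
    Candidate("SmolLM3-1.7B", 1.7, frozenset({"logic", "math", "multilingual"})),
    Candidate("Gemma 3-1B", 1.0, frozenset({"multilingual"})),
]


def shortlist(candidates, required_tags, max_params_billion=7.0):
    """Keep candidates that carry every required tag and fit the size cap."""
    required = frozenset(required_tags)
    return [
        c.name
        for c in candidates
        if c.params_billion <= max_params_billion and required <= c.tags
    ]


print("LegalTech:", shortlist(CANDIDATES, {"logic"}))
print("EdTech:", shortlist(CANDIDATES, {"math", "logic"}))
print("Global NGO:", shortlist(CANDIDATES, {"multilingual"}))
```

The shortlist only narrows the field; the final pick still depends on the benchmark comparison in step 5.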

| Team       | Needs                                     | Your Pick                                                         |
| ---------- | ----------------------------------------- | ----------------------------------------------------------------- |
| LegalTech  | Fast model for logic-heavy chatbot on CPU | **Mistral‑7B Instruct** (fast reasoning, optimized for CPU)       |
| EdTech     | Logic/math-focused LLM on low-end laptops | **Mathstral‑7B** (quantized, high GSM8K/MATH performance)         |
| Global NGO | Model that speaks 5+ languages well       | **SmolLM3‑1.7B** (multilingual FLORES‑200 ≈ 82.1%, efficient CPU) |

✅ Summary
Mistral‑7B Instruct wins for LegalTech due to its architectural optimization and strong benchmarks.
Mathstral‑7B delivers top-tier math reasoning in low‑resource settings, ideal for EdTech.
SmolLM3‑1.7B excels in multilingual support and CPU efficiency for the Global NGO use case.

🌟 Exercise 4: Local Readiness Audit

| Requirement               | Your System Specs | Meets Requirement? |
|---------------------------|-------------------|--------------------|
| RAM (≥ 16 GB)             |        16 GB      | ✅                  |
| Free Disk Space (≥ 40 GB) |      249.13 GB    | ✅                  |
| OS (Linux/WSL2)           |      macOS 15.5   | ❌                  |
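
The same audit can be scripted so it is repeatable on another machine. This is a minimal sketch using only the standard library; the function names are made up, RAM is passed in by hand because the standard library has no portable call for it, and it assumes WSL2 reports itself as "Linux" through `platform.system()`.

```python
import platform
import shutil

MIN_RAM_GB = 16
MIN_FREE_DISK_GB = 40
ACCEPTED_OS = ("Linux",)  # WSL2 is assumed to report "Linux" as well


def audit(ram_gb: float, free_disk_gb: float, os_name: str) -> dict:
    """Compare system specs against the three requirements in the table."""
    return {
        "RAM": ram_gb >= MIN_RAM_GB,
        "Free Disk Space": free_disk_gb >= MIN_FREE_DISK_GB,
        "OS": os_name in ACCEPTED_OS,
    }


def detect_free_disk_gb(path: str = "/") -> float:
    """Free space on the volume holding `path`, in GB (10^9 bytes)."""
    return shutil.disk_usage(path).free / 1e9


def detect_os_name() -> str:
    """Returns "Linux", "Darwin" (macOS), or "Windows"."""
    return platform.system()


# Values from the table above; macOS is reported as "Darwin".
for requirement, ok in audit(16, 249.13, "Darwin").items():
    print(f"{requirement}: {'pass' if ok else 'fail'}")
# RAM: pass
# Free Disk Space: pass
# OS: fail
```

To audit the current machine instead, pass `detect_free_disk_gb()` and `detect_os_name()` in place of the hard-coded values.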