# 🎯 FIXED GPT-2 Singapore Financial Fine-Tuning (Proven Working Approach)

## ❌ **Previous Issues:**
- Singapore content dropped from 75% to 37.5%
- Domain accuracy halved
- Model knowledge corrupted instead of enhanced
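
The notebook does not show how the "Singapore content" percentage is measured. One plausible reading is the share of generated responses that mention at least one Singapore-specific term. The sketch below illustrates that reading only; the keyword list and the function names `mentions_singapore` and `singapore_content_rate` are hypothetical and not taken from the original evaluation code.

```python
import re

# Hypothetical keyword list; the notebook's real criteria are not shown.
SG_TERMS = ("singapore", "mas", "sgd", "cpf", "sgx")


def mentions_singapore(text, terms=SG_TERMS):
    """Return True if any term appears as a whole word (case-insensitive)."""
    lowered = text.lower()
    return any(re.search(rf"\b{re.escape(term)}\b", lowered) for term in terms)


def singapore_content_rate(responses, terms=SG_TERMS):
    """Fraction of responses that mention at least one Singapore term."""
    if not responses:
        return 0.0
    hits = sum(mentions_singapore(response, terms) for response in responses)
    return hits / len(responses)
```

Under this definition, a drop from 75% to 37.5% would correspond to, for example, 6 of 8 responses mentioning Singapore before fine-tuning and only 3 of 8 afterwards.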

## ✅ **This Fixed Version:**
- Uses **proven working parameters** from successful runs
- **Conservative LoRA** to prevent knowledge corruption
- **Proper data formatting** for Singapore financial content
- **Training mode inference** for better results
- **Expected: 80%+ Singapore content (up from 37.5%) and improved domain accuracy**
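
To give a rough sense of what "conservative" can mean for LoRA, the sketch below counts the trainable parameters a low-rank adapter adds. It assumes GPT-2 small (12 layers, hidden size 768, about 124M parameters) with adapters only on the fused attention projection `c_attn` (768 to 2304). The rank values are illustrative and are not the notebook's actual settings.

```python
# Assumed GPT-2 small dimensions; not taken from this notebook.
N_LAYERS = 12
HIDDEN = 768
C_ATTN_OUT = 3 * HIDDEN  # fused query/key/value projection
GPT2_SMALL_PARAMS = 124_439_808


def lora_param_count(rank, fan_in=HIDDEN, fan_out=C_ATTN_OUT, n_layers=N_LAYERS):
    """Parameters added by LoRA: A is (rank x fan_in), B is (fan_out x rank)."""
    return rank * (fan_in + fan_out) * n_layers


for rank in (4, 8, 16):
    added = lora_param_count(rank)
    share = added / GPT2_SMALL_PARAMS
    print(f"r={rank:>2}: {added:,} trainable params ({share:.2%} of base model)")
```

Under these assumptions, rank 4 adds 147,456 trainable parameters, roughly 0.12% of the base model. A smaller rank leaves less capacity to overwrite what the pretrained model already knows, which is one way to read the "conservative" setting above.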


In [None]:
# 🚀 SETUP WITH PROVEN WORKING CONFIGURATION
!pip install torch transformers datasets peft accelerate rouge-score nltk sentence-transformers -q

import torch
import json
import time
import numpy as np
from pathlib import Path

from transformers import (
    AutoTokenizer, AutoModelForCausalLM,
    TrainingArguments, Trainer, DataCollatorForLanguageModeling
)
from peft import LoraConfig, TaskType, get_peft_model
from datasets import Dataset

# Evaluation libraries
from rouge_score import rouge_scorer
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction
from sentence_transformers import SentenceTransformer
import nltk
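nltk.download('punkt', quiet=True)
# Newer NLTK releases load tokenizer data from 'punkt_tab' instead of 'punkt'
nltk.download('punkt_tab', quiet=True)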

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print(f"✅ Setup complete! Using device: {device}")
