A Retrieval-Augmented Generation (RAG) assistant for Singapore public policy documents.
Ask questions in natural language about Singapore's Budget, housing policies, CPF rules, and monetary policy — and get answers grounded in official government sources.
Live demo: (coming soon)
AskSG lets you ask questions like:
- "What did Budget 2025 say about support for first-time homebuyers?"
- "Am I eligible to buy an HDB resale flat as a Singapore PR?"
- "How has Singapore's fiscal policy stance changed from 2023 to 2025?"
- "What are the CPF contribution rates for employees above 55?"
- "What is MAS's current assessment of Singapore's inflation outlook?"
Instead of reading through long government PDFs, you get a cited, grounded answer in seconds.
The pipeline, end to end:

```
Singapore government documents
(Budget speeches, MAS reviews, HDB guides, CPF guides)
            │
            ▼
[ ETL / Ingestion ]         etl/fetch_documents.py
    Download PDFs + HTML
            │
            ▼
[ Preprocessing ]           etl/chunk_documents.py
    Clean + split into overlapping
    text chunks (500 chars, 50 overlap)
            │
            ▼
[ Embedding + Indexing ]    etl/build_index.py
    sentence-transformers (all-MiniLM-L6-v2)
    → ChromaDB vector store
            │
            ▼
[ RAG Pipeline ]            app/rag.py
    User question → embed → retrieve top-k chunks
    → Groq LLM (Llama 3.3 70B) → grounded answer
            │
            ▼
[ Streamlit Interface ]     app/main.py
    Chat interface
```
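The chunking step is a simple sliding window. Here is a minimal sketch of the idea (the real logic lives in etl/chunk_documents.py; this helper is an illustration, and only the 500-character window and 50-character overlap come from the pipeline above):

```python
def chunk_text(text: str, size: int = 500, overlap: int = 50) -> list[str]:
    """Split cleaned document text into overlapping character windows."""
    step = size - overlap  # each chunk repeats the last 50 chars of the previous one
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + size]
        if chunk.strip():  # drop whitespace-only tails
            chunks.append(chunk)
    return chunks
```

The overlap keeps sentences that straddle a chunk boundary retrievable from at least one chunk.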
Data sources:

| Source | Documents | Format |
|---|---|---|
| Singapore Budget | Budget speeches 2023, 2024, 2025 | PDF |
| MAS | Macroeconomic Reviews Apr/Oct 2024, Apr 2025 | PDF |
| HDB | Buying guides, resale eligibility, couples & families | HTML |
| CPF Board | Contribution rates, housing usage, OW ceiling schedule, Budget 2023 CPF highlights | HTML |

All documents are official Singapore government publications, freely available to the public.
Tech stack:

| Layer | Tool |
|---|---|
| Language | Python 3.9+ |
| Embeddings | sentence-transformers — all-MiniLM-L6-v2 |
| Vector store | ChromaDB (local, persistent) |
| LLM | Groq API — llama-3.3-70b-versatile (free tier) |
| PDF parsing | pdfplumber |
| Frontend | Streamlit |
| Deployment (planned) | FastAPI + Docker + AWS EC2 |
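Putting the embedding and vector-store rows together, the core of etl/build_index.py looks roughly like this (a sketch, not the actual script; the collection name and the "text" field in chunks.jsonl are assumptions):

```python
import json
import chromadb
from sentence_transformers import SentenceTransformer

# Load the pre-chunked corpus: one JSON object per line, assumed to carry a "text" field.
with open("corpus/chunks.jsonl", encoding="utf-8") as f:
    chunks = [json.loads(line) for line in f]

# Embed every chunk with the same MiniLM model used at query time.
model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode([c["text"] for c in chunks], show_progress_bar=True)

# Persist embeddings + raw text into a local ChromaDB collection.
client = chromadb.PersistentClient(path="chroma_db")
collection = client.get_or_create_collection("asksg")  # collection name is an assumption
collection.add(
    ids=[str(i) for i in range(len(chunks))],
    documents=[c["text"] for c in chunks],
    embeddings=embeddings.tolist(),
)
```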
Repository layout:

```
asksg/
├── corpus/                    # Extracted text documents (tracked in git)
│   ├── budget/                # Singapore Budget speeches (3 PDFs)
│   ├── mas/                   # MAS Macroeconomic Reviews (3 PDFs)
│   ├── hdb/                   # HDB eligibility guides (3 HTML)
│   ├── cpf/                   # CPF contribution and housing guides (4 HTML)
│   └── chunks.jsonl           # 2,107 text chunks ready for embedding
├── etl/
│   ├── fetch_documents.py     # Downloads source documents
│   ├── chunk_documents.py     # Cleans and chunks text
│   ├── build_index.py         # Embeds chunks into ChromaDB
│   └── extract_local_pdf.py   # Helper: extract text from local PDFs
├── app/
│   ├── rag.py                 # RAG pipeline (retrieve + Groq generate)
│   └── main.py                # Streamlit chat interface
├── eval/
│   ├── ragas_eval.py          # Evaluation script (local NLI + cosine similarity)
│   └── test_set.json          # 10 curated Q&A pairs with ground truth
├── .gitignore
└── requirements.txt
```
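At query time, app/rag.py boils down to embed, retrieve, generate. A sketch under assumptions (the prompt wording, function shape, and collection name are illustrative; the model ID and the k=9 default come from this README):

```python
import os
import chromadb
from groq import Groq
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")
collection = chromadb.PersistentClient(path="chroma_db").get_or_create_collection("asksg")
groq = Groq(api_key=os.environ["GROQ_API_KEY"])

def answer(question: str, top_k: int = 9) -> str:
    # Embed the question and pull the k nearest chunks from the index.
    hits = collection.query(
        query_embeddings=[embedder.encode(question).tolist()],
        n_results=top_k,
    )
    context = "\n\n".join(hits["documents"][0])

    # Ask the LLM to answer strictly from the retrieved context.
    resp = groq.chat.completions.create(
        model="llama-3.3-70b-versatile",
        messages=[
            {"role": "system", "content": "Answer only from the provided context, and cite the source documents."},
            {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {question}"},
        ],
    )
    return resp.choices[0].message.content
```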
Quickstart:

```bash
# 1. Clone and set up the environment
git clone https://github.com/isafzh/asksg.git
cd asksg
python -m venv .venv
.venv\Scripts\activate       # Windows
# source .venv/bin/activate  # Mac/Linux
pip install -r requirements.txt

# 2. Add your Groq API key
echo GROQ_API_KEY=your_key_here > .env
# Get a free key at https://console.groq.com

# 3. Build the vector index (downloads the ~90 MB embedding model on first run)
python etl/build_index.py

# 4. Run the app
streamlit run app/main.py
# → opens at http://localhost:8501
```

To refresh the document corpus:
```bash
python etl/fetch_documents.py   # re-download source documents
python etl/chunk_documents.py   # re-chunk
python etl/build_index.py       # re-embed (delete chroma_db/ first)
```

RAG quality is measured on a hand-curated test set of 10 Q&A pairs drawn from the source documents (eval/test_set.json).
Two-tier evaluation: local metrics (zero extra API calls) plus LLM-as-judge (one Groq call per question):
| Metric | Method | API cost |
|---|---|---|
| NLI Faithfulness | NLI entailment (cross-encoder/nli-deberta-v3-base) | 0 extra calls |
| Context Relevance | Cosine similarity: query vs retrieved chunks (MiniLM) | 0 extra calls |
| Answer Similarity | Cosine similarity: answer vs ground truth (MiniLM) | 0 extra calls |
| Keyword Recall | Ground-truth keyword presence in retrieved context | 0 extra calls |
| Faithfulness (LLM) | LLM-as-judge: is every claim grounded in the context? | 1 call/question |
| Answer Relevance (LLM) | LLM-as-judge: does the answer address the question? | 1 call/question |
Total per run: 20 Groq API calls (10 generation + 10 judge). The original Ragas approach required over 1,300 calls and exhausted the free-tier quota in a single run.
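The local NLI faithfulness score can be computed along these lines (a sketch; how eval/ragas_eval.py splits the answer into claims and aggregates scores is an assumption):

```python
from scipy.special import softmax
from sentence_transformers import CrossEncoder

# cross-encoder/nli-deberta-v3-base emits logits over
# [contradiction, entailment, neutral] for each (premise, hypothesis) pair.
nli = CrossEncoder("cross-encoder/nli-deberta-v3-base")

def nli_faithfulness(context: str, answer: str) -> float:
    """Mean probability that the retrieved context entails each answer sentence."""
    sentences = [s.strip() for s in answer.split(".") if s.strip()]  # naive claim split
    logits = nli.predict([(context, s) for s in sentences])
    entailment = softmax(logits, axis=1)[:, 1]  # column 1 = entailment
    return float(entailment.mean())
```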
The eval script accepts a --top-k argument to control how many chunks are retrieved per query. Four values were tested to map the diminishing-returns curve:
| Metric | K=5 | K=7 | K=9 | K=11 |
|---|---|---|---|---|
| Faithfulness (NLI entailment) | 0.4449 | 0.4550 | 0.4922 | 0.4659 |
| Answer Similarity (cosine) | 0.8413 | 0.8768 | 0.8777 | 0.8906 |
| Keyword Recall | 0.7405 | 0.7664 | 0.8453 | 0.8958 |
K=9 is the sweet spot: faithfulness peaks there and then falls at K=11, where noise from weakly related chunks pushes the LLM to synthesise beyond what any single chunk supports. Answer similarity and keyword recall keep rising, but the faithfulness drop signals declining retrieval quality. Each extra chunk adds ~130 tokens to every Groq API call (so K=11 costs roughly 260 more tokens per query than K=9), which also reduces daily query capacity on the free tier.
The test set spans 10 hand-curated Q&A pairs across Budget, CPF, HDB, and MAS sources. Everything except the two LLM-judge metrics is computed locally, keeping each full run to the 20 Groq calls noted above.
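Context relevance and answer similarity are both plain cosine similarities over MiniLM embeddings, roughly as follows (a sketch; averaging over retrieved chunks is an assumption):

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

def context_relevance(question: str, chunks: list[str]) -> float:
    """Mean cosine similarity between the query and each retrieved chunk."""
    q = model.encode(question, convert_to_tensor=True)
    c = model.encode(chunks, convert_to_tensor=True)
    return float(util.cos_sim(q, c).mean())

def answer_similarity(answer: str, ground_truth: str) -> float:
    """Cosine similarity between the generated answer and the reference answer."""
    a, g = model.encode([answer, ground_truth], convert_to_tensor=True)
    return float(util.cos_sim(a, g))
```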
To run:

```bash
python eval/ragas_eval.py             # default top-k=5 → saves eval/results_k5.json
python eval/ragas_eval.py --top-k 7   # retrieve 7 chunks → saves eval/results_k7.json
python eval/ragas_eval.py --top-k 9   # optimal → saves eval/results_k9.json
python eval/ragas_eval.py --top-k 11  # diminishing gains → saves eval/results_k11.json
```

Done:
- Document ingestion pipeline (13 documents, 4 sources)
- Text preprocessing and chunking (2,107 chunks)
- Embedding and ChromaDB indexing (2,107 vectors, all-MiniLM-L6-v2)
- RAG pipeline (retrieval + Groq LLM)
- Streamlit chat interface
- Evaluation framework (local NLI + cosine similarity, 10-question test set)
- Top-K curve (k=5/7/9/11): faithfulness peaks at k=9, similarity/recall peak at k=11; k=9 chosen as optimal
- Temporal disambiguation gap diagnosed: Q5 CDC Vouchers ranked behind 2023/2024 Budget chunks due to same-topic multi-year corpus
Planned:
- FastAPI backend + Docker
- AWS EC2 deployment