# 📘 Demo Notebook: AI Agent on GCP


This notebook demonstrates the **data preparation**, **entity extraction**, and **summarization** workflow for the case study project.

Make sure you have:
- Enabled the required GCP services (Cloud Storage, BigQuery, Vertex AI, Cloud Natural Language API).
- Set your service account credentials:
```bash
export GOOGLE_APPLICATION_CREDENTIALS="path/to/key.json"
```
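
Before running the cells below, it can help to confirm that the variable is visible to the notebook kernel. The following is a minimal sketch, not part of the project code: the helper name `check_credentials` is hypothetical, and it only checks that the variable is set and points to an existing file, not that the key is valid.

```python
import os
from pathlib import Path

def check_credentials(env=os.environ):
    """Return (ok, message) describing the state of GOOGLE_APPLICATION_CREDENTIALS."""
    key_path = env.get("GOOGLE_APPLICATION_CREDENTIALS")
    if not key_path:
        return False, "GOOGLE_APPLICATION_CREDENTIALS is not set"
    if not Path(key_path).is_file():
        return False, f"Key file not found: {key_path}"
    return True, f"Using key file: {key_path}"

# Check the environment of the current kernel and report what was found
ok, message = check_credentials()
print(message)
```

If the variable was exported after the Jupyter server started, the kernel may not see it; restarting the server from the same shell usually resolves that.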


In [None]:
# Install dependencies (uncomment if needed)
# !pip install -r ../requirements.txt

In [None]:
# Load sample dataset
import json

with open("../data/raw_docs.json", "r", encoding="utf-8") as f:
    docs = json.load(f)

print("Sample documents:")
for d in docs:
    print(f"ID: {d['id']} | Text: {d['text']}")
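
The loading cell assumes every record has `id` and `text` keys and will raise a `KeyError` otherwise. A small check like the one below can surface malformed records first. This is a sketch under that assumption about the schema; the helper name `find_invalid_docs` and the sample records are hypothetical and not part of the project.

```python
def find_invalid_docs(docs, required=("id", "text")):
    """Return (index, missing_keys) pairs for records lacking any required key."""
    problems = []
    for i, doc in enumerate(docs):
        if not isinstance(doc, dict):
            # A non-dict record is treated as missing every required key
            problems.append((i, list(required)))
            continue
        missing = [key for key in required if key not in doc]
        if missing:
            problems.append((i, missing))
    return problems

# Hypothetical sample data, for illustration only
sample = [
    {"id": 1, "text": "Battery drains quickly."},
    {"id": 2},
]
print(find_invalid_docs(sample))  # prints [(1, ['text'])]
```

In the notebook, calling `find_invalid_docs(docs)` on the loaded list should return an empty list when the data matches the expected shape.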

In [None]:
# Run data preparation script
!python ../src/data_prep.py --input ../data/raw_docs.json --output ../data/cleaned_docs.json

In [None]:
# Run entity extraction + summarization (replace YOUR_PROJECT_ID)
!python ../src/extract_and_summarize.py --input ../data/cleaned_docs.json --output ../data/results.json --project YOUR_PROJECT_ID
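
To avoid editing the placeholder by hand in each cell, the command can be assembled in Python with the project ID read from the environment. The sketch below reuses the script path and flags from the cell above; the helper name `build_extract_command` is hypothetical, and reading the ID from `GOOGLE_CLOUD_PROJECT` is an assumption about your setup rather than something the project requires.

```python
import os
import shlex

def build_extract_command(project_id,
                          input_path="../data/cleaned_docs.json",
                          output_path="../data/results.json"):
    """Build the argument list for extract_and_summarize.py."""
    # Refuse to build a command that still carries the placeholder
    if not project_id or project_id == "YOUR_PROJECT_ID":
        raise ValueError("Set a real GCP project ID before running the script")
    return [
        "python", "../src/extract_and_summarize.py",
        "--input", input_path,
        "--output", output_path,
        "--project", project_id,
    ]

# Assumption: the project ID is exported as GOOGLE_CLOUD_PROJECT
project_id = os.environ.get("GOOGLE_CLOUD_PROJECT")
if project_id:
    cmd = build_extract_command(project_id)
    print(shlex.join(cmd))  # inspect the command before running it
else:
    print("GOOGLE_CLOUD_PROJECT is not set; skipping")
```

Once the printed command looks right, the list can be passed to `subprocess.run(cmd, check=True)`, which raises an error if the script exits with a non-zero status.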

In [None]:
# Display results
with open("../data/results.json", "r", encoding="utf-8") as f:
    results = json.load(f)

for r in results:
    print(f"Doc ID: {r['id']}")
    print("Entities:", r['entities'])
    print("Summary:", r['summary'])
    print("-"*50)

In [None]:
# Run the agent workflow (replace YOUR_PROJECT_ID)
!python ../src/agent_workflow.py --query "Summarize customer complaints about battery life" --project YOUR_PROJECT_ID