pgflow Starter - HackerNews Classifier

This repository contains a starter example for pgflow, a workflow orchestration system that runs inside your Supabase Postgres database. It shows how to classify HackerNews posts using OpenAI models.

The example demonstrates how to replay historical inputs against modified workflow steps. This is useful for testing how changes to prompts, models, or logic affect outputs without re-fetching source data, so you can iterate on AI workflows and compare results across different configurations.

For detailed documentation, architecture overview, and advanced usage, see the Full README.

Flow Architecture

graph TD
    Start([Input: HN URL]) --> Item[Fetch Item]
    Start --> Comment[Fetch First Comment]
    Item --> Classification[AI Classification]
    Comment --> Classification

    style Start fill:#1e293b,color:#fff
    style Item fill:#1e293b,color:#fff
    style Comment fill:#1e293b,color:#fff
    style Classification fill:#1e293b,color:#fff

Source files: Flow Definition | Original Classify Task | Model-Parametrized Version
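
The graph above is a fan-in: the two fetch steps depend only on the input URL, and the classification step waits for both. As a rough illustration of that shape in plain TypeScript (not pgflow's DSL, and not the code in the linked source files), the sketch below uses hypothetical types and stub steps so it runs without network access or API keys.

```typescript
// Hypothetical shapes for illustration; the starter's real step outputs may differ.
type Item = { title: string };
type FirstComment = { text: string };
type Classification = { label: string };

type Steps = {
  fetchItem: (url: string) => Promise<Item>;
  fetchFirstComment: (url: string) => Promise<FirstComment>;
  classify: (item: Item, comment: FirstComment) => Promise<Classification>;
};

// Mirrors the graph: two independent fetches, then one step that needs both results.
async function runClassifyFlow(url: string, steps: Steps) {
  const [item, firstComment] = await Promise.all([
    steps.fetchItem(url),
    steps.fetchFirstComment(url),
  ]);
  const classification = await steps.classify(item, firstComment);
  return { item, firstComment, classification };
}

// Stub steps (assumptions, not the real tasks) so the sketch is runnable as-is.
const stubSteps: Steps = {
  fetchItem: async (url) => ({ title: `item for ${url}` }),
  fetchFirstComment: async () => ({ text: "first comment" }),
  classify: async (item, comment) => ({
    label: `${item.title} | ${comment.text}`,
  }),
};
```

Passing the steps in as an object is only a convenience for the sketch: it lets a different `classify` implementation be swapped in without touching the fetch steps.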

Quick Start

Setup

cd supabase/functions && cp .env.example .env
# Edit .env and set OPENAI_API_KEY and EDGE_WORKER_DB_URL (the DB URL is shown by supabase status)

npx -y supabase@latest start
npx -y supabase@latest functions serve

Wait about 3 seconds for pg_cron to start the worker.

Run Classification

SELECT * FROM pgflow.start_flow(
  flow_slug => 'classifyHnItem',
  input => jsonb_build_object('url','https://news.ycombinator.com/item?id=45245948')
);
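
The only flow input is a `url` key, so a fetch step has to derive the item id from it at some point. The following is a minimal sketch of one way to do that, assuming the public Hacker News Firebase API is the data source (the starter's fetch tasks may work differently). `parseHnItemId` and `itemApiUrl` are hypothetical helper names.

```typescript
// Assumption: the public Hacker News Firebase API; the starter may fetch data another way.
const HN_API_BASE = "https://hacker-news.firebaseio.com/v0";

// Extracts the numeric item id from a URL such as
// https://news.ycombinator.com/item?id=45245948
function parseHnItemId(url: string): number {
  const match = /^https?:\/\/news\.ycombinator\.com\/item\?id=(\d+)$/.exec(url);
  if (match === null) {
    throw new Error(`Not a HackerNews item URL: ${url}`);
  }
  return Number(match[1]);
}

// Builds the JSON endpoint for a single item.
function itemApiUrl(id: number): string {
  return `${HN_API_BASE}/item/${id}.json`;
}
```

Rejecting anything that is not an item URL up front means a malformed input fails in the first step rather than producing an empty classification later.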

Batch Process

WITH urls AS (
  SELECT unnest(ARRAY[
    'https://news.ycombinator.com/item?id=45247890',
    'https://news.ycombinator.com/item?id=45248802',
    'https://news.ycombinator.com/item?id=45223827',
    'https://news.ycombinator.com/item?id=45246953',
    'https://news.ycombinator.com/item?id=45245948',
    'https://news.ycombinator.com/item?id=45243635',
    'https://news.ycombinator.com/item?id=45221023',
    'https://news.ycombinator.com/item?id=45226938',
    'https://news.ycombinator.com/item?id=45246403',
    'https://news.ycombinator.com/item?id=45243803'
  ]) AS url
)
SELECT pgflow.start_flow(
  flow_slug => 'classifyHnItem',
  input => jsonb_build_object('url', url)
) FROM urls;

Check Results

-- Recent runs
SELECT flow_slug, status, output->'classification'
FROM pgflow.runs
ORDER BY started_at DESC LIMIT 10;

-- Completed classifications
SELECT input->>'url', output->'classification'
FROM pgflow.runs
WHERE flow_slug = 'classifyHnItem' AND status = 'completed';

→ Monitor flow execution (pgflow.dev)

Clear to start from scratch

SELECT pgflow.prune_data_older_than(make_interval(days => 0));

Compare Models

npm run compare
# or
npm run compare:limit -- --limit 10

The compare script demonstrates pgflow's replay capability with no magic. It simply:

1. Loads past run inputs with a single SQL query
2. Calls the classification step directly with those inputs
3. Compares outputs from different models (gpt-5-nano, gpt-5-mini, gpt-5)

This lets you test how prompt or model changes affect outputs without re-fetching data or modifying the database.
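
The replay loop described above can be sketched in a few lines. This is not the repository's compare script: `compareModels`, the row shape, and the stub classifier are hypothetical, and the SQL that loads past inputs is replaced by a plain array so the example stays self-contained.

```typescript
// Hypothetical signature: a classification step that takes a stored input and a model name.
type Classifier<I> = (input: I, model: string) => Promise<string>;

type ComparisonRow<I> = {
  input: I;
  outputs: Record<string, string>; // model name -> classification label
  agree: boolean; // true when every model returned the same label
};

// Replays each stored input against every model and records whether they agree.
async function compareModels<I>(
  inputs: I[],
  models: string[],
  classify: Classifier<I>,
): Promise<ComparisonRow<I>[]> {
  const rows: ComparisonRow<I>[] = [];
  for (const input of inputs) {
    const outputs: Record<string, string> = {};
    let first: string | undefined;
    let agree = true;
    for (const model of models) {
      const label = await classify(input, model);
      outputs[model] = label;
      if (first === undefined) first = label;
      else if (label !== first) agree = false;
    }
    rows.push({ input, outputs, agree });
  }
  return rows;
}

// Stub classifier with placeholder model names; a real one would call the model API.
const stubClassify: Classifier<{ id: number }> = async (input, model) =>
  model === "model-a" && input.id === 2 ? "show" : "story";
```

Because the classifier is passed in as a function, the same loop works for prompt changes as well as model changes: only the `classify` argument differs between runs.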

Full README | pgflow.dev

pgflow-dev/demo-hn-classifier
