Murmur

Lambda-architecture-aware streaming aggregation framework for Go.

Murmur is a spiritual successor to Twitter's Summingbird, built for Go-shop AWS deployments in 2026. One pipeline definition, three execution modes (live stream, snapshot bootstrap, archive replay), monoid-typed state in DynamoDB with optional Valkey acceleration, and a generic gRPC query layer that merges across windows.

Status

Pre-1.0, experimental. The architecture is built and exercised end-to-end against a docker-compose stack. Several rough edges are tracked openly in STABILITY.md — most notably error-handling gaps and the gRPC Get/GetWindow/etc. surface being a generic adapter today rather than per-pipeline codegen.

Feature Status
Pipeline DSL with structural monoids
Live mode: Kafka source (franz-go)
Live mode: Kinesis source ⚠️ single-instance only, no checkpointing — KCL v3 multi-instance is on the roadmap
Bootstrap mode: Mongo SnapshotSource + handoff token
Replay mode: S3 / MinIO JSON-Lines
State: DynamoDB Int64SumStore (atomic ADD) + BytesStore (CAS)
Cache: Valkey Int64Cache (write-through, INCRBY)
Monoids: Sum, Count, First, Last, Set
Monoids: Min, Max (Bounded[V])
Sketches: HyperLogLog, TopK (Misra-Gries), Bloom
Windowed aggregations + sliding-window queries
Generic gRPC query service (Get / GetMany / GetWindow / GetRange) ✅ — typed-per-pipeline codegen is on the roadmap
Atomic state-table swap (alias version pointer)
Spark Connect batch executor (user-supplied SQL) ✅ validated locally against apache/spark:4.0.1
Lambda mode (batch view ⊕ realtime delta merge) ✅ via pkg/query.LambdaQuery
Decayed-value monoid (exponential decay) ✅ via pkg/monoid/compose.DecayedSum
Minute / hour / daily windowed buckets
Web UI: dark mode, pipeline DAG, live metrics, query console (cmd/murmur-ui)
Admin control plane: Connect-RPC, single port speaks gRPC + gRPC-Web + Connect/HTTP-JSON; proto-defined contract for any-language clients (pkg/admin, proto/murmur/admin/v1/admin.proto)
Metrics recorder hook in streaming runtime (pkg/metrics + streaming.WithMetrics)
DX facade: Counter / UniqueCount / TopN presets (pkg/murmur)
Terraform pipeline-counter module
Worked example: page-view-counters (worker + query binaries)
Per-pipeline gRPC codegen (typed responses) 🛣 roadmap
Valkey-native HLL/Bloom acceleration 🛣 roadmap
KCL-v3 Kinesis source 🛣 roadmap

Limitations to read before adopting

  • replace directive only for Spark Connect. The root github.com/gallowaysoftware/murmur module no longer depends on github.com/apache/spark-connect-go; pkg/exec/batch/sparkconnect carries its own go.mod. Consumers who don't use Spark Connect (95% of users) get a clean go.mod. Consumers who DO use the sparkconnect submodule must mirror its replace github.com/apache/spark-connect-go => github.com/pequalsnp/spark-connect-go … line in their own go.mod, as shown after this list — Go does not propagate replace directives transitively.
  • At-least-once with optional dedup. Pass streaming.WithDedup(d) (where d is a pkg/state/dynamodb.Deduper) to make replay-after-crash idempotent for any monoid; a sketch follows this list. Without it, the streaming runtime is at-least-once with no per-EventID dedup — fine for idempotent monoids (Set, Min, Max, Bloom) but double-counts non-idempotent ones (Sum, HLL, TopK).
  • Single-goroutine streaming runtime. Phase-1 streaming processes records sequentially per worker. The throughput ceiling is roughly 5–10k events/s per worker against DDB-local, depending on item size. Scale horizontally with Kafka partitions until per-partition parallelism lands.
  • Min / Max monoids previously violated the identity law. Fixed: lift inputs via core.NewBounded(v); the monoid value type is core.Bounded[V] and Identity is the unset wrapper.
  • CORS is closed by default. Pass admin.WithAllowedOrigins("https://dashboard.example", …) (or cmd/murmur-ui --allow-origin=…) to open it up. The admin API is read-only but still leaks pipeline metadata, so don't expose it to the public internet without auth in front.
  • CI runs on every PR. gofmt / go vet / unit tests with -race / golangci-lint / web tsc + eslint + vite build. Dependabot is wired up for Go, npm, and Actions.
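
For the Spark Connect case, the mirrored directive in a consumer's go.mod looks like the snippet below. The pinned version is elided here just as it is above; copy the exact line from pkg/exec/batch/sparkconnect/go.mod, because Go will not inherit it for you.

// go.mod of a service that imports the sparkconnect submodule.
// Copy the full replace line (with its pinned version) verbatim from
// pkg/exec/batch/sparkconnect/go.mod.
replace github.com/apache/spark-connect-go => github.com/pequalsnp/spark-connect-go …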
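
And a minimal sketch of opting into dedup, reusing the names from the "Quick taste" example below. streaming.WithDedup and the dynamodb Deduper are named above, but the constructor (NewDeduper) and the exact option-passing site are assumptions; check pkg/state/dynamodb before copying this.

// Hypothetical wiring: NewDeduper and the variadic option on
// RunStreamingWorker are guesses; only WithDedup and Deduper are documented.
dedup := mddb.NewDeduper(ddbClient, "page_views_dedup")
rc := murmur.RunStreamingWorker(ctx, pipe, streaming.WithDedup(dedup))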

Quick taste

import (
    "context"
    "time"

    "github.com/aws/aws-sdk-go-v2/config"
    "github.com/aws/aws-sdk-go-v2/service/dynamodb"

    "github.com/gallowaysoftware/murmur/pkg/murmur"
    mkafka "github.com/gallowaysoftware/murmur/pkg/source/kafka"
    mddb "github.com/gallowaysoftware/murmur/pkg/state/dynamodb"
)

type PageView struct {
    PageID string `json:"page_id"`
    UserID string `json:"user_id"`
}

func main() {
    ctx := context.Background()

    src, err := mkafka.NewSource(mkafka.Config[PageView]{
        Brokers:       []string{"localhost:9092"},
        Topic:         "page_views",
        ConsumerGroup: "page_views_worker",
        Decode:        mkafka.JSONDecoder[PageView](),
    })
    if err != nil {
        panic(err)
    }
    defer src.Close()

    // Construct the DynamoDB client the store writes through.
    cfg, err := config.LoadDefaultConfig(ctx)
    if err != nil {
        panic(err)
    }
    ddbClient := dynamodb.NewFromConfig(cfg)

    store := mddb.NewInt64SumStore(ddbClient, "page_views")
    defer store.Close()

    pipe := murmur.Counter[PageView]("page_views").
        From(src).
        KeyBy(func(e PageView) string { return e.PageID }).
        Daily(90 * 24 * time.Hour). // daily buckets, 90-day retention
        StoreIn(store).
        Build()

    if rc := murmur.RunStreamingWorker(ctx, pipe); rc != 0 {
        panic("worker exited with non-zero code")
    }
}

For the runnable version, see examples/page-view-counters/.

The generic gRPC service exposes Get(entity), GetWindow(entity, duration), GetMany(entities), and GetRange(entity, start, end) — wire it up with pkg/query/grpc.NewServer; proto definitions in proto/murmur/v1/query.proto. Per-pipeline typed responses (today everything is bytes) are tracked on the roadmap.
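
Because the query server is Connect-RPC (see pkg/query/grpc below), it also answers plain HTTP+JSON, so you can poke it with curl much like the admin API. The address and field names here are illustrative guesses; the authoritative request shapes live in proto/murmur/v1/query.proto.

# hypothetical port and request fields — check query.proto for the real shape
curl -X POST http://localhost:9090/murmur.v1.QueryService/Get \
    -H 'Content-Type: application/json' -d '{"pipeline":"page_views","key":"home"}'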

Architecture

The full design is documented in doc/architecture.md. For the canonical "how do I integrate Murmur with text search" question, see doc/search-integration.md — three patterns (query-time rescore, bucketed indexing, snapshot+delta), their tradeoffs, and a reference DDB-Streams Lambda projector.

The headline ideas:

  1. Structural monoids. Each well-known monoid (Sum, HLL, TopK, Bloom, …) carries a Kind that backend executors dispatch on — DDB picks atomic ADD vs CAS, Spark picks the right SQL aggregation, Valkey picks PFADD vs INCRBY. Custom monoids work as opaque Go closures on Go-only execution backends. (A shape sketch follows this list.)
  2. Three execution modes, one DSL. A pipeline definition is execution-mode-agnostic. The same monoid Combine runs from a Kafka consumer (live), a Mongo collection scan (bootstrap), or an S3 JSON-Lines archive (replay).
  3. DDB is source of truth, Valkey is a cache. State that's lost in Valkey is repopulatable from DDB. The cache is never trusted as ground truth.
  4. Windowed monoids first-class. windowed.Daily(retention) adds a time-bucket dimension to state keys; queries assemble sliding windows by merging the N most-recent buckets via the monoid Combine (second sketch below).
  5. No Beam, no Flink-in-Go. Beam's Go SDK is unmaintained and its Spark runner is batch-only. Murmur is not a streaming engine — it's a framework that runs your monoid Combine on ECS Fargate workers reading from Kinesis/Kafka, and dispatches batch through Spark Connect.
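
To make idea 1 concrete, here is a rough sketch of what a structural monoid can look like. The method set and names are assumptions for illustration, not Murmur's actual core API.

// Illustrative only: shows Kind-based dispatch, not the real core types.
type Kind string

const (
    KindSum  Kind = "sum"  // DDB can use atomic ADD, Valkey INCRBY
    KindHLL  Kind = "hll"  // DDB falls back to bytes + CAS
    KindUser Kind = "user" // opaque Go closure; Go-only backends
)

type Monoid[V any] interface {
    Identity() V      // neutral element: Combine(Identity(), v) == v
    Combine(a, b V) V // associative merge
    Kind() Kind       // what backend executors dispatch on
}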
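
Idea 4's read path then falls out of the same interface: a sliding window is just a fold of the most-recent buckets under Combine. Again a sketch, not the real windowed package.

// Merge the N most-recent time buckets into one sliding-window value.
// Correct for any monoid because Combine is associative.
func slidingWindow[V any](m Monoid[V], buckets []V) V {
    acc := m.Identity()
    for _, b := range buckets {
        acc = m.Combine(acc, b)
    }
    return acc
}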

Why not Beam, why not Flink, why not Goka?

doc/architecture.md has the full version. Short version:

  • Apache Beam Go SDK is unmaintained as of 2.32 and the Spark runner is batch-only — Beam streaming on EMR is not actually possible.
  • Apache Flink (incl. Amazon Managed Service for Apache Flink) is mature but JVM-only. JVM tax for Go shops, and no auto-generated query layer.
  • Goka is Kafka-only, no batch story, no query layer, small community.

Murmur fills the gap with: unified Go DSL, structural monoids that dispatch to multiple backends, three execution modes, time-windowed aggregations, and a generic gRPC service that does the merge.

Run locally

make compose-up   # bring up kafka, dynamodb-local, valkey, mongo, minio, spark-connect
                  # plus rs.initiate for Mongo (idempotent)
make seed-ddb     # create the page_views DDB table the example reads from
make test-unit    # fast unit tests, no infra
make test-integration  # full E2E suite against the docker-compose stack
make ui           # build the web UI and run cmd/murmur-ui --demo on :8080

make help lists every target.

The end-to-end tests in test/e2e/ exercise:

  • Counter pipeline: Kafka → Sum → DDB (counter_test.go)
  • HLL pipeline: Kafka → HLL → DDB BytesStore CAS (hll_test.go)
  • Windowed counters with Last1/2/3/7/10/30Days queries (windowed_test.go)
  • Mongo bootstrap with Change Stream resume token (mongo_bootstrap_test.go)
  • DDB ParallelScan bootstrap with re-run idempotency under DDB-backed Deduper (ddb_bootstrap_test.go)
  • S3 replay into a shadow table (s3_replay_test.go)
  • Spark Connect batch SUM aggregation → DDB (spark_connect_test.go)

Production-readiness packages

Beyond the core pipeline DSL, several packages exist to make Murmur deployable:

  • pkg/exec/lambda/{kinesis,dynamodbstreams,sqs} — three Lambda runtimes for the AWS-native event sources, all sharing the same retry / dedup / BatchItemFailures contract via pkg/exec/processor.
  • pkg/source/snapshot/{mongo,dynamodb,jsonl,s3} — bootstrap sources for the four common shapes: Mongo collections, DynamoDB ParallelScan, raw JSON Lines, and S3-prefix-scan-of-JSON-Lines for partitioned archives.
  • pkg/state/{dynamodb,valkey} — DDB as source-of-truth (Int64SumStore / BytesStore + Deduper), Valkey as cache (Int64Cache / BytesCache + warmup helpers in pkg/query).
  • pkg/query/grpc — Connect-RPC server speaking gRPC + gRPC-Web + Connect/HTTP-JSON on one port. Singleflight coalescing + fresh_read flag + per-RPC metrics + batched windowed reads (GetWindowMany / GetRangeMany).
  • pkg/projection — bucket functions (LogBucket / LinearBucket / ManualBucket) + HysteresisBucket for change-data-capture into search indices.
  • pkg/observability/autoscale — Signal → Emitter loop for publishing scaling-signal metrics. Reference CloudWatch emitter for ECS Fargate target tracking on Kafka consumer lag / Kinesis iterator-age / events-per-second.
  • pkg/state.NewInstrumented — decorator for any Store[V] / Cache[V] that adds metrics.Recorder hooks (per-op latency + errors). Zero-overhead when the recorder is nil.
  • pkg/murmur — facade with the common-case presets (Counter, UniqueCount, TopN, Trending) plus RunStreamingWorker and the Lambda-side KinesisHandler / DynamoDBStreamsHandler / SQSHandler / MustHandler wrappers (wiring sketch after this list).
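
A minimal sketch of the Lambda-side wiring referenced in the last bullet, assuming aws-lambda-go on the outside. KinesisHandler and MustHandler are real names from pkg/murmur, but their signatures here are assumptions.

package main

import (
    "github.com/aws/aws-lambda-go/lambda"

    "github.com/gallowaysoftware/murmur/pkg/murmur"
)

func main() {
    // The same Build() chain shown in "Quick taste";
    // buildPageViewsPipeline is a hypothetical helper for brevity.
    pipe := buildPageViewsPipeline()

    // Assumed shape: KinesisHandler adapts the pipeline to a Kinesis-event
    // handler and MustHandler panics on construction error.
    lambda.Start(murmur.MustHandler(murmur.KinesisHandler(pipe)))
}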

Worked examples

  • examples/page-view-counters/ — runnable two-binary pipeline (cmd/worker + cmd/query), a Dockerfile producing a multi-binary distroless image, and the Terraform deployment via deploy/terraform/modules/pipeline-counter/.
  • examples/mongo-cdc-orderstats/ — Mongo collection bootstrap → Kafka CDC live, with upstream-id dedup so re-deliveries fold idempotently.
  • examples/recently-interacted-topk/ — single Top-N pipeline fed by two sources at once: Kinesis (consumed via an AWS Lambda trigger) plus Kafka (consumed by a long-running ECS worker). Both binaries write through the same DDB row; the Misra-Gries Combine produces a unified ranking across channels.
  • examples/search-projector/ — runnable Pattern B from doc/search-integration.md: a Lambda that tails Murmur's counter table via DDB Streams and projects bucket transitions into an OpenSearch index, cutting the search-side write rate from one write per event to one per order of magnitude (~6 reindexes for a 0→1M counter rise instead of ~1M; see the sketch after this list).
  • examples/search-rerank/ — runnable Pattern A from the same doc: an HTTP search service that does two-stage retrieval (OpenSearch recall + Murmur counter rerank). Pairs with the search-projector to form the canonical "filter on bucket + rank by live counters" shape.
  • examples/typed-wrapper/ — count-core-shaped reference for the typed-wrapper pattern: how application services expose Murmur counter pipelines through their own typed Connect-RPC API instead of the generic Value{bytes} shape. Uses pkg/query/typed as the building block.
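
The arithmetic behind the search-projector bullet: with order-of-magnitude buckets, only bucket transitions trigger a reindex, and a counter rising from 0 to 1M crosses just one boundary per decade. A sketch of the idea; pkg/projection's real LogBucket API may differ.

// Order-of-magnitude bucketing: 0, 1–9, 10–99, …, 100k–999k, 1M+.
// A counter climbing 0 -> 1,000,000 changes bucket once per decade,
// so the search index sees a handful of writes instead of a million.
func logBucket(n int64) int {
    if n <= 0 {
        return 0
    }
    b := 1
    for n >= 10 {
        n /= 10
        b++
    }
    return b
}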

Web UI and admin API

make ui   # builds the UI, builds the binary, runs --demo on :8080
# open http://localhost:8080

--demo registers three synthetic pipelines and ticks fake metrics so the dashboard, DAG, and query console have data to show. Real workers register via pkg/admin.Server.Register.

The bundled UI is one client of the admin API; anyone can sub in their own. The contract lives in proto/murmur/admin/v1/admin.proto and the server uses Connect-RPC, so a single port speaks gRPC, gRPC-Web, and Connect (HTTP+JSON) — pick whichever your client supports. Generate bindings in your language of choice with buf generate. Hit it from curl if you want:

curl -X POST http://localhost:8080/api/murmur.admin.v1.AdminService/ListPipelines \
    -H 'Content-Type: application/json' -d '{}'
# → {"pipelines":[{"name":"page_views","monoidKind":"sum",...}, ...]}

License

Apache 2.0.
