
CronosDB

Distributed Timestamp-Triggered Database with Built-in Scheduler & Pub/Sub


CronosDB is a distributed database designed for timestamp-triggered event processing. It combines the durability of a write-ahead log (WAL), the precision of a timing wheel scheduler, and the scalability of partitioned, replicated storage.

Features

Core Features ✅

  • Timestamp-Triggered Events - Schedule events for future execution
  • Append-Only WAL - Durable, segmented storage with CRC32 checksums (record framing sketched after this list)
  • Timing Wheel Scheduler - O(1) timer management for millions of events
  • gRPC API - High-performance streaming pub/sub with batch support
  • Bloom Filter + PebbleDB Dedup - Lock-free deduplication with two-tier lookup
  • Consumer Groups - Kafka-style offset tracking
  • Replay Engine - Time-range or offset-based event replay
  • Backpressure Control - Flow control with delivery credits
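
To make the WAL feature concrete, here is a short record-framing sketch. It is illustrative only, assuming a simple [length][crc32][payload] layout and a 1MB buffered writer; the layout and helper names are not taken from CronosDB's actual on-disk format.

package main

import (
    "bufio"
    "encoding/binary"
    "hash/crc32"
    "log"
    "os"
)

// appendRecord frames one payload as [length][crc32][payload] and appends it
// to the segment through the buffered writer.
func appendRecord(w *bufio.Writer, payload []byte) error {
    var header [8]byte
    binary.BigEndian.PutUint32(header[0:4], uint32(len(payload)))
    binary.BigEndian.PutUint32(header[4:8], crc32.ChecksumIEEE(payload))
    if _, err := w.Write(header[:]); err != nil {
        return err
    }
    _, err := w.Write(payload)
    return err
}

func main() {
    f, err := os.OpenFile("segment-000001.wal", os.O_CREATE|os.O_APPEND|os.O_WRONLY, 0o644)
    if err != nil {
        log.Fatal(err)
    }
    defer f.Close()

    w := bufio.NewWriterSize(f, 1<<20) // 1MB buffered writes, as in the feature list
    if err := appendRecord(w, []byte(`{"messageId":"test-1","topic":"test-topic"}`)); err != nil {
        log.Fatal(err)
    }
    if err := w.Flush(); err != nil {
        log.Fatal(err)
    }
}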

Distributed Features ✅

  • Multi-Node Clustering - 3+ node clusters with automatic partition distribution
  • Leader-Follower Replication - Async WAL replication
  • Raft Consensus - Metadata consistency via HashiCorp Raft
  • Consistent Hashing - Automatic partition routing (a simplified routing sketch follows this list)
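
To illustrate the partition-routing idea, here is a simplified consistent-hash ring in Go. It is a sketch under stated assumptions (FNV hashing, 64 virtual nodes per partition, 8 partitions as in the default configuration), not CronosDB's actual router.

package main

import (
    "fmt"
    "hash/fnv"
    "sort"
)

// ring is a toy consistent-hash ring: each partition gets several virtual
// points on a 32-bit circle, and a key routes to the first point at or
// after its own hash.
type ring struct {
    points []uint32
    owner  map[uint32]int // point -> partition ID
}

func hash32(s string) uint32 {
    h := fnv.New32a()
    h.Write([]byte(s))
    return h.Sum32()
}

func newRing(partitions, vnodes int) *ring {
    r := &ring{owner: make(map[uint32]int)}
    for p := 0; p < partitions; p++ {
        for v := 0; v < vnodes; v++ {
            pt := hash32(fmt.Sprintf("partition-%d-%d", p, v))
            r.points = append(r.points, pt)
            r.owner[pt] = p
        }
    }
    sort.Slice(r.points, func(i, j int) bool { return r.points[i] < r.points[j] })
    return r
}

func (r *ring) route(key string) int {
    h := hash32(key)
    i := sort.Search(len(r.points), func(i int) bool { return r.points[i] >= h })
    if i == len(r.points) {
        i = 0 // wrap around the circle
    }
    return r.owner[r.points[i]]
}

func main() {
    r := newRing(8, 64) // 8 partitions (the documented default), 64 virtual nodes each
    fmt.Println("test-1 ->", r.route("test-1"))
}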

Quick Start

Prerequisites

  • Go 1.24+
  • protoc (Protocol Buffers compiler)

Build & Run

# 1. Generate protobuf code
protoc --go_out=. --go-grpc_out=. proto/events.proto

# 2. Build the server
go build -o bin/cronos-api ./cmd/api/main.go

# 3. Run single node
./bin/cronos-api -node-id=node1 -data-dir=./data

# Or use Makefile for cluster mode (recommended)
make node1  # Terminal 1 - Leader
make node2  # Terminal 2 - Follower
make node3  # Terminal 3 - Follower

# 4. Run load test
make loadtest-batch BATCH_SIZE=100

# 5. Check health
curl http://localhost:8080/health
# Expected: OK

Cluster Mode (3 Nodes)

# Terminal 1: Start leader node
make node1

# Terminal 2: Start follower (joins leader)
make node2

# Terminal 3: Start follower (joins leader)
make node3

# Run benchmark
make loadtest-batch PUBLISHERS=30 EVENTS=3333 BATCH_SIZE=100
# Expected: ~300K events/sec

Test with grpcurl

# Publish an event (single)
grpcurl -plaintext \
  -d '{"event":{"messageId":"test-1","scheduleTs":'$(date -u +%s%3N)',"payload":"SGVsbG8=","topic":"test-topic"}}' \
  localhost:9000 cronos_db.EventService.Publish

# Publish batch (high throughput)
grpcurl -plaintext \
  -d '{"events":[{"messageId":"batch-1","scheduleTs":'$(date -u +%s%3N)',"payload":"SGVsbG8=","topic":"test"}]}' \
  localhost:9000 cronos_db.EventService.PublishBatch

# Subscribe to events
grpcurl -plaintext \
  -d '{"consumerGroup":"group-1","topic":"test-topic","partitionId":0}' \
  localhost:9000 cronos_db.EventService.Subscribe

See MVP_BUILD_GUIDE.md for detailed instructions.
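
The same RPCs can be called from Go. The client below is a hedged sketch: the import path and the generated names (pb.NewEventServiceClient, pb.PublishRequest, pb.Event and its fields) are assumed to follow from proto/events.proto and may differ from the actual generated code in pkg/types.

package main

import (
    "context"
    "log"
    "time"

    "google.golang.org/grpc"
    "google.golang.org/grpc/credentials/insecure"

    // Assumed import path for the code generated from proto/events.proto.
    pb "github.com/jatin711-debug/cronos_db_golang/pkg/types"
)

func main() {
    conn, err := grpc.NewClient("localhost:9000", grpc.WithTransportCredentials(insecure.NewCredentials()))
    if err != nil {
        log.Fatal(err)
    }
    defer conn.Close()

    client := pb.NewEventServiceClient(conn)
    ctx, cancel := context.WithTimeout(context.Background(), 5*time.Second)
    defer cancel()

    // Mirrors the single-event grpcurl call above: schedule an event a few
    // seconds in the future on topic "test-topic".
    _, err = client.Publish(ctx, &pb.PublishRequest{
        Event: &pb.Event{
            MessageId:  "test-1",
            ScheduleTs: time.Now().Add(5 * time.Second).UnixMilli(),
            Payload:    []byte("Hello"),
            Topic:      "test-topic",
        },
    })
    if err != nil {
        log.Fatal(err)
    }
}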

Architecture

┌─────────────┐
│   Client    │
└──────┬──────┘
       │ gRPC
       ▼
┌─────────────────────┐
│   API Gateway       │ (gRPC server)
└──────┬──────────────┘
       │
       ├─────────────────────────────┬─────────────────────────────┐
       │                             │                             │
       ▼                             ▼                             ▼
┌──────────────┐            ┌──────────────┐            ┌──────────────┐
│ Partition 0  │            │ Partition 1  │            │ Partition N  │
│  (Leader)    │◄──────────►│  (Leader)    │◄──────────►│  (Leader)    │
└──────┬───────┘            └──────┬───────┘            └──────┬───────┘
       │                            │                            │
       ├────────────┬───────────────┼────────────┬───────────────┤
       │            │               │            │               │
       ▼            ▼               ▼            ▼               ▼
   [WAL]      [Scheduler]      [Delivery]   [Dedup]      [Consumer]
   [DB]       [TimingWheel]    [Worker]     [Store]      [Groups]

Key Components:

  • WAL Storage - Append-only, segmented logs with sparse indexes & 1MB buffered writes
  • Timing Wheel - Hierarchical scheduler for O(1) timer management with batch scheduling
  • Bloom Filter - Lock-free in-memory filter for fast dedup (skips 99% of PebbleDB reads)
  • Dedup Store - PebbleDB-backed message deduplication with 64MB memtable (two-tier lookup sketched after this list)
  • Delivery Worker - Backpressure-controlled event dispatch
  • Consumer Groups - Offset tracking per group
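
As referenced above, the two-tier dedup lookup boils down to: a definite miss in the Bloom filter means the key is new and PebbleDB is never read; only a possible hit pays for a store lookup. The sketch below illustrates that flow against placeholder Filter and Store interfaces; it is not the actual internal/dedup code.

package dedup

// Filter and Store are placeholder interfaces standing in for the in-memory
// Bloom filter and the PebbleDB-backed dedup store.
type Filter interface {
    MightContain(id string) bool
    Add(id string)
}

type Store interface {
    Has(id string) (bool, error)
    Put(id string) error
}

// Seen reports whether messageID has been processed before, consulting the
// Bloom filter first so that definitely-new keys never hit PebbleDB.
func Seen(f Filter, s Store, messageID string) (bool, error) {
    if !f.MightContain(messageID) {
        // Definite miss: record the key and skip the store read entirely.
        f.Add(messageID)
        return false, s.Put(messageID)
    }
    // Possible false positive: confirm against the persistent store.
    seen, err := s.Has(messageID)
    if err != nil {
        return false, err
    }
    if !seen {
        return false, s.Put(messageID)
    }
    return true, nil
}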

Documentation

| Document | Description |
|---|---|
| ARCHITECTURE.md | Complete system architecture & design |
| PROJECT_STRUCTURE.md | Directory layout & file formats |
| MVP_BUILD_GUIDE.md | Build, deployment & testing guide |
| IMPLEMENTATION_SUMMARY.md | Implementation details & status |
| proto/events.proto | Complete API specification |

Performance

Benchmarks (3-Node Cluster)

| Metric | Value | Notes |
|---|---|---|
| Cluster Throughput | 303,351 events/sec | Batch mode, 100 events/batch |
| Per-Node Throughput | 101,117 events/sec | 3 nodes, round-robin |
| Publish Latency P50 | 225µs | Batch publish |
| Publish Latency P95 | 607µs | Batch publish |
| Publish Latency P99 | 739µs | Batch publish |
| Success Rate | 100% | Zero errors |
| Scheduler Tick | 100ms | Configurable (1-1000ms) |

Single Node Performance

| Metric | Value |
|---|---|
| Write Throughput (batch) | ~100K events/sec |
| Write Throughput (single) | ~10K events/sec |
| Latency P99 (batch) | <1ms |

Performance Optimizations Applied

  1. Lock-Free Bloom Filter - Atomic CAS operations, skips PebbleDB for new keys (a minimal sketch follows this list)
  2. Batch APIs - PublishBatch for 100-500 events per call
  3. Batch Scheduling - Single lock acquisition per batch
  4. PebbleDB Tuning - 64MB memtable, disabled internal WAL, NoSync
  5. WAL Buffering - 1MB buffered writer, batch append
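
As a rough illustration of optimization 1, the sketch below sets Bloom-filter bits with atomic compare-and-swap so concurrent publishers never contend on a mutex. Sizes, hash choice, and names are illustrative assumptions, not the actual internal/dedup filter.

package bloom

import (
    "hash/fnv"
    "sync/atomic"
)

// Filter is a fixed-size Bloom filter whose bits are set with atomic CAS,
// so concurrent writers never take a lock.
type Filter struct {
    words []uint32
    k     int // number of hash probes per key
}

func New(bits, k int) *Filter {
    return &Filter{words: make([]uint32, (bits+31)/32), k: k}
}

// positions derives k bit positions via double hashing of an FNV-64 digest.
func (f *Filter) positions(key string) []uint32 {
    h := fnv.New64a()
    h.Write([]byte(key))
    v := h.Sum64()
    h1, h2 := uint32(v), uint32(v>>32)
    pos := make([]uint32, f.k)
    for i := 0; i < f.k; i++ {
        pos[i] = (h1 + uint32(i)*h2) % uint32(len(f.words)*32)
    }
    return pos
}

// Add sets the key's bits, retrying each CAS until the bit is observed set.
func (f *Filter) Add(key string) {
    for _, p := range f.positions(key) {
        word, mask := p/32, uint32(1)<<(p%32)
        for {
            old := atomic.LoadUint32(&f.words[word])
            if old&mask != 0 || atomic.CompareAndSwapUint32(&f.words[word], old, old|mask) {
                break
            }
        }
    }
}

// MightContain reports false only if the key was definitely never added.
func (f *Filter) MightContain(key string) bool {
    for _, p := range f.positions(key) {
        if atomic.LoadUint32(&f.words[p/32])&(1<<(p%32)) == 0 {
            return false
        }
    }
    return true
}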

Benchmark Command: make loadtest-batch PUBLISHERS=30 EVENTS=3333 BATCH_SIZE=100

Use Cases

  1. Scheduled Tasks - Execute workflows at specific times
  2. Event Sourcing - Durable event stream with replay
  3. Temporal Workflows - Time-based business logic
  4. Distributed Cron - Cluster-wide scheduled execution
  5. Time-Series Events - Ordered event streams
  6. Message Queue - Durable pub/sub with scheduling

Configuration

Essential Flags

-node-id=string          # Node identifier (required)
-data-dir=string         # Data directory (default: "./data")
-grpc-addr=string        # gRPC address (default: ":9000")

# WAL
-segment-size=bytes      # Segment size (default: 512MB)
-fsync-mode=mode         # Fsync mode: every_event|batch|periodic (default: periodic)

# Scheduler
-tick-ms=int             # Tick duration in ms (default: 100)
-wheel-size=int          # Timing wheel size (default: 60)

# Delivery
-ack-timeout=duration    # Ack timeout (default: 30s)
-max-retries=int         # Max delivery retries (default: 5)
-retry-backoff=duration  # Retry backoff (default: 1s)
-max-credits=int         # Max delivery credits (default: 1000)

# Dedup
-dedup-ttl=int           # Dedup TTL in hours (default: 168, i.e. 7 days)
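
For reference, flags like these would typically be wired up with Go's standard flag package. The snippet below mirrors the list above with the documented defaults; the struct and function names are illustrative, not the actual cmd/api/main.go.

package main

import (
    "flag"
    "time"
)

// config mirrors the essential flags listed above, with the documented defaults.
type config struct {
    nodeID       string
    dataDir      string
    grpcAddr     string
    segmentSize  int64
    fsyncMode    string
    tickMs       int
    wheelSize    int
    ackTimeout   time.Duration
    maxRetries   int
    retryBackoff time.Duration
    maxCredits   int
    dedupTTLHrs  int
}

func parseFlags() *config {
    c := &config{}
    flag.StringVar(&c.nodeID, "node-id", "", "Node identifier (required)")
    flag.StringVar(&c.dataDir, "data-dir", "./data", "Data directory")
    flag.StringVar(&c.grpcAddr, "grpc-addr", ":9000", "gRPC address")
    flag.Int64Var(&c.segmentSize, "segment-size", 512<<20, "WAL segment size in bytes")
    flag.StringVar(&c.fsyncMode, "fsync-mode", "periodic", "Fsync mode: every_event|batch|periodic")
    flag.IntVar(&c.tickMs, "tick-ms", 100, "Scheduler tick duration in ms")
    flag.IntVar(&c.wheelSize, "wheel-size", 60, "Timing wheel size")
    flag.DurationVar(&c.ackTimeout, "ack-timeout", 30*time.Second, "Ack timeout")
    flag.IntVar(&c.maxRetries, "max-retries", 5, "Max delivery retries")
    flag.DurationVar(&c.retryBackoff, "retry-backoff", time.Second, "Retry backoff")
    flag.IntVar(&c.maxCredits, "max-credits", 1000, "Max delivery credits")
    flag.IntVar(&c.dedupTTLHrs, "dedup-ttl", 168, "Dedup TTL in hours")
    flag.Parse()
    return c
}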

Project Structure

cronos_db/
├── cmd/
│   └── api/
│       └── main.go              # Main entry point
├── internal/
│   ├── api/                     # gRPC server & handlers
│   ├── partition/               # Partition management
│   ├── storage/                 # WAL, segments & sparse index
│   ├── scheduler/               # Timing wheel
│   ├── delivery/                # Event delivery & DLQ
│   ├── consumer/                # Consumer groups
│   ├── dedup/                   # Deduplication
│   ├── replay/                  # Replay engine
│   ├── replication/             # Leader-follower
│   └── config/                  # Configuration
├── pkg/
│   ├── types/                   # Shared types & protobuf
│   └── utils/                   # Utility functions
├── proto/
│   └── events.proto             # Protobuf schema
├── integration_test_suite.go    # Integration tests (23 tests)
├── ARCHITECTURE.md
├── PROJECT_STRUCTURE.md
├── MVP_BUILD_GUIDE.md
├── IMPLEMENTATION_SUMMARY.md
└── README.md

Status

MVP ✅ Complete

  • Single-node operation
  • WAL storage with segments
  • Sparse index for WAL seeking
  • Timing wheel scheduler
  • gRPC pub/sub
  • Deduplication (Bloom filter + PebbleDB)
  • Consumer groups
  • Replay engine
  • Delivery worker
  • Dead letter queue
  • Unit tests (scheduler, WAL, dedup)
  • Integration tests (23 tests)

Distributed ✅ Complete

  • Multi-node clustering (3+ nodes)
  • Leader-follower replication
  • Raft consensus for metadata
  • Multi-partition support (8 partitions default)
  • Consistent hashing for partition routing
  • Cluster membership & discovery

Performance ✅ Optimized

  • Batch publish API (100-500 events/call)
  • Lock-free bloom filter deduplication
  • Batch WAL writes (single syscall per batch)
  • Batch scheduling (single lock per batch)
  • PebbleDB tuning (64MB memtable, NoSync)
  • Timer pooling with sync.Pool
  • 300K+ events/sec achieved 🚀

Production Hardening 🚧

  • Metrics & monitoring (Prometheus/OpenTelemetry)
  • Distributed tracing
  • Rate limiting & quota management
  • Graceful shutdown & draining
  • Backup & restore utilities
  • Admin CLI & dashboard

Technology Stack

  • Language: Go 1.24+
  • gRPC: High-performance RPC with streaming
  • Storage Engine: PebbleDB (LSM tree, CockroachDB) with 64MB memtable
  • Consensus: HashiCorp Raft for metadata
  • Serialization: Protocol Buffers
  • Concurrency: Lock-free bloom filter, sync.Pool, atomic operations

Contributing

This is a reference implementation for educational purposes. The code demonstrates production-ready patterns for distributed systems design.

License

Apache 2.0

Author

Designed and implemented following best practices for production distributed systems.


CronosDB - Where time meets data. ⏰📊
