A Java + Spring Boot REST API built as a performance optimization challenge. The constraints were deliberately strict:
- Single node — no load balancer, no horizontal scaling. One machine runs the API, the database, and the cache.
- Hard resource limits — each container is capped via Docker Compose to simulate real cloud instance constraints.
- GET requests under 100ms — the target for all read endpoints under sustained load.
The goal wasn't to build a production-ready system but to understand exactly where the limits are and what each optimization actually buys you.
| Layer | Technology |
|---|---|
| Runtime | Java 25 |
| Framework | Spring Boot 4.0.3 |
| Database | PostgreSQL 18 |
| Cache / Queue | Redis 7 |
| Connection Pool | HikariCP |
| Migrations | Flyway |
| Containerization | Docker Compose |
| Load Testing | Locust + Faker |
All containers run on the same machine with hard CPU and memory limits set in Docker Compose:
| Container | CPUs | RAM |
|---|---|---|
| App (Spring Boot) | 2 vCPUs | 4 GB |
| PostgreSQL | 2 vCPUs | 4 GB |
| Redis | 1 vCPU | 1 GB |
| Total | 5 vCPUs | 9 GB |
No resources are shared between containers — each is capped independently. This makes the bottlenecks reproducible and comparable across test runs.
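In Compose, caps like these are declared per service under `deploy.resources.limits` (honored outside Swarm when running with `--compatibility`, as the run command below does). A sketch, with service names assumed:

```yaml
services:
  app:
    deploy:
      resources:
        limits:
          cpus: "2"
          memory: 4g
  postgres:
    deploy:
      resources:
        limits:
          cpus: "2"
          memory: 4g
  redis:
    deploy:
      resources:
        limits:
          cpus: "1"
          memory: 1g
```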
| Method | Endpoint | Description |
|---|---|---|
| GET | `/api/products` | List products with optional fuzzy search |
| GET | `/api/products/{id}` | Get product by ID |
| POST | `/api/products` | Create product |
| PUT | `/api/products/{id}` | Update product (async) |
| DELETE | `/api/products/{id}` | Delete product (async) |
| Method | Endpoint | Description |
|---|---|---|
| GET | `/api/orders` | Paginated order history |
| GET | `/api/orders/{id}` | Get order with items |
| POST | `/api/orders` | Place an order |
| PATCH | `/api/orders/{id}/cancel` | Cancel a pending order (async) |
The default pool of 10 connections was the first bottleneck. Under concurrent load, requests queued waiting for a free connection, which inflated latency across every endpoint. Increased the pool size and disabled open-in-view, which was keeping connections open during response serialization — long after the actual queries had finished.
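Both fixes are one-line Spring properties; the pool size of 80 matches the `HIKARI_MAX_POOL_SIZE` default listed in the configuration section below:

```yaml
spring:
  jpa:
    open-in-view: false        # release connections before response serialization
  datasource:
    hikari:
      maximum-pool-size: 80    # default is 10
```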
Product and order lookups were hitting the database on every request. Added caching via Redis with TTL-based expiration and explicit eviction on writes (@CachePut on update, @CacheEvict on delete). Read endpoints dropped to sub-10ms response times after the cache warmed up.
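A sketch of the annotation wiring, assuming a `Product` entity and a Spring Data `ProductRepository` (names illustrative; the TTL is configured on the Redis cache manager, not on the annotations):

```java
import java.util.UUID;
import org.springframework.cache.annotation.CacheEvict;
import org.springframework.cache.annotation.CachePut;
import org.springframework.cache.annotation.Cacheable;
import org.springframework.stereotype.Service;

@Service
public class ProductService {

    private final ProductRepository repository;

    public ProductService(ProductRepository repository) {
        this.repository = repository;
    }

    // Cache hit returns straight from Redis; a miss falls through to the DB
    @Cacheable(cacheNames = "products", key = "#id")
    public Product findById(UUID id) {
        return repository.findById(id).orElseThrow();
    }

    // Refresh the cached entry with the new state instead of evicting it
    @CachePut(cacheNames = "products", key = "#result.id")
    public Product update(Product product) {
        return repository.save(product);
    }

    // Drop the stale entry so the next read goes back to the DB
    @CacheEvict(cacheNames = "products", key = "#id")
    public void delete(UUID id) {
        repository.deleteById(id);
    }
}
```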
The original search used exact LIKE matching, which required a full table scan. Replaced it with pg_trgm, a PostgreSQL extension that enables partial and fuzzy matching through GIN indexes. It supports typo tolerance and partial terms without sacrificing index usage.
```sql
CREATE EXTENSION IF NOT EXISTS pg_trgm;
CREATE INDEX idx_products_name_trgm ON products USING gin (name gin_trgm_ops);
```

PUT, DELETE, and PATCH operations were blocking DB connections for 1800–1900ms under load. Moved them to Redis Streams: controllers return 202 Accepted immediately and a background worker performs the actual DB write. This freed the connection pool for the higher-volume POST and GET operations.
- Before: `PUT /products/{id}` → p50 = 1900ms
- After: `PUT /products/{id}` → p50 = 2ms
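The hand-off might look like this sketch using Spring Data Redis (stream key, class names, and payload shape are assumptions; the consuming side would be a `StreamMessageListenerContainer` or a scheduled reader that applies the write to the DB):

```java
import java.util.Map;
import java.util.UUID;
import org.springframework.data.redis.core.StringRedisTemplate;
import org.springframework.http.ResponseEntity;
import org.springframework.web.bind.annotation.PathVariable;
import org.springframework.web.bind.annotation.PutMapping;
import org.springframework.web.bind.annotation.RequestBody;
import org.springframework.web.bind.annotation.RequestMapping;
import org.springframework.web.bind.annotation.RestController;

@RestController
@RequestMapping("/api/products")
public class ProductWriteController {

    private static final String STREAM_KEY = "product-updates";

    private final StringRedisTemplate redis;

    public ProductWriteController(StringRedisTemplate redis) {
        this.redis = redis;
    }

    // Enqueue the write and return immediately; the DB is never touched
    // on the request thread, so no pool connection is held.
    @PutMapping("/{id}")
    public ResponseEntity<Void> update(@PathVariable UUID id, @RequestBody String body) {
        redis.opsForStream().add(STREAM_KEY, Map.of("id", id.toString(), "payload", body));
        return ResponseEntity.accepted().build();  // 202 Accepted
    }
}
```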
Fetching paginated order lists with JOIN FETCH caused Hibernate to ignore LIMIT/OFFSET and load everything into memory before paginating in Java. Fixed with a two-query approach: fetch only the IDs for the current page first, then fetch full data for those IDs only.
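The repository side of that approach might look like the following sketch (query strings and the `createdAt` field are assumptions, not taken from the repo):

```java
import java.util.List;
import java.util.UUID;
import org.springframework.data.domain.Pageable;
import org.springframework.data.jpa.repository.JpaRepository;
import org.springframework.data.jpa.repository.Query;
import org.springframework.data.repository.query.Param;

public interface OrderRepository extends JpaRepository<Order, UUID> {

    // Page over bare IDs so LIMIT/OFFSET is applied in SQL, not in memory
    @Query("select o.id from Order o order by o.createdAt desc")
    List<UUID> findPagedIds(Pageable pageable);

    // Hydrate only the current page, join-fetching the items collection
    @Query("select distinct o from Order o join fetch o.items where o.id in :ids")
    List<Order> findWithItemsByIds(@Param("ids") List<UUID> ids);
}
```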
```java
// Query 1: paginate IDs at the DB level (fast)
List<UUID> ids = orderRepository.findPagedIds(pageable);

// Query 2: fetch full data for those IDs only
List<Order> orders = orderRepository.findWithItemsByIds(ids);
```

Switched from the default GC to ZGC for lower pause times; on recent JDKs (23+) ZGC runs in generational mode by default. Fixed heap and metaspace bounds keep memory pressure from affecting PostgreSQL and Redis on the same machine.
```dockerfile
ENV JAVA_OPTS="\
  -XX:+UseZGC \
  -Xms256m \
  -Xmx1g \
  -XX:MaxMetaspaceSize=256m \
  -XX:+AlwaysPreTouch \
  -Djava.security.egd=file:/dev/./urandom"
```

Tests run with a stepped load shape: users ramp from 50 to the target number over time, holding each level for 60 seconds. Each virtual user seeds its own products and orders on startup (no shared state between users).
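A stepped shape like this is typically implemented with Locust's `LoadTestShape`. The sketch below keeps the schedule in a pure function; the exact step levels are illustrative, not taken from the repo's locustfile:

```python
# Illustrative step schedule: hold each level for 60 s, climbing from
# 50 users to the 4,000-user ceiling over roughly 8 minutes.
LEVELS = (50, 500, 1000, 1500, 2000, 2500, 3000, 3500, 4000)
HOLD_SECONDS = 60

def users_at(elapsed_seconds: float) -> int:
    """Target concurrent users for a given elapsed run time."""
    idx = min(int(elapsed_seconds // HOLD_SECONDS), len(LEVELS) - 1)
    return LEVELS[idx]

# Inside a locust.LoadTestShape subclass, tick() would return
# (users_at(self.get_run_time()), spawn_rate), or None to end the run.
```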
At 3,000 concurrent users (fully stable, zero failures):

| Endpoint | p50 | p95 | p99 | RPS |
|---|---|---|---|---|
| GET `/api/orders` | 1ms | 5ms | 26ms | 569.8 |
| GET `/api/products/{id}` | 1ms | 5ms | 25ms | 499.0 |
| GET `/api/orders/{id}` | 1ms | 5ms | 29ms | 426.8 |
| POST `/api/orders` | 3ms | 23ms | 61ms | 295.7 |
| POST `/api/products` | 3ms | 51ms | 84ms | 74.2 |
| GET `/api/products?name=` | 2ms | 45ms | 68ms | 28.8 |
| Aggregated | 2ms | 8ms | 41ms | 2,015 |
At 4,000 concurrent users (saturation point):

| Endpoint | p50 | p95 | p99 | RPS |
|---|---|---|---|---|
| GET `/api/orders` | 1ms | 14ms | 43ms | 682.3 |
| GET `/api/products/{id}` | 1ms | 18ms | 1,200ms | 599.2 |
| GET `/api/orders/{id}` | 2ms | 1,200ms | 2,000ms | 513.1 |
| POST `/api/orders` | 4ms | 1,800ms | 2,400ms | 354.9 |
| POST `/api/products` | 14ms | 1,700ms | 2,200ms | 107.9 |
| GET `/api/products?name=` | 3ms | 1,700ms | 2,400ms | 34.4 |
| Aggregated | 2ms | 910ms | 1,900ms | 2,442 |
The degradation from 3k to 4k is not gradual. At 3k the system is fully stable with zero failures. At 4k, the DB connection pool saturates under write pressure, requests begin queuing, and p95 latency jumps from 8ms to 910ms. The p50 stays at 2ms because cached read endpoints aren't affected — only requests that reach the database collapse. The 41 failures all correspond to requests hitting the 5s timeout ceiling while queued for a connection.
```bash
# Clone and build
git clone https://github.com/your-username/fastshop.git
cd fastshop

# Start everything (app + postgres + redis)
docker compose --compatibility up --build

# API available at http://localhost:8080
```

Environment variables (with defaults):
| Variable | Default | Description |
|---|---|---|
| `DB_URL` | `jdbc:postgresql://localhost:5432/mydatabase` | PostgreSQL URL |
| `DB_USERNAME` | `myuser` | DB username |
| `DB_PASSWORD` | `secret` | DB password |
| `REDIS_HOST` | `localhost` | Redis host |
| `REDIS_PORT` | `6379` | Redis port |
| `HIKARI_MAX_POOL_SIZE` | `80` | Max DB connections |
```bash
pip install locust faker
locust --processes 4 -f products_insert_search.py --host=http://localhost:8080
```

Open http://localhost:8089 to start the test. The stepped load shape automatically ramps from 50 to 4,000 users over ~8 minutes.
The natural next steps to push beyond a single machine:
- PgBouncer — connection pooler in front of PostgreSQL, dramatically reduces actual DB connections needed
- Multiple app instances + Nginx — horizontal scaling, linear throughput increase
- PostgreSQL read replicas — route ~70% read traffic to replicas, writes to primary
- WebFlux + R2DBC — fully reactive stack, better resource utilization under extreme concurrency