
AWS Lambda ARM vs x86 Benchmark

This repository contains a benchmark suite comparing AWS Lambda cost and performance on x86_64 and ARM (Graviton2 Neoverse N1) CPU architectures as of late 2025. It is inspired by the 2023 AWS blog post Comparing AWS Lambda Arm vs. x86 Performance, Cost, and Analysis, which uses older, now-unsupported runtimes.

This benchmark reimplements similar single-threaded workloads in Node.js, Python, and Rust using currently supported AWS runtimes, with a focus on realistic but controlled workloads and repeatable measurement.

Supported Runtimes

  • Python: 3.13, 3.12, 3.11
  • Node.js: 22, 20
  • Rust: Custom runtime on al2023 (Amazon Linux 2023) + rustc 1.91.0

Workloads

See docs/benchmark-design.md for full details.

  • CPU-intensive

    • SHA-256 hashing loop (500k iterations) to exercise raw compute performance (sketched in Python after this list).
    • No dependency on AWS SDKs.
  • Memory-intensive

    • Allocates a fixed 100 MB data structure and sorts it as a memory-heavy workload.
    • No dependency on AWS SDKs.
  • Light (I/O)

    • Writes 5 items to DynamoDB in a batch, then reads 5 items in a batch.
    • Uses the AWS SDK and represents a realistic, light “business logic + I/O” workload.
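
For illustration, here is a minimal Python sketch of the CPU-intensive handler's shape. This is a hypothetical reconstruction from the description above, not the repo's actual handler code:

import hashlib

ITERATIONS = 500_000  # matches the 500k iterations described above

def handler(event, context):
    digest = b"seed"
    # Chain each digest into the next hash so the loop cannot be skipped.
    for _ in range(ITERATIONS):
        digest = hashlib.sha256(digest).digest()
    # No timing code here: durations come from CloudWatch REPORT logs
    # (see "Metrics Collected" below), keeping the handler overhead-free.
    return {"iterations": ITERATIONS, "digest": digest.hex()}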

Test Matrix:

  • 6 runtimes × 2 architectures (ARM64/x86_64) × 3 workloads = 36 Lambda functions (enumerated in the sketch after this list)
  • 3 workload types: CPU-intensive (SHA-256 hashing), Memory-intensive (array sorting), Light (DynamoDB I/O)
  • Multiple memory configurations (128 MB to 10240 MB)
  • Cold start vs warm start measurements
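
The full matrix can be enumerated mechanically. The function-name format below is hypothetical (the repo's actual naming may differ); only the counts come from the list above:

from itertools import product

RUNTIMES = ["python3.13", "python3.12", "python3.11", "nodejs22", "nodejs20", "rust"]
ARCHITECTURES = ["arm64", "x86_64"]
WORKLOADS = ["cpu-intensive", "memory-intensive", "light"]

functions = [
    f"benchmark-{r}-{a}-{w}"  # hypothetical naming scheme
    for r, a, w in product(RUNTIMES, ARCHITECTURES, WORKLOADS)
]
assert len(functions) == 36  # 6 x 2 x 3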

Architecture:

  • Zero-overhead metrics collection (CloudWatch REPORT parsing)
  • Dynamic memory configuration (no redeployment needed; sketched after this list)
  • DynamoDB storage with pre-computed aggregates
  • Parallel execution across all functions
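
As an illustration of the dynamic memory configuration mentioned above, here is a minimal boto3 sketch (an assumption about the approach, not the orchestrator's actual code). Note that a configuration update also recycles execution environments, so the next invocation is a cold start:

import boto3

lambda_client = boto3.client("lambda")

def set_memory(function_name: str, memory_mb: int) -> None:
    # Change the memory setting in place; no redeployment required.
    lambda_client.update_function_configuration(
        FunctionName=function_name,
        MemorySize=memory_mb,
    )
    # Block until the update has been applied before invoking again.
    lambda_client.get_waiter("function_updated_v2").wait(FunctionName=function_name)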

📖 Full design documentation in /docs

Quick Start

Prerequisites

  • AWS account with credentials configured (aws configure)
  • Node.js 18+
  • Python 3.11+ (via uv - recommended)
  • AWS CDK CLI: npm install -g aws-cdk
  • uv package manager: https://docs.astral.sh/uv/getting-started/installation/
  • For Rust handlers: cargo-lambda or Docker (automatic via CDK)

Install

git clone <this-repo>
cd aws-lambda-performance-benchmarks
npm install && npm run build

Deploy

npm run deploy

Deploys 36 Lambda functions, 2 DynamoDB tables, and supporting infrastructure to us-east-2 (configurable via AWS_REGION).

Cleanup

cdk destroy LambdaBenchmarkStack

Run Benchmarks

Choose a test mode based on your needs:

Test mode (quick validation - 2 cold + 2 warm per config, ~10 min):

npm run benchmark:test

Balanced mode (recommended for publication - 50 cold + 200 warm per config, ~1 hour, ~$2-4):

uv run python scripts/benchmark_orchestrator.py --balanced

Production mode (maximum rigor - 100 cold + 500 warm per config, ~18-24 hours, ~$5-10):

uv run python scripts/benchmark_orchestrator.py --production

Each mode tests all 36 functions across multiple memory configurations. Higher modes provide better statistical confidence.

Running Long Benchmarks (Balanced/Production Mode)

IMPORTANT: For Balanced (~1 hour) and Production (~18-24 hours) modes, AWS SSO tokens may expire mid-test, causing benchmark failures.

Recommended Solution: Run benchmarks on an EC2 instance with an IAM instance profile (no credential expiration).

Prerequisites:

  • LambdaBenchmarkStack must already be deployed (npm run deploy)
  • IAM permissions to create EC2 instances, IAM roles, and security groups

Usage:

# Launch EC2 instance that auto-runs benchmark and terminates when complete
uv run python scripts/run_benchmark_on_ec2.py --mode balanced
# Production mode (~18-24 hours)
uv run python scripts/run_benchmark_on_ec2.py --mode production
# Keep instance alive for debugging
uv run python scripts/run_benchmark_on_ec2.py --mode balanced --keep-alive
# Upload results to S3
uv run python scripts/run_benchmark_on_ec2.py --mode production --s3-bucket my-results-bucket

Benefits:

  • No SSO token expiration issues
  • Runs in background (immune to laptop sleep/network issues)
  • Auto-terminates after completion (unless --keep-alive specified)
  • Cost-effective: t4g.micro instance (a conceptual launch sketch follows this list)
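
Conceptually, the launcher boils down to something like this boto3 sketch (a simplification; the AMI ID and instance-profile name are placeholders, and the real logic lives in scripts/run_benchmark_on_ec2.py):

import boto3

USER_DATA = """#!/bin/bash
# ... install dependencies and run the benchmark (elided) ...
shutdown -h now  # with 'terminate' shutdown behavior, this ends the instance
"""

ec2 = boto3.client("ec2")
ec2.run_instances(
    ImageId="ami-xxxxxxxxxxxxxxxxx",  # placeholder: an Amazon Linux 2023 arm64 AMI
    InstanceType="t4g.micro",
    MinCount=1,
    MaxCount=1,
    IamInstanceProfile={"Name": "benchmark-instance-profile"},  # placeholder name
    InstanceInitiatedShutdownBehavior="terminate",  # auto-terminate on shutdown
    UserData=USER_DATA,
)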

Monitor progress:

# View instance status
aws ec2 describe-instance-status --instance-ids <instance-id>
# Stream benchmark logs (once instance is running)
aws logs tail /var/log/cloud-init-output.log --follow
# Open a shell on the instance (via AWS Systems Manager Session Manager)
aws ssm start-session --target <instance-id>

Alternative (Local Execution): If running locally, extend your SSO session duration before starting:

aws sso login --profile <your-profile>  # Refresh token before benchmark

Note: SSO session duration is configurable in your AWS SSO settings (typically 8-12 hours max). For Production mode (18-24 hours), EC2 execution is strongly recommended.

Analyze Results

After running a benchmark, analyze the results:

# Analyze all results for a test run
uv run python scripts/analyze_results.py <test-run-id>
# Filter by specific dimensions
uv run python scripts/analyze_results.py <test-run-id> --runtime python3.13
uv run python scripts/analyze_results.py <test-run-id> --workload cpu-intensive
uv run python scripts/analyze_results.py <test-run-id> --architecture arm64

Output includes:

  • Performance comparisons (ARM vs x86, runtime vs runtime)
  • Cold start vs warm start analysis
  • Memory scaling charts
  • Cost efficiency calculations (the formula sketch after this list shows the arithmetic)
  • Statistical summaries (mean and p50/p90/p95/p99 percentiles)
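
The cost-efficiency arithmetic follows standard Lambda billing: billed GB-seconds = (billed duration / 1000) × (memory / 1024), multiplied by a per-architecture rate. The rates below are illustrative us-east prices at the time of writing; check current AWS Lambda pricing before relying on them:

PRICE_PER_GB_SECOND = {"x86_64": 0.0000166667, "arm64": 0.0000133334}  # illustrative

def invocation_cost(billed_ms: float, memory_mb: int, arch: str) -> float:
    gb_seconds = (billed_ms / 1000.0) * (memory_mb / 1024.0)
    return gb_seconds * PRICE_PER_GB_SECOND[arch]

# Example: 103 ms billed at 512 MB on arm64 -> about $0.00000069 per invocation
print(f"{invocation_cost(103, 512, 'arm64'):.10f}")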

Metrics Collected

For each invocation, the benchmark extracts from CloudWatch REPORT logs:

  • Execution duration (ms) - Actual function runtime
  • Billed duration (ms) - Rounded for AWS billing
  • Max memory used (MB) - Peak memory consumption
  • Init duration (ms) - Cold start initialization time (cold starts only)
  • Client latency (ms) - End-to-end time from orchestrator

Results are stored in DynamoDB with pre-computed aggregates (mean, std dev, and p50/p90/p95/p99 percentiles) for fast analysis.

Zero overhead: Metrics are parsed from CloudWatch logs, NOT measured in-function, ensuring no performance impact on benchmarks.
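
A minimal sketch of what REPORT parsing looks like (the field layout follows the standard Lambda REPORT format; the repo's actual parser may differ):

import re

REPORT_PATTERN = re.compile(
    r"REPORT RequestId: (?P<request_id>\S+)\s+"
    r"Duration: (?P<duration_ms>[\d.]+) ms\s+"
    r"Billed Duration: (?P<billed_ms>\d+) ms\s+"
    r"Memory Size: (?P<memory_mb>\d+) MB\s+"
    r"Max Memory Used: (?P<max_memory_mb>\d+) MB"
    r"(?:\s+Init Duration: (?P<init_ms>[\d.]+) ms)?"  # present on cold starts only
)

def parse_report(line: str) -> dict | None:
    match = REPORT_PATTERN.search(line)
    return match.groupdict() if match else None

sample = ("REPORT RequestId: abc123 Duration: 102.25 ms "
          "Billed Duration: 103 ms Memory Size: 128 MB "
          "Max Memory Used: 64 MB Init Duration: 202.50 ms")
print(parse_report(sample))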

Documentation

Design & Architecture:

  • docs/benchmark-design.md - workload and measurement methodology
  • DECISIONS.md - architecture decision records (ADRs)

Implementation:

Contributing

This is a research benchmark project. Feel free to fork it. If you find issues or have suggestions:

  1. Check existing ADRs in DECISIONS.md to understand design rationale
  2. Review benchmark-design.md for methodology
  3. Open an issue with your findings or proposed changes

Credits & References

  • Comparing AWS Lambda Arm vs. x86 Performance, Cost, and Analysis (AWS Compute Blog, 2023) - the post that inspired this benchmark
