A Reproducible Benchmark of Apache IoTDB, InfluxDB, and TimescaleDB for IoT Workloads

Undergraduate thesis (TCC) project comparing three time-series databases — Apache IoTDB, InfluxDB, and TimescaleDB — across nine workload scenarios representative of real IoT deployments, presented for approval in the Computer Science program at Universidade Federal da Fronteira Sul (UFFS) in 2026.

Overview

IoT systems produce continuous streams of sensor readings that must be ingested at high rates and queried efficiently for monitoring and analytics. Different databases take fundamentally different architectural approaches to this problem:

Apache IoTDB is purpose-built for IoT time-series data, using a custom columnar file format (TsFile) and a native binary RPC protocol (Thrift SESSION_BY_TABLET).
InfluxDB 2.x is a purpose-built time-series database using the TSM (Time-Structured Merge Tree) storage engine, exposed via an HTTP line-protocol API.
TimescaleDB is a PostgreSQL extension that adds time-series capabilities (hypertables, time_bucket aggregation) on top of a full relational engine with WAL, MVCC, and B-tree indexes.

The benchmark uses iot-benchmark (Tsinghua University) as the workload generator, which simulates IoT sensor devices writing and querying time-series data.

Architecture

benchmark.py              ← CLI entry point (--db, --test, --scale)
benchrunner/
  config.py               ← connection settings + scale parameters
  influxdb.py             ← InfluxDB 2.x driver (updates config, invokes benchmark)
  timescaledb.py          ← TimescaleDB driver
  iotdb.py                ← IoTDB driver (delegates to iot_benchmark.py)
  iot_benchmark.py        ← IoTDB config writer + subprocess runner
  metrics.py              ← Docker stats monitor + output parser + result table
iot-benchmark/            ← git submodule (thulab/iot-benchmark)
  influxdb-2.0/           ← Java benchmark client for InfluxDB
  timescaledb/            ← Java benchmark client for TimescaleDB
  iotdb-1.3/              ← Java benchmark client for IoTDB
docker-compose.yml        ← IoTDB + InfluxDB + TimescaleDB containers
flake.nix                 ← Nix dev shell + build + benchmark apps
scripts/
  init-timescaledb.sql    ← Forces MD5 auth for the legacy JDBC driver bundled in iot-benchmark

How a benchmark run works:

benchmark.py starts each database container immediately before its tests begin and stops it once all its tests complete, so only one database is active at a time.
For each (database, test) pair, the Python driver writes the correct parameters to the Java benchmark client's conf/config.properties.
The Java client is invoked via bash benchmark.sh; a background thread polls docker stats every second to collect CPU and memory samples.
After the run, metrics.py parses the Result Matrix and Latency Matrix from the Java client's stdout, combines them with the Docker samples, and appends a row to the results table.

Tech Stack

Layer	Technology
Workload generator	iot-benchmark (Java, Maven)
Orchestration	Python 3
Database containers	Docker Compose
Dev environment	Nix flake (`nix develop`, `nix run`)
IoTDB	apache/iotdb:1.3.0-standalone
InfluxDB	influxdb:2.7-alpine
TimescaleDB	timescale/timescaledb:latest-pg15

Prerequisites

JDK 17
Maven
Docker and Docker Compose
Python 3
Docker daemon running

Setup

# Clone with submodule
git clone --recurse-submodules https://github.com/LuizFerK/iot-benchrunner
cd iot-benchrunner

Nix

# Enter dev shell (provides java, mvn, docker, python)
nix develop

# Build the Java benchmark clients (one-time)
nix run .#build

Non-Nix

# Build the Java benchmark clients (one-time)
cd iot-benchmark && mvn -pl influxdb-2.0,timescaledb,iotdb-1.3 package -DskipTests -q && cd ..

Running

# Full suite, all databases, small scale (~5 min)
python3 benchmark.py

# Single database, single test
python3 benchmark.py --db influxdb --test write

# All tests, specific database, medium scale
python3 benchmark.py --db timescaledb --scale medium

Options

Flag	Default	Values
`--db`	`all`	`all`, `influxdb`, `timescaledb`, `iotdb`
`--scale`	`small`	`small` (~5 min), `medium` (~30 min), `large` (~2 h)

--test (default all): all, write, out-of-order, batch-small, batch-large, read, latest-point, downsample, range-query, value-filter

Scale parameters

Scale	Clients	Devices	Sensors	Batch size	Loops
small	5	10	10	100	1000
medium	5	10	10	100	5000
large	10	50	20	100	5000

Generating Charts

After running benchmarks the result CSVs are written to results/. The charts.py script generates all figures from those files.

Dependencies

matplotlib, numpy, and pandas are required. Inside nix develop they are already available.

Usage

# All chart sets, English labels (default)
python3 charts.py

# Specific scale
python3 charts.py --source small
python3 charts.py --source medium
python3 charts.py --source large

# Rerun / isolation comparison charts
python3 charts.py --source rerun

# Cross-scale overlay plots
python3 charts.py --source comparison

# Portuguese labels
python3 charts.py --language pt-br

# Generate every available chart set at once
python3 charts.py --source all

`--source` values

Value	Description
`small`	Charts for the small-scale run only
`medium`	Charts for the medium-scale run only
`large`	Charts for the large-scale run only
`mixed`	Combined small + medium + large comparison
`all`	All of the above
`rerun`	Charts from the isolated/tuned reruns
`rerun-small` / `rerun-medium` / `rerun-large` / `rerun-mixed`	Per-scale rerun charts
`comparison`	Cross-scale overlay plots used in the paper

Charts are saved as PNG files in the charts/ directory, organized by source and language.

Test Types

When --test all is used, tests run in this order: write variants first (each clears and re-seeds the database), then read variants (which all operate on the data written by the baseline write test).

Write tests

Test	What it measures
`write`	Baseline sequential ingestion - ordered timestamps, default batch size
`out-of-order`	20% late-arriving data (Poisson distribution) - simulates network-delayed IoT sensors; reveals write-amplification from out-of-order handling
`batch-small`	`BATCH_SIZE=1` - one write request per data point; isolates per-request protocol overhead (HTTP round-trip, WAL flush, Thrift call)
`batch-large`	`BATCH_SIZE=1000` - large batches; tests bulk-loading efficiency and where each database's batching ceiling lies

Read tests

Test	Queries active	What it measures
`read`	All 10 supported types	Comprehensive read baseline - all analytics patterns equally weighted
`latest-point`	`LATEST_POINT`	Real-time monitoring - "current sensor value"; tests last-value index vs full reverse scan
`downsample`	`GROUP_BY`	Dashboard aggregation - time-bucketed averages; tests native downsampling push-down
`range-query`	`TIME_RANGE`	Bulk data export - sequential scan over a time window; tests decompression throughput
`value-filter`	`VALUE_RANGE`, `AGG_VALUE`, `AGG_RANGE_VALUE`	Threshold alerting - "readings above X"; tests predicate pushdown into the storage layer

Results

Column definitions

Column	Meaning
Time (s)	Wall-clock duration of the entire benchmark run
Throughput	Points ingested or returned per second, as reported by iot-benchmark
Avg Lat	Mean per-operation latency across all client threads
P99 Lat	99th-percentile latency — 99% of operations completed faster than this value; the remaining 1% took longer
Avg/Peak CPU	Average and maximum CPU usage of the database container sampled every second via `docker stats`
Avg/Peak Mem	Average and maximum RSS of the database container over the same interval

Small scale

DB	Test	Time (s)	Throughput	Avg Lat	P99 Lat	Avg CPU	Peak CPU	Avg Mem	Peak Mem
INFLUXDB	BATCH-SMALL	19.35	5168.85 pts/s	9.12 ms	14.74 ms	2.5%	4.3%	118.9 MB	125.5 MB
INFLUXDB	BATCH-LARGE	45.58	2193904.61 pts/s	22.00 ms	205.07 ms	22.0%	32.7%	596.7 MB	926.0 MB
INFLUXDB	OUT-OF-ORDER	36.66	272760.59 pts/s	17.72 ms	45.44 ms	5.9%	13.6%	318.8 MB	925.0 MB
INFLUXDB	WRITE	22.76	439381.68 pts/s	10.81 ms	39.77 ms	6.3%	18.2%	253.2 MB	320.7 MB
INFLUXDB	READ	28.98	3618.15 pts/s	28.22 ms	269.51 ms	41.7%	52.1%	305.3 MB	318.0 MB
INFLUXDB	LATEST-POINT	12.12	412.43 pts/s	10.85 ms	19.67 ms	34.1%	51.3%	301.1 MB	313.8 MB
INFLUXDB	DOWNSAMPLE	8.10	8021.25 pts/s	6.85 ms	14.36 ms	25.6%	47.2%	292.6 MB	297.8 MB
INFLUXDB	RANGE-QUERY	7.75	31606.13 pts/s	6.50 ms	14.36 ms	23.9%	44.4%	291.6 MB	292.9 MB
INFLUXDB	VALUE-FILTER	92.24	936.26 pts/s	89.85 ms	341.34 ms	46.4%	53.8%	316.7 MB	328.9 MB
TIMESCALEDB	BATCH-SMALL	22.45	4454.53 pts/s	10.67 ms	15.96 ms	2.3%	3.4%	5443.3 MB	5446.7 MB
TIMESCALEDB	BATCH-LARGE	622.87	160546.88 pts/s	302.36 ms	1069.12 ms	31.4%	39.5%	5474.6 MB	5496.8 MB
TIMESCALEDB	OUT-OF-ORDER	75.65	132189.13 pts/s	36.52 ms	250.11 ms	21.8%	32.5%	5478.6 MB	5494.8 MB
TIMESCALEDB	WRITE	74.81	133677.05 pts/s	36.05 ms	246.52 ms	23.8%	33.9%	5516.7 MB	5534.7 MB
TIMESCALEDB	READ	13.45	7945.08 pts/s	12.96 ms	275.42 ms	57.3%	83.6%	5545.3 MB	5560.3 MB
TIMESCALEDB	LATEST-POINT	1.61	3102.08 pts/s	0.50 ms	1.59 ms	7.4%	14.9%	5519.4 MB	5519.4 MB
TIMESCALEDB	DOWNSAMPLE	1.83	35454.63 pts/s	0.72 ms	1.84 ms	6.9%	20.8%	5523.8 MB	5532.7 MB
TIMESCALEDB	RANGE-QUERY	1.73	144191.68 pts/s	0.64 ms	1.66 ms	6.4%	19.2%	5523.5 MB	5531.6 MB
TIMESCALEDB	VALUE-FILTER	40.81	2157.80 pts/s	40.12 ms	251.06 ms	74.8%	90.3%	5552.9 MB	5561.3 MB
IOTDB	BATCH-SMALL	4.12	24268.33 pts/s	1.54 ms	1.19 ms	0.3%	0.7%	12469.4 MB	12482.6 MB
IOTDB	BATCH-LARGE	7.20	13897642.72 pts/s	2.91 ms	9.99 ms	6.6%	36.9%	12513.3 MB	12533.8 MB
IOTDB	OUT-OF-ORDER	5.22	1917405.53 pts/s	2.02 ms	2.73 ms	1.9%	10.3%	12544.0 MB	12564.5 MB
IOTDB	WRITE	4.69	2130911.33 pts/s	1.77 ms	1.99 ms	0.8%	3.8%	12573.3 MB	12574.7 MB
IOTDB	READ	3.47	29649.05 pts/s	2.26 ms	20.46 ms	15.2%	28.2%	12554.2 MB	12574.7 MB
IOTDB	LATEST-POINT	1.91	2617.04 pts/s	0.78 ms	2.81 ms	5.1%	14.8%	12550.8 MB	12554.2 MB
IOTDB	DOWNSAMPLE	2.19	29697.05 pts/s	1.07 ms	3.01 ms	4.8%	13.8%	12554.2 MB	12554.2 MB
IOTDB	RANGE-QUERY	2.48	100916.37 pts/s	1.35 ms	3.81 ms	5.1%	14.3%	12554.2 MB	12554.2 MB
IOTDB	VALUE-FILTER	5.03	17504.97 pts/s	3.79 ms	14.22 ms	17.4%	42.3%	12556.8 MB	12564.5 MB

Medium scale

DB	Test	Time (s)	Throughput	Avg Lat	P99 Lat	Avg CPU	Peak CPU	Avg Mem	Peak Mem
INFLUXDB	BATCH-SMALL	106.88	4678.00 pts/s	10.56 ms	36.41 ms	4.5%	7.0%	192.4 MB	207.7 MB
INFLUXDB	BATCH-LARGE	240.45	2079443.35 pts/s	23.81 ms	238.84 ms	23.7%	36.8%	1812.1 MB	3152.9 MB
INFLUXDB	OUT-OF-ORDER	200.75	249071.69 pts/s	19.88 ms	102.65 ms	7.6%	21.8%	693.3 MB	3145.7 MB
INFLUXDB	WRITE	137.49	363669.90 pts/s	13.59 ms	210.95 ms	8.8%	18.8%	502.9 MB	664.9 MB
INFLUXDB	READ	605.70	871.99 pts/s	121.21 ms	1330.58 ms	52.3%	57.5%	713.8 MB	790.5 MB
INFLUXDB	LATEST-POINT	117.40	212.95 pts/s	23.18 ms	32.34 ms	49.8%	54.3%	270.4 MB	285.8 MB
INFLUXDB	DOWNSAMPLE	37.34	8702.82 pts/s	7.17 ms	11.80 ms	38.7%	45.2%	252.1 MB	255.4 MB
INFLUXDB	RANGE-QUERY	38.67	31679.75 pts/s	7.45 ms	15.17 ms	35.6%	44.0%	254.3 MB	256.6 MB
INFLUXDB	VALUE-FILTER	2003.30	210.46 pts/s	396.15 ms	1363.74 ms	46.9%	50.2%	304.4 MB	335.2 MB
TIMESCALEDB	BATCH-SMALL	110.59	4521.20 pts/s	10.91 ms	36.46 ms	4.1%	6.1%	5450.6 MB	5458.9 MB
TIMESCALEDB	BATCH-LARGE	3104.23	161070.48 pts/s	308.38 ms	1036.76 ms	31.0%	40.0%	6042.1 MB	8177.7 MB
TIMESCALEDB	OUT-OF-ORDER	396.24	126186.45 pts/s	38.71 ms	263.93 ms	24.7%	36.4%	8160.1 MB	8178.7 MB
TIMESCALEDB	WRITE	396.41	126131.18 pts/s	38.72 ms	252.60 ms	26.0%	34.6%	8185.3 MB	8201.2 MB
TIMESCALEDB	READ	284.43	1891.76 pts/s	57.25 ms	1672.74 ms	77.4%	87.9%	8227.4 MB	8233.0 MB
TIMESCALEDB	LATEST-POINT	3.79	6588.95 pts/s	0.53 ms	1.56 ms	13.7%	34.7%	8184.6 MB	8192.0 MB
TIMESCALEDB	DOWNSAMPLE	4.54	71508.71 pts/s	0.67 ms	1.74 ms	13.9%	35.9%	8184.3 MB	8191.0 MB
TIMESCALEDB	RANGE-QUERY	4.09	305461.28 pts/s	0.59 ms	1.61 ms	13.5%	34.8%	8184.3 MB	8191.0 MB
TIMESCALEDB	VALUE-FILTER	891.75	482.07 pts/s	175.07 ms	1299.50 ms	76.6%	88.3%	8228.8 MB	8236.0 MB
IOTDB	BATCH-SMALL	6.26	79923.31 pts/s	0.51 ms	1.29 ms	4.9%	21.9%	12520.6 MB	12533.8 MB
IOTDB	BATCH-LARGE	25.17	19868484.80 pts/s	2.22 ms	10.28 ms	23.0%	48.5%	12573.5 MB	12595.2 MB
IOTDB	OUT-OF-ORDER	8.30	6021299.84 pts/s	0.67 ms	1.82 ms	8.4%	37.4%	12625.9 MB	12656.6 MB
IOTDB	WRITE	7.84	6378193.32 pts/s	0.65 ms	2.01 ms	6.5%	30.1%	12680.5 MB	12697.6 MB
IOTDB	READ	34.21	15215.53 pts/s	6.62 ms	78.49 ms	34.9%	45.6%	12699.8 MB	12707.8 MB
IOTDB	LATEST-POINT	4.98	5018.40 pts/s	0.75 ms	2.65 ms	13.6%	30.1%	12707.8 MB	12707.8 MB
IOTDB	DOWNSAMPLE	7.58	42888.90 pts/s	1.26 ms	4.06 ms	17.2%	35.0%	12707.8 MB	12707.8 MB
IOTDB	RANGE-QUERY	8.53	146541.98 pts/s	1.46 ms	4.40 ms	17.6%	32.9%	12707.8 MB	12707.8 MB
IOTDB	VALUE-FILTER	98.46	4366.15 pts/s	19.17 ms	73.48 ms	38.5%	41.5%	12715.2 MB	12718.1 MB

Large scale

DB	Test	Time (s)	Throughput	Avg Lat	P99 Lat	Avg CPU	Peak CPU	Avg Mem	Peak Mem
INFLUXDB	BATCH-SMALL	311.34	16059.65 pts/s	12.38 ms	40.09 ms	10.5%	16.3%	317.4 MB	393.6 MB
INFLUXDB	BATCH-LARGE	1335.88	3742857.63 pts/s	52.96 ms	374.60 ms	44.4%	74.1%	1607.4 MB	2573.3 MB
INFLUXDB	OUT-OF-ORDER	657.13	760888.17 pts/s	26.08 ms	254.05 ms	25.5%	51.1%	1012.2 MB	1494.0 MB
INFLUXDB	WRITE	424.39	1178162.42 pts/s	16.85 ms	170.92 ms	31.0%	44.8%	1073.5 MB	1395.7 MB
INFLUXDB	READ	847.93	1253.83 pts/s	168.61 ms	1808.87 ms	85.3%	91.3%	1034.4 MB	1517.6 MB
INFLUXDB	LATEST-POINT	174.01	287.34 pts/s	34.48 ms	50.98 ms	76.6%	86.2%	372.5 MB	389.5 MB
INFLUXDB	DOWNSAMPLE	50.96	12755.81 pts/s	9.91 ms	16.04 ms	69.8%	80.2%	356.1 MB	369.8 MB
INFLUXDB	RANGE-QUERY	47.75	51306.92 pts/s	9.26 ms	15.88 ms	67.2%	77.5%	353.8 MB	367.9 MB
INFLUXDB	VALUE-FILTER	2705.08	314.59 pts/s	537.48 ms	1859.91 ms	83.7%	88.8%	518.3 MB	577.1 MB
TIMESCALEDB	BATCH-SMALL	281.1	17787.12 pts/s	11.14 ms	40.78 ms	10.7%	14.1%	7061.2 MB	7065.6 MB
TIMESCALEDB	BATCH-LARGE	3600.0	421972.11 pts/s	473.67 ms	1639.20 ms	53.7%	77.8%	9428.0 MB	10291.2 MB
TIMESCALEDB	OUT-OF-ORDER	1811.92	275951.00 pts/s	71.10 ms	455.19 ms	36.5%	68.6%	9682.7 MB	12113.9 MB
TIMESCALEDB	WRITE	1393.68	358761.62 pts/s	54.36 ms	272.88 ms	47.4%	70.3%	9348.4 MB	9572.4 MB
TIMESCALEDB	READ	999.82	1083.29 pts/s	194.14 ms	4232.60 ms	97.7%	99.7%	9623.5 MB	9660.4 MB
TIMESCALEDB	LATEST-POINT	4.18	11964.72 pts/s	0.60 ms	1.44 ms	26.5%	68.7%	9455.1 MB	9468.9 MB
TIMESCALEDB	DOWNSAMPLE	4.85	133990.23 pts/s	0.74 ms	1.59 ms	28.5%	71.5%	9454.6 MB	9467.9 MB
TIMESCALEDB	RANGE-QUERY	4.49	556226.83 pts/s	0.67 ms	1.52 ms	27.4%	70.0%	9453.8 MB	9466.9 MB
TIMESCALEDB	VALUE-FILTER	3231.41	268.51 pts/s	629.15 ms	3945.32 ms	98.3%	100.0%	9630.1 MB	9658.4 MB
IOTDB	BATCH-SMALL	10.11	494743.40 pts/s	0.35 ms	0.50 ms	9.3%	41.6%	12165.1 MB	12185.6 MB
IOTDB	BATCH-LARGE	181.41	27562372.45 pts/s	6.93 ms	67.48 ms	47.0%	95.1%	15070.8 MB	15564.8 MB
IOTDB	OUT-OF-ORDER	30.63	16326483.43 pts/s	1.09 ms	2.04 ms	26.4%	64.2%	15517.3 MB	15564.8 MB
IOTDB	WRITE	28.55	17511476.59 pts/s	1.04 ms	1.48 ms	17.4%	56.4%	15556.7 MB	15575.0 MB
IOTDB	READ	36.92	28404.80 pts/s	7.07 ms	82.77 ms	70.2%	85.1%	15553.5 MB	15575.0 MB
IOTDB	LATEST-POINT	4.38	11427.87 pts/s	0.64 ms	1.35 ms	20.3%	56.5%	15751.7 MB	15820.8 MB
IOTDB	DOWNSAMPLE	9.94	65411.74 pts/s	1.74 ms	4.84 ms	47.3%	79.7%	15820.8 MB	15820.8 MB
IOTDB	RANGE-QUERY	7.47	334858.48 pts/s	1.25 ms	2.98 ms	38.4%	61.7%	15820.8 MB	15820.8 MB
IOTDB	VALUE-FILTER	113.56	7640.84 pts/s	22.29 ms	76.59 ms	76.6%	88.7%	15892.0 MB	15933.4 MB

Analysis

Write performance

IoTDB dominates write throughput at every batch size. The gap is explained by three compounding factors:

Protocol overhead. IoTDB's SESSION_BY_TABLET uses a binary Thrift RPC over TCP — one syscall submits a typed columnar tablet directly into the TsFile write buffer. InfluxDB requires an HTTP request with line-protocol text parsing. TimescaleDB goes through the full PostgreSQL stack: TCP, libpq, parser, planner, WAL write, MVCC row format.

Memory buffering. IoTDB pre-allocates a large JVM heap (~12 GB in these results) and buffers incoming tablets in memory before flushing to disk. It is essentially writing to RAM until the memtable fills. InfluxDB's TSM cache is capped at 1 GB by default. TimescaleDB writes synchronously through the WAL before acknowledging, offering stronger durability but lower raw throughput.

Batch size sensitivity. The batch-small vs batch-large comparison shows this clearly:

DB	BATCH-SMALL pts/s	BATCH-LARGE pts/s	Ratio
InfluxDB	5,169	2,193,905	424x
TimescaleDB	4,455	160,547	36x
IoTDB	24,268	13,897,643	573x

InfluxDB and IoTDB are extremely sensitive to batch size because of per-request overhead (HTTP round-trip and Thrift call respectively). TimescaleDB's lower ratio reflects PostgreSQL's WAL flush cost dominating even for small batches — the per-row overhead is already high, so larger batches help less.

Out-of-order writes. InfluxDB handles them nearly as fast as sequential writes (272K vs 439K pts/s). TimescaleDB is similar (132K vs 134K) since PostgreSQL's B-tree index updates are order-independent. IoTDB slows down slightly (1.9M vs 2.1M) because its LSM-tree must compact disordered entries into sorted TsFile chunks.

Read performance

The Throughput (pts/s) column for read tests sums the point counts of all active query types. This means read (10 types) appears higher than isolated single-type tests purely because more queries are running — it is not a sign that mixed workloads are faster. Latency is the meaningful metric for reads.

Value-filter queries are the clearest differentiator. Finding records that match a value predicate (sensor > threshold) cannot be answered from time-based indexes alone. Each database must either do a full chunk scan or maintain a secondary index:

DB	Value-filter avg lat	Value-filter P99 lat
IoTDB	3.79 ms	14.22 ms
TimescaleDB	40.12 ms	251.06 ms
InfluxDB	89.85 ms	341.34 ms

IoTDB stores min/max statistics per TsFile chunk and uses them to skip chunks that cannot contain matching values. InfluxDB has no equivalent — it scans all TSM blocks. TimescaleDB relies on PostgreSQL's BRIN index, which gives partial pruning but still scans substantially more data than IoTDB's statistics-based approach.

Downsampling (GROUP BY time buckets) is where TimescaleDB shows strength relative to InfluxDB despite its write disadvantage:

DB	Downsample avg lat
TimescaleDB	0.72 ms
IoTDB	1.07 ms
InfluxDB	6.85 ms

TimescaleDB's time_bucket() aggregation runs entirely inside the PostgreSQL executor with its hot shared_buffers cache. InfluxDB's Flux aggregation engine has higher per-query overhead.

The latest-point throughput illusion

The pts/s figure for latest-point appears low compared to range-query because the two queries return fundamentally different amounts of data per call: a range query returns hundreds of points per call (high pts/s), while latest-point returns exactly one point per call (pts/s ≈ queries/second). Latency is the correct metric here:

DB	Latest-point avg lat	Range-query avg lat
TimescaleDB	0.50 ms	0.64 ms
IoTDB	0.78 ms	1.35 ms
InfluxDB	10.85 ms	6.50 ms

TimescaleDB and IoTDB are actually faster for latest-point than for range queries — TimescaleDB via a SELECT ... ORDER BY time DESC LIMIT 1 reverse B-tree scan, and IoTDB via a dedicated in-memory last-value cache that bypasses TsFile entirely. InfluxDB has no equivalent shortcut and must seek to the tail of each series in its TSM blocks, making it the only database where latest-point is slower than a range scan.

Memory usage

IoTDB's dominant write throughput comes with a significant cost: it occupies ~12 GB of RAM throughout the run, compared to InfluxDB's ~300–900 MB and TimescaleDB's ~5.5 GB. IoTDB buffers all incoming tablets in a JVM heap before flushing columnar TsFile segments to disk. This is an architectural trade-off — high write throughput in exchange for high memory pressure. In memory-constrained IoT gateway deployments this would be a disqualifying factor.

TimescaleDB's 5.5 GB figure requires context. PostgreSQL's shared_buffers is set to the default 128 MB, yet docker stats reports ~5.4 GB from the very first test. The gap is the Linux OS page cache: PostgreSQL bypasses shared_buffers for large sequential scans and relies heavily on the kernel's file system cache, which the container's cgroup accounts toward its RSS. After the first few tests warm the file system cache, most of TimescaleDB's reported memory is OS-managed page cache rather than database-controlled buffers. This inflates the comparison: InfluxDB and IoTDB manage their own in-process caches (TSM cache and JVM heap respectively), so their docker stats numbers reflect real application memory more directly.

Summary

Workload	Best DB	Reason
High-rate sequential ingestion	IoTDB	Binary tablet protocol + memory buffering
Large-batch bulk loading	IoTDB	Vectorised columnar write path
Single-point streaming (BATCH=1)	IoTDB	Lower Thrift overhead vs HTTP/WAL
Out-of-order tolerance	InfluxDB	Higher absolute OOO pts/s at every scale
Dashboard downsampling	TimescaleDB	time_bucket() in hot shared_buffers
Bulk time-range export	TimescaleDB	Chunk pruning + warm OS page cache
Threshold alerting (value filter)	IoTDB	Chunk statistics for early pruning
Real-time last-value lookup	TimescaleDB	Sub-millisecond reverse index scan
Memory efficiency	InfluxDB	TSM cache capped at 1 GB by default

IoTDB is the clear winner for write-heavy workloads but requires substantial memory and is purpose-built for the IoT domain. TimescaleDB trades raw write speed for SQL compatibility, mature tooling, and surprisingly strong read performance once data is cached. InfluxDB occupies the middle ground: better write throughput than TimescaleDB, better memory efficiency than IoTDB, but weaker on complex read patterns.

Medium-scale validation

Running the same workload with 5× more loops (5,000 vs 1,000) confirms most of the small-scale analysis but also surfaces two false positives and several new findings.

What held up

IoTDB write dominance is structural, not a warm-up artifact. At medium scale the gap widens: IoTDB reaches 6.4 M pts/s vs InfluxDB's 363 K and TimescaleDB's 126 K. The throughput increases with dataset size (2.1 M → 6.4 M for sequential write), which is consistent with JVM JIT compilation saturating the Thrift write path after enough iterations. The small-scale numbers understate IoTDB's ceiling.

TimescaleDB downsample is genuinely cache-resident. Latency is 0.72 ms at small and 0.67 ms at medium — effectively constant. Because time_bucket() queries scan a fixed time window (not the full dataset), the query cost does not grow with total data volume once the relevant chunks are in shared_buffers. This is the strongest validation in the dataset.

IoTDB's value-filter chunk-statistics advantage is a constant factor, not asymptotic. All three databases scale roughly linearly with data size for value-filter:

DB	Small avg lat	Medium avg lat	Ratio	Expected (5×)
IoTDB	3.79 ms	19.17 ms	5.1×	5×
TimescaleDB	40.12 ms	175.07 ms	4.4×	5×
InfluxDB	89.85 ms	396.15 ms	4.4×	5×

IoTDB remains ~5–20× faster in absolute terms, but the gap does not grow with scale. The chunk-statistics skip is a pruning optimisation, not an index.

False positives corrected

"InfluxDB handles out-of-order writes nearly as fast as sequential" — this was imprecise. The actual drop is 38% at small scale (272 K vs 439 K pts/s) and 32% at medium (249 K vs 363 K pts/s). That is a consistent and significant overhead, not a negligible one. The correct statement is that InfluxDB tolerates out-of-order writes with moderate throughput cost, which is markedly better than IoTDB's TSM compaction path. Two separate metrics matter here and they point in different directions:

Absolute OOO throughput: InfluxDB leads at every scale (272 K vs 132 K pts/s at small), so it is the better choice when raw ingestion rate under disorder matters.
Relative OOO degradation: TimescaleDB is more tolerant — it loses less than 1% throughput from OOO at small and medium scales, versus InfluxDB's 32–38%. TimescaleDB's B-tree index updates are order-independent, so late-arriving rows incur no extra cost until the hypertable grows large enough to trigger chunk decompression (which only surfaces at large scale, where TimescaleDB's OOO penalty reaches 23%).

InfluxDB's memory footprint was understated. The small-scale description of "~300–900 MB" and the claim that the TSM cache is "capped at 1 GB by default" are misleading at scale. At medium scale, batch-large and out-of-order peak at 3.1 GB, and the summary table's "memory efficiency" win for InfluxDB only holds for the small write test. Total process RSS includes WAL buffers, the series index, and Go runtime overhead well beyond the TSM cache limit.

New findings from medium scale

InfluxDB latest-point latency scales with dataset size; TimescaleDB and IoTDB do not.

DB	Small avg lat	Medium avg lat
InfluxDB	10.85 ms	23.18 ms
IoTDB	0.78 ms	0.75 ms
TimescaleDB	0.50 ms	0.53 ms

TimescaleDB's reverse B-tree scan and IoTDB's in-memory last-value cache are O(1) with respect to total stored data. InfluxDB must seek to the end of each series' TSM file blocks, and that seek cost grows as more files accumulate. In any long-running deployment this gap will compound.

TimescaleDB read P99 latency degrades sharply at medium scale (275 ms → 1672 ms for the mixed READ workload). The average improves relative to InfluxDB (57 ms vs 121 ms), but the tail grows faster. Queries that fall outside the shared_buffers warm region trigger full chunk scans with no bounding index, producing unbounded tail latency as the dataset grows.

IoTDB reads scale sublinearly. The mixed READ average latency increases only 2.9× for 5× more data (2.26 ms → 6.62 ms). This is better than InfluxDB (4.3×) or TimescaleDB (4.4×), suggesting IoTDB's JVM heap cache covers a proportionally larger share of the working set as queries run longer.

Revised summary

The summary table from the small-scale analysis stands, with two corrections:

Out-of-order tolerance: InfluxDB delivers higher absolute OOO throughput at every scale. However, TimescaleDB has a smaller relative degradation from OOO (<1% at small/medium, 23% at large) compared to InfluxDB's consistent 32–38% drop. The summary table winner (InfluxDB) is correct for absolute pts/s but should not be read as "handles disorder more gracefully."
Memory efficiency: InfluxDB wins only at small scale; at medium scale its peak RSS (3.1 GB) is comparable to or worse than TimescaleDB's starting footprint.

Large-scale validation

The large scale (10 clients, 50 devices, 20 sensors, 5,000 loops) contains roughly 20× more data than medium and 100× more than small. At this volume several trends from the medium analysis sharpen into clearer patterns and two new failure modes emerge.

What continues to hold

IoTDB write dominance widens. The absolute gap grows at every scale:

DB	Small pts/s	Medium pts/s	Large pts/s
IoTDB	2,130,911	6,378,193	17,511,476
InfluxDB	439,381	363,669	1,178,162
TimescaleDB	133,677	126,131	358,761

IoTDB WRITE throughput scales superlinearly with both dataset size and client count, reflecting continued JIT saturation of the Thrift write path. The ratio over InfluxDB is consistent (~5–17× depending on scale); the ratio over TimescaleDB grows from 16× to 50× between small and large.

TimescaleDB downsample and latest-point remain constant-latency across all three scales.

DB	Downsample small	Downsample medium	Downsample large
TimescaleDB	0.72 ms	0.67 ms	0.74 ms
IoTDB	1.07 ms	1.26 ms	1.74 ms
InfluxDB	6.85 ms	7.17 ms	9.91 ms

DB	Latest-point small	Latest-point medium	Latest-point large
TimescaleDB	0.50 ms	0.53 ms	0.60 ms
IoTDB	0.78 ms	0.75 ms	0.64 ms
InfluxDB	10.85 ms	23.18 ms	34.48 ms

TimescaleDB downsample stays sub-millisecond across 100× data growth. The time_bucket() result is fully validated as cache-resident and query-window-bounded. InfluxDB latest-point shows a clean linear progression (10.85 → 23.18 → 34.48 ms), confirming the TSM tail-seek cost grows proportionally with stored files. This is now a three-point regression with no ambiguity.

IoTDB's value-filter advantage becomes sublinear, not just a constant factor. This is the biggest revision from the medium-scale analysis, which only showed a constant-factor difference:

DB	Small avg lat	Medium avg lat	Large avg lat	Small→Large ratio	Data ratio
IoTDB	3.79 ms	19.17 ms	22.29 ms	5.9×	100×
TimescaleDB	40.12 ms	175.07 ms	629.15 ms	15.7×	100×
InfluxDB	89.85 ms	396.15 ms	537.48 ms	6.0×	100×

IoTDB's value-filter latency grew only 5.9× for 100× more data. At medium scale it appeared linear; at large scale it breaks from linearity. IoTDB's chunk min/max statistics skip increasingly large fractions of the dataset as the data grows denser — the pruning ratio improves with scale. TimescaleDB grows 15.7×, worse than linear, consistent with BRIN index becoming less effective as more non-contiguous pages must be read. InfluxDB at 6.0× is surprisingly sublinear too, likely because the TSM block structure allows skipping entire files based on time-range metadata even for value predicates.

New findings from large scale

TimescaleDB out-of-order penalty surfaces at large scale. At small and medium it was effectively zero (OOO ≈ WRITE throughput). At large scale it degrades to 276 K vs 358 K pts/s — a 23% drop. As the hypertable grows, inserting out-of-order rows forces writes into older, already-compressed chunks, triggering decompression-recompression cycles that do not appear at smaller volumes. This reverses the earlier assessment that TimescaleDB is insensitive to write order.

TimescaleDB becomes CPU-saturated on reads at large scale. The mixed READ workload averages 97.7% CPU (peak 99.7%), and value-filter averages 98.3% (peak 100%). P99 latency for READ reaches 4,232 ms. This is not a storage bottleneck — it is the query executor running out of compute. At this point adding more RAM or faster disks would not help; only a stronger CPU or horizontal sharding would. InfluxDB reaches 85% CPU on read but does not saturate. IoTDB peaks at 70% for the same workload.

IoTDB batch-small throughput breaks the expected ceiling. Across scales: 24,268 → 79,923 → 494,743 pts/s, with average latency decreasing from 1.54 ms to 0.35 ms as load increases. (Note: the small-scale run also produces an anomalous P99 of 1.19 ms — lower than its own 1.54 ms average — which is statistically only possible if a small fraction of operations had very large outliers, likely JVM GC pauses, pulling the average above the 99th percentile on a short run. This artifact does not appear at medium or large scale and does not affect the throughput figures.) This is only possible if the JVM JIT compiler is progressively optimising the hot Thrift serialisation path during the run. At large scale, the benchmark runs long enough for the JIT to reach peak compilation, making the large-scale batch-small numbers the most representative of IoTDB's steady-state single-point ingestion throughput.

TimescaleDB batch-large hits the benchmark timeout at large scale (exactly 3,600 s). The P99 of 1,639 ms and average latency of 473 ms reflect a database under sustained WAL pressure: with 50 devices and 100-row batches, each write touches 50 hypertable partitions and triggers 50 WAL flushes, and the cumulative cost exhausts the time budget before the full loop count completes.

IoTDB memory footprint grows beyond its pre-allocated heap at large scale (~15.5 GB vs ~12.7 GB at medium). The JVM heap expanded to accommodate the larger write buffer and series index. The earlier claim that IoTDB memory is a fixed pre-allocation only holds up to medium scale; at large volumes the heap grows dynamically. It remains below 16 GB, still far below a hypothetical equal-memory-budget comparison with TimescaleDB (~9.7 GB) or InfluxDB (~1.5 GB at large WRITE), but the "fixed cost" framing was too strong.

Final summary across all scales

Claim from original analysis	Status after large-scale data
IoTDB dominates write throughput	Confirmed and strengthened — gap widens to 50× over TimescaleDB
TimescaleDB downsample is cache-resident and constant-latency	Confirmed — sub-ms across 100× data growth
TimescaleDB/IoTDB latest-point is O(1) w.r.t. dataset size	Confirmed — both stay flat; InfluxDB shows clean linear growth
IoTDB value-filter advantage is a constant factor	Partially wrong — at large scale it becomes sublinear (5.9× for 100× data)
TimescaleDB OOO cost is negligible	Wrong at large scale — 23% penalty emerges from chunk decompression
InfluxDB memory is ~300–900 MB	Wrong — peaks at 2.6 GB at large BATCH-LARGE
IoTDB memory is a fixed pre-allocation	Overstated — heap grows to ~15.5 GB at large scale
TimescaleDB P99 tail latency grows with data	Confirmed and severe — 275 ms → 1,672 ms → 4,232 ms across scales

Tuned configuration: small-scale re-run

Two configuration changes were applied before this re-run, addressing two of the limitations noted in the original analysis:

Sequential container lifecycle. Previously all three databases started simultaneously and ran in parallel throughout the entire session. Each database container now starts immediately before its own tests and stops as soon as they finish. This eliminates cross-database resource contention and gives each database exclusive access to the host's CPU, memory, and I/O during its workload.
Database-level tuning. TimescaleDB received a pgtune configuration for PostgreSQL 15 on this machine (shared_buffers=8GB, work_mem=27413kB, maintenance_work_mem=2GB, effective_io_concurrency=1000, and WAL/parallelism settings matched to 6 cores and 32 GB RAM). InfluxDB's TSM write-cache ceiling was raised to 8 GB via INFLUXD_STORAGE_CACHE_MAX_MEMORY_SIZE. A 28 GB mem_limit was added to every container.

Results

DB	Test	Time (s)	Throughput	Avg Lat	P99 Lat	Avg CPU	Peak CPU	Avg Mem	Peak Mem
INFLUXDB	BATCH-SMALL	19.9	5024.41 pts/s	9.41 ms	14.21 ms	3.9%	6.9%	98.2 MB	169.1 MB
INFLUXDB	BATCH-LARGE	37.49	2667256.15 pts/s	18.04 ms	60.02 ms	21.6%	31.3%	316.6 MB	470.1 MB
INFLUXDB	OUT-OF-ORDER	39.12	255634.84 pts/s	18.92 ms	50.11 ms	8.1%	16.6%	228.1 MB	411.9 MB
INFLUXDB	WRITE	25.13	397894.94 pts/s	12.00 ms	190.50 ms	8.5%	15.8%	190.8 MB	239.0 MB
INFLUXDB	READ	24.29	4317.23 pts/s	23.27 ms	215.95 ms	41.3%	54.4%	254.2 MB	271.2 MB
INFLUXDB	LATEST-POINT	10.49	476.75 pts/s	9.24 ms	15.54 ms	33.2%	53.1%	255.4 MB	262.6 MB
INFLUXDB	DOWNSAMPLE	6.94	9370.00 pts/s	5.71 ms	11.54 ms	24.5%	48.6%	244.8 MB	245.8 MB
INFLUXDB	RANGE-QUERY	6.62	37002.24 pts/s	5.41 ms	11.49 ms	22.6%	48.5%	241.1 MB	243.0 MB
INFLUXDB	VALUE-FILTER	73.08	1181.67 pts/s	71.09 ms	243.26 ms	49.0%	55.7%	262.4 MB	284.9 MB
TIMESCALEDB	BATCH-SMALL	19.53	5120.34 pts/s	9.22 ms	14.66 ms	3.4%	5.1%	239.7 MB	244.5 MB
TIMESCALEDB	BATCH-LARGE	465.27	214927.88 pts/s	225.23 ms	812.28 ms	31.3%	37.8%	1072.5 MB	1813.5 MB
TIMESCALEDB	OUT-OF-ORDER	62.45	160121.49 pts/s	28.86 ms	226.84 ms	21.5%	30.8%	1816.1 MB	1820.7 MB
TIMESCALEDB	WRITE	57.21	174787.87 pts/s	27.70 ms	223.00 ms	22.5%	32.2%	1816.3 MB	1825.8 MB
TIMESCALEDB	READ	9.15	11679.48 pts/s	8.01 ms	158.62 ms	35.0%	56.4%	1817.4 MB	1826.8 MB
TIMESCALEDB	LATEST-POINT	1.48	3386.06 pts/s	0.38 ms	0.62 ms	6.1%	12.1%	1803.3 MB	1803.3 MB
TIMESCALEDB	DOWNSAMPLE	1.7	38347.26 pts/s	0.61 ms	0.98 ms	7.2%	21.5%	1803.6 MB	1804.3 MB
TIMESCALEDB	RANGE-QUERY	1.62	154115.31 pts/s	0.54 ms	0.86 ms	9.3%	18.6%	1803.8 MB	1804.3 MB
TIMESCALEDB	VALUE-FILTER	24.87	3541.10 pts/s	23.98 ms	157.82 ms	46.5%	56.7%	1821.6 MB	1825.8 MB
IOTDB	BATCH-SMALL	3.5	28593.60 pts/s	1.22 ms	1.35 ms	7.8%	40.2%	1680.8 MB	1858.6 MB
IOTDB	BATCH-LARGE	8.45	11827572.73 pts/s	3.51 ms	18.83 ms	18.3%	80.6%	2366.5 MB	2975.7 MB
IOTDB	OUT-OF-ORDER	5.34	1871351.91 pts/s	2.09 ms	3.07 ms	7.0%	33.8%	3050.6 MB	3120.1 MB
IOTDB	WRITE	4.62	2163809.95 pts/s	1.73 ms	1.18 ms	5.1%	19.7%	3165.9 MB	3205.1 MB
IOTDB	READ	4.82	21381.34 pts/s	3.65 ms	87.14 ms	27.5%	64.5%	6533.9 MB	8358.9 MB
IOTDB	LATEST-POINT	2.38	2104.25 pts/s	1.24 ms	2.53 ms	9.7%	28.0%	8565.8 MB	8630.3 MB
IOTDB	DOWNSAMPLE	2.94	22095.82 pts/s	1.81 ms	4.90 ms	16.1%	32.1%	9856.7 MB	10133.5 MB
IOTDB	RANGE-QUERY	2.3	108630.64 pts/s	1.19 ms	4.75 ms	8.8%	25.1%	10139.3 MB	10139.6 MB
IOTDB	VALUE-FILTER	4.82	18257.47 pts/s	3.72 ms	27.53 ms	22.1%	53.7%	10181.9 MB	10189.8 MB

Analysis

TimescaleDB: substantial write improvement from pgtune

The most significant performance gains are in TimescaleDB write throughput:

Test	Old pts/s	New pts/s	Throughput Δ	Old avg lat	New avg lat	Latency Δ
BATCH-SMALL	4,454	5,120	+15%	10.67 ms	9.22 ms	-14%
BATCH-LARGE	160,546	214,927	+34%	302.36 ms	225.23 ms	-25%
OUT-OF-ORDER	132,189	160,121	+21%	36.52 ms	28.86 ms	-21%
WRITE	133,677	174,787	+30%	36.05 ms	27.70 ms	-25%

The driver is shared_buffers=8GB, which gives PostgreSQL a substantially larger buffer pool and reduces WAL write-amplification from buffer evictions. work_mem=27413kB allows sorting and hashing to stay in memory rather than spilling to disk. checkpoint_completion_target=0.9 spreads checkpoint I/O more evenly across the checkpoint interval, reducing write stall spikes that inflate P99 latency. BATCH-LARGE P99 dropped from 1,069 ms to 812 ms — a 24% improvement driven almost entirely by fewer checkpoint stalls.

TimescaleDB read workloads also improved:

Test	Old avg lat	New avg lat	Δ
VALUE-FILTER	40.12 ms	23.98 ms	-40%
READ	12.96 ms	8.01 ms	-38%
LATEST-POINT	0.50 ms	0.38 ms	-24%
DOWNSAMPLE	0.72 ms	0.61 ms	-15%
RANGE-QUERY	0.64 ms	0.54 ms	-15%

VALUE-FILTER benefited most, consistent with work_mem eliminating disk spills for predicate-evaluation sorting that were not obvious at this data volume.

InfluxDB reads improved; BATCH-LARGE P99 dropped sharply

All InfluxDB read latencies improved with the isolated run:

Test	Old avg lat	New avg lat	Δ
VALUE-FILTER	89.85 ms	71.09 ms	-21%
READ	28.22 ms	23.27 ms	-17%
DOWNSAMPLE	6.85 ms	5.71 ms	-17%
RANGE-QUERY	6.50 ms	5.41 ms	-17%
LATEST-POINT	10.85 ms	9.24 ms	-15%

In the original setup, TimescaleDB occupied ~5.4 GB of OS page cache throughout the InfluxDB test session, reducing the memory available for InfluxDB's own TSM series index and OS file cache. Running in isolation removes that pressure.

BATCH-LARGE also improved notably in both throughput (2.19M → 2.67M pts/s, +21%) and P99 latency (205 ms → 60 ms, -71%). The P99 drop suggests InfluxDB's TSM compaction engine — which runs asynchronously alongside writes — was previously competing with TimescaleDB for I/O bandwidth.

Sequential write (WRITE) shows a slightly lower throughput (439K → 397K pts/s) and a higher P99 (39 ms → 190 ms). The average latency moved less than 11% (10.81 ms → 12.00 ms), and the overall wall time increased only from 22.76 s to 25.13 s. This is within run-to-run variance for a short benchmark and does not represent a configuration regression.

Memory measurements are now more meaningful

The most dramatic numerical change in the table is in the memory columns, and the reason is methodological rather than physical.

TimescaleDB previously reported ~5,443 MB at BATCH-SMALL — the very first test. This was not PostgreSQL's operational footprint; it was accumulated OS page cache from having run alongside IoTDB throughout the entire InfluxDB test session (roughly two hours of idle container time). In the new setup, TimescaleDB starts fresh and shows 239 MB at BATCH-SMALL, growing to ~1,816 MB after the write tests warm shared_buffers. The 1.8 GB figure is the honest representation of TimescaleDB's actual buffer-pool usage for this dataset size. The earlier 5.4 GB was inflated by kernel-managed page cache that had nothing to do with the test being measured.

IoTDB previously reported ~12,469 MB at BATCH-SMALL. IoTDB had been idle but running throughout all InfluxDB and TimescaleDB tests before its own session began. During that time the JVM pre-allocated its configured maximum heap in anticipation of load. In the new setup the JVM starts at 1,680 MB and grows organically to ~10 GB by the end of read tests, reflecting actual working-set demand across workload phases rather than speculative pre-allocation.

IoTDB reads show a slight regression

IoTDB read latencies are modestly higher in the tuned run:

Test	Old avg lat	New avg lat	Δ
READ	2.26 ms	3.65 ms	+62%
LATEST-POINT	0.78 ms	1.24 ms	+59%
DOWNSAMPLE	1.07 ms	1.81 ms	+69%
RANGE-QUERY	1.35 ms	1.19 ms	-12%

This is the inverse of the memory effect: in the original setup, IoTDB's JVM had been running for hours before its read tests began, giving the JIT compiler ample time to optimise the hot read paths at idle. In the tuned run the JVM is freshly started at the beginning of IoTDB's session, and the JIT is still compiling during the read tests. This is the same phenomenon identified in the large-scale analysis — IoTDB's batch-small throughput more than doubles from small to large scale as the JIT saturates the Thrift path. The read latencies from the original run were artificially optimistic due to unintentional JIT pre-warming from idle time. Write performance, which comes earlier in the test order and benefits from JIT warmup during the write phase itself, is unaffected.

Net assessment

pgtune delivers clear, consistent gains for TimescaleDB: 30–34% write throughput improvement and 15–40% read latency reduction. InfluxDB benefits from isolation rather than explicit tuning — read latencies drop 15–21% and BATCH-LARGE P99 drops 71% once resource contention is removed. The memory figures for TimescaleDB (5.4 GB → 1.8 GB) and IoTDB (12.5 GB → 1.7 GB start) are substantially more honest; the old numbers were artefacts of the shared-start methodology, not real database footprints at small scale.

Tuned configuration: medium-scale re-run

The same two configuration changes from the small-scale re-run (sequential container lifecycle + pgtune for TimescaleDB + 8 GB TSM cache ceiling for InfluxDB) were applied at medium scale (5,000 loops — 5× more data than small).

Results

DB	Test	Time (s)	Throughput	Avg Lat	P99 Lat	Avg CPU	Peak CPU	Avg Mem	Peak Mem
INFLUXDB	BATCH-SMALL	102.54	4,876.11 pts/s	10.13 ms	30.46 ms	5.6%	7.3%	215.5 MB	234.4 MB
INFLUXDB	BATCH-LARGE	192.45	2,598,111.46 pts/s	18.92 ms	187.52 ms	24.4%	34.1%	978.8 MB	2185.2 MB
INFLUXDB	OUT-OF-ORDER	203.71	245,447.81 pts/s	20.15 ms	61.53 ms	10.2%	18.7%	533.6 MB	1939.5 MB
INFLUXDB	WRITE	127.23	392,975.20 pts/s	12.56 ms	39.75 ms	11.6%	15.9%	531.9 MB	796.9 MB
INFLUXDB	READ	508.22	1,039.24 pts/s	101.13 ms	1031.74 ms	55.1%	59.1%	897.8 MB	932.5 MB
INFLUXDB	LATEST-POINT	110.04	227.19 pts/s	21.72 ms	33.62 ms	55.6%	60.0%	569.8 MB	795.5 MB
INFLUXDB	DOWNSAMPLE	29.68	10,951.70 pts/s	5.65 ms	8.61 ms	41.3%	48.1%	311.2 MB	313.7 MB
INFLUXDB	RANGE-QUERY	27.89	43,923.27 pts/s	5.31 ms	8.31 ms	40.9%	47.2%	309.6 MB	311.8 MB
INFLUXDB	VALUE-FILTER	1575.87	267.55 pts/s	310.75 ms	1003.52 ms	49.3%	51.3%	359.3 MB	387.8 MB
TIMESCALEDB	BATCH-SMALL	103.34	4,838.20 pts/s	10.20 ms	36.40 ms	4.1%	5.6%	293.6 MB	300.1 MB
TIMESCALEDB	BATCH-LARGE	2330.60	214,537.29 pts/s	230.88 ms	1098.29 ms	31.0%	38.8%	4414.2 MB	8661.0 MB
TIMESCALEDB	OUT-OF-ORDER	295.28	169,331.43 pts/s	28.96 ms	232.60 ms	24.2%	33.3%	8614.5 MB	8622.1 MB
TIMESCALEDB	WRITE	284.60	175,685.00 pts/s	27.93 ms	215.49 ms	24.8%	32.3%	8615.0 MB	8623.1 MB
TIMESCALEDB	READ	199.79	2,693.17 pts/s	39.30 ms	922.11 ms	54.0%	57.8%	8626.8 MB	8630.3 MB
TIMESCALEDB	LATEST-POINT	3.18	7,859.49 pts/s	0.41 ms	0.61 ms	12.4%	23.3%	8603.3 MB	8612.9 MB
TIMESCALEDB	DOWNSAMPLE	3.74	86,933.99 pts/s	0.52 ms	0.83 ms	14.4%	33.3%	8601.9 MB	8611.8 MB
TIMESCALEDB	RANGE-QUERY	3.54	353,181.70 pts/s	0.48 ms	0.74 ms	16.9%	26.7%	8603.0 MB	8610.8 MB
TIMESCALEDB	VALUE-FILTER	666.05	645.43 pts/s	130.08 ms	893.33 ms	55.8%	58.0%	8627.8 MB	8630.3 MB
IOTDB	BATCH-SMALL	6.16	81,112.44 pts/s	0.50 ms	1.11 ms	8.6%	52.6%	1922.9 MB	2219.0 MB
IOTDB	BATCH-LARGE	18.00	27,772,266.64 pts/s	1.56 ms	15.43 ms	28.4%	77.7%	3997.6 MB	6601.7 MB
IOTDB	OUT-OF-ORDER	7.45	6,709,314.58 pts/s	0.60 ms	1.42 ms	10.5%	45.5%	7860.4 MB	9943.0 MB
IOTDB	WRITE	6.26	7,991,224.90 pts/s	0.50 ms	0.87 ms	8.8%	33.7%	9999.2 MB	10076.2 MB
IOTDB	READ	22.05	23,605.75 pts/s	4.19 ms	51.22 ms	46.6%	66.8%	10145.6 MB	10161.2 MB
IOTDB	LATEST-POINT	3.74	6,686.53 pts/s	0.51 ms	1.40 ms	18.0%	43.5%	10255.6 MB	10281.0 MB
IOTDB	DOWNSAMPLE	4.36	74,576.96 pts/s	0.64 ms	1.25 ms	16.7%	44.5%	10286.1 MB	10291.2 MB
IOTDB	RANGE-QUERY	5.55	225,247.09 pts/s	0.88 ms	1.69 ms	26.0%	44.1%	10306.6 MB	10311.7 MB
IOTDB	VALUE-FILTER	51.15	8,403.91 pts/s	9.80 ms	40.50 ms	38.7%	52.4%	10353.4 MB	10373.1 MB

Analysis

TimescaleDB: write and read gains mirror small-scale pattern

Test	Old pts/s	New pts/s	Throughput Δ	Old avg lat	New avg lat	Avg lat Δ	Old P99	New P99	P99 Δ
BATCH-SMALL	4,521	4,838	+7%	10.91 ms	10.20 ms	−7%	36.46 ms	36.40 ms	~0%
BATCH-LARGE	161,070	214,537	+33%	308.38 ms	230.88 ms	−25%	1036.76 ms	1098.29 ms	+6%
OUT-OF-ORDER	126,186	169,331	+34%	38.71 ms	28.96 ms	−25%	263.93 ms	232.60 ms	−12%
WRITE	126,131	175,685	+39%	38.72 ms	27.93 ms	−28%	252.60 ms	215.49 ms	−15%

Test	Old avg lat	New avg lat	Δ	Old P99	New P99	P99 Δ
READ	57.25 ms	39.30 ms	−31%	1672.74 ms	922.11 ms	−45%
VALUE-FILTER	175.07 ms	130.08 ms	−26%	1299.50 ms	893.33 ms	−31%
LATEST-POINT	0.53 ms	0.41 ms	−23%	1.56 ms	0.61 ms	−61%
DOWNSAMPLE	0.67 ms	0.52 ms	−22%	1.74 ms	0.83 ms	−52%
RANGE-QUERY	0.59 ms	0.48 ms	−19%	1.61 ms	0.74 ms	−54%

Write throughput improves 30–40% and average latency 20–30%, consistent with the small-scale pgtune gains. The 45% drop in READ P99 (1673 ms → 922 ms) shows that a significant fraction of the tail-latency inflation reported in the medium-scale validation was driven by checkpoint stalls and resource contention rather than purely by dataset size. The sub-millisecond point queries (LATEST-POINT, DOWNSAMPLE, RANGE-QUERY) see 50–60% P99 improvements — these short queries were most affected by OS scheduling jitter from competing containers in the original setup.

One anomaly: BATCH-LARGE P99 increases from 1037 ms to 1098 ms despite average latency improving by 25% and throughput by 33%. Higher throughput means more batches are queued per interval, so the absolute worst-case tail grows even as the median improves.

InfluxDB: P99 drops far more than throughput

Test	Old pts/s	New pts/s	Throughput Δ	Old avg lat	New avg lat	Avg lat Δ	Old P99	New P99	P99 Δ
WRITE	363,669	392,975	+8%	13.59 ms	12.56 ms	−8%	210.95 ms	39.75 ms	−81%
BATCH-LARGE	2,079,443	2,598,111	+25%	23.81 ms	18.92 ms	−21%	238.84 ms	187.52 ms	−21%
OUT-OF-ORDER	249,071	245,447	−1%	19.88 ms	20.15 ms	+1%	102.65 ms	61.53 ms	−40%

The WRITE P99 improvement is the most striking figure in the medium rerun: 210 ms → 39 ms (−81%) with only an 8% throughput gain. In the original, TimescaleDB was accumulating ~8 GB of OS page cache while running alongside InfluxDB, causing periodic I/O stalls in InfluxDB's TSM compaction path. Those stalls manifest as tail latency spikes that do not affect average throughput but inflate P99 dramatically. The rerun removes this pressure.

OUT-OF-ORDER throughput is essentially unchanged (−1%) while its P99 improves 40%. This makes sense: the OOO ingestion rate is limited by InfluxDB's write-ordering logic, not by I/O contention, so throughput is similar; but the peak stall events that drove P99 to 102 ms are gone.

All InfluxDB read tests improve 15–27% in average latency, consistent with better TSM file cache utilisation without competing page-cache pressure.

IoTDB: largest gains of any database, driven by exclusive resource access and JIT

IoTDB shows the biggest improvements at medium scale — larger in relative terms than at small scale:

Test	Old pts/s	New pts/s	Throughput Δ	Old avg lat	New avg lat	Avg lat Δ
VALUE-FILTER	4,366	8,403	+92%	19.17 ms	9.80 ms	−49%
DOWNSAMPLE	42,888	74,576	+74%	1.26 ms	0.64 ms	−49%
RANGE-QUERY	146,541	225,247	+54%	1.46 ms	0.88 ms	−40%
READ	15,215	23,605	+55%	6.62 ms	4.19 ms	−37%
LATEST-POINT	5,018	6,686	+33%	0.75 ms	0.51 ms	−32%
BATCH-LARGE	19,868,484	27,772,266	+40%	2.22 ms	1.56 ms	−30%
WRITE	6,378,193	7,991,224	+25%	0.65 ms	0.50 ms	−23%

The 5× longer benchmark compared to small scale allows the JVM JIT compiler to saturate the hot read paths during the write phase itself. By the time read tests begin, the JIT is fully optimised — the opposite of the small-scale re-run, where the JVM was freshly started and the write phase was too short for the JIT to finish compiling read paths. This explains why IoTDB reads now improve (instead of regressing as they did in the small-scale re-run) and why all gains are larger than at small scale.

The write improvements (25–40%) are explained by exclusive CPU and I/O access: in the original medium run, IoTDB's JVM competed with TimescaleDB's ~8 GB page cache and active write I/O.

Memory measurements

TimescaleDB starts at 293 MB in the rerun (vs 5,450 MB in the original), for the same reason as at small scale: a fresh container sees no inherited page cache. It grows to ~8,602 MB once the dataset is loaded — the honest shared_buffers=8GB footprint, growing to ~8,628 MB at read tests.

IoTDB starts at 1,922 MB and grows organically to ~10,353 MB, vs the 12,520–12,718 MB range held flat throughout the original medium run (where the JVM had pre-allocated maximum heap during idle time). The growth curve now reflects actual working-set demand at each phase.

InfluxDB BATCH-LARGE peak drops from 3,152 MB to 2,185 MB. Raising the TSM cache ceiling to 8 GB did not increase peak RSS; the isolation from competing page-cache consumers is the dominant factor reducing peak usage.

Validating medium-scale-validation claims

Claim	Rerun verdict
IoTDB write dominance widens at medium scale (6.4 M pts/s)	Confirmed and strengthened — rerun shows 7.99 M pts/s; gap over InfluxDB and TimescaleDB is wider
TimescaleDB downsample latency stays effectively constant (0.67 ms at medium)	Confirmed — rerun 0.52 ms, still sub-millisecond and lower than original
IoTDB/TS/InfluxDB value-filter scale roughly linearly with data volume	Partially wrong — IoTDB is clearly sublinear (2.6× for 5× data using rerun small→medium baseline), not linear; TimescaleDB is slightly superlinear (5.4×); only InfluxDB is close to linear (4.4×). IoTDB's chunk-statistics pruning advantage is already compounding at medium scale
TimescaleDB OOO degradation is < 1% at medium scale	Weakened — rerun shows 3.6% drop (175K vs 169K pts/s); still small but no longer negligible, suggesting the near-zero figure in the original was within run-to-run noise
InfluxDB OOO penalty is 32–38%	Confirmed — rerun shows 37.7% drop (393K vs 245K pts/s)
InfluxDB latest-point latency scales with dataset size	Confirmed — rerun small 9.24 ms → rerun medium 21.72 ms (2.4× increase for 5× data)
IoTDB latest-point and TimescaleDB latest-point are O(1) w.r.t. dataset size	Confirmed — IoTDB: 1.24 ms → 0.51 ms (improving due to JIT); TimescaleDB: 0.38 ms → 0.41 ms (flat)
TimescaleDB READ P99 degrades sharply at medium scale (275 ms → 1,672 ms)	Real but overstated — rerun shows 158 ms → 922 ms; degradation exists (5.8×) but the original 1,672 ms included substantial contention inflation
IoTDB reads scale sublinearly (2.9× for 5× data)	Confirmed and strengthened — rerun shows 1.15× (3.65 ms → 4.19 ms) due to JIT saturation arriving before read tests begin at medium scale

Tuned configuration: large-scale re-run

The same tuned configuration at large scale (10 clients, 50 devices, 20 sensors, 5,000 loops — ~20× more data than medium, ~100× more than small).

Results

DB	Test	Time (s)	Throughput	Avg Lat	P99 Lat	Avg CPU	Peak CPU	Avg Mem	Peak Mem
INFLUXDB	BATCH-SMALL	289.46	17,273.50 pts/s	11.50 ms	36.89 ms	11.6%	16.8%	276.2 MB	355.7 MB
INFLUXDB	BATCH-LARGE	1088.32	4,594,226.34 pts/s	43.12 ms	252.72 ms	54.0%	76.4%	1693.9 MB	2903.0 MB
INFLUXDB	OUT-OF-ORDER	618.92	807,863.51 pts/s	24.54 ms	71.77 ms	28.0%	54.4%	1004.3 MB	2051.1 MB
INFLUXDB	WRITE	397.90	1,256,602.92 pts/s	15.78 ms	51.97 ms	32.9%	46.3%	937.9 MB	1213.4 MB
INFLUXDB	READ	811.43	1,310.24 pts/s	161.44 ms	1673.10 ms	86.3%	91.1%	960.7 MB	1208.3 MB
INFLUXDB	LATEST-POINT	117.70	424.79 pts/s	23.24 ms	29.11 ms	81.4%	86.3%	720.9 MB	728.2 MB
INFLUXDB	DOWNSAMPLE	47.49	13,685.81 pts/s	9.22 ms	14.24 ms	71.2%	80.6%	701.5 MB	707.8 MB
INFLUXDB	RANGE-QUERY	44.79	54,695.46 pts/s	8.68 ms	13.77 ms	67.5%	77.5%	700.4 MB	705.9 MB
INFLUXDB	VALUE-FILTER	2502.30	340.07 pts/s	496.81 ms	1617.36 ms	86.4%	89.3%	808.7 MB	862.6 MB
TIMESCALEDB	BATCH-SMALL	285.26	17,528.12 pts/s	11.30 ms	71.91 ms	10.3%	13.6%	331.0 MB	356.4 MB
TIMESCALEDB	BATCH-LARGE	3600.00	460,133.19 pts/s	434.41 ms	1894.39 ms	59.9%	78.2%	8321.9 MB	9275.4 MB
TIMESCALEDB	OUT-OF-ORDER	1283.68	389,505.22 pts/s	50.03 ms	336.15 ms	51.2%	71.9%	9066.4 MB	9241.6 MB
TIMESCALEDB	WRITE	1248.58	400,455.05 pts/s	48.72 ms	334.91 ms	51.4%	70.4%	9631.2 MB	10711.0 MB
TIMESCALEDB	READ	809.01	1,338.80 pts/s	155.66 ms	1850.06 ms	77.4%	83.6%	10690.6 MB	10700.8 MB
TIMESCALEDB	LATEST-POINT	4.27	11,698.79 pts/s	0.62 ms	1.48 ms	26.5%	68.5%	10572.8 MB	10588.2 MB
TIMESCALEDB	DOWNSAMPLE	4.81	135,174.34 pts/s	0.73 ms	1.60 ms	28.0%	71.1%	10572.8 MB	10588.2 MB
TIMESCALEDB	RANGE-QUERY	4.47	559,191.98 pts/s	0.66 ms	1.54 ms	26.8%	69.4%	10572.8 MB	10588.2 MB
TIMESCALEDB	VALUE-FILTER	2702.43	321.06 pts/s	510.05 ms	1804.40 ms	77.4%	83.5%	10690.6 MB	10700.8 MB
IOTDB	BATCH-SMALL	11.19	446,784.33 pts/s	0.40 ms	1.12 ms	18.0%	61.5%	2181.9 MB	2751.5 MB
IOTDB	BATCH-LARGE	130.42	38,336,216.39 pts/s	4.93 ms	27.34 ms	54.4%	82.7%	12376.4 MB	14673.9 MB
IOTDB	OUT-OF-ORDER	26.56	18,824,365.08 pts/s	0.94 ms	2.08 ms	32.5%	63.9%	14676.2 MB	14704.6 MB
IOTDB	WRITE	21.18	23,607,475.35 pts/s	0.76 ms	1.56 ms	25.2%	57.9%	14721.7 MB	14735.4 MB
IOTDB	READ	31.24	33,565.71 pts/s	5.98 ms	68.31 ms	70.7%	87.8%	14895.0 MB	14940.2 MB
IOTDB	LATEST-POINT	4.58	10,916.70 pts/s	0.68 ms	2.26 ms	21.5%	55.5%	15022.1 MB	15032.3 MB
IOTDB	DOWNSAMPLE	5.95	109,225.17 pts/s	0.95 ms	2.23 ms	28.8%	60.0%	15036.4 MB	15042.6 MB
IOTDB	RANGE-QUERY	7.23	345,619.38 pts/s	1.21 ms	2.61 ms	36.5%	63.3%	15032.3 MB	15032.3 MB
IOTDB	VALUE-FILTER	99.14	8,751.78 pts/s	19.42 ms	66.20 ms	76.1%	82.2%	15051.2 MB	15052.8 MB

Analysis

Critical corrections: TimescaleDB OOO and CPU saturation

These are the two largest reversals from the large-scale validation.

TimescaleDB out-of-order penalty at large scale was a contention artefact.

Run	WRITE pts/s	OOO pts/s	OOO penalty
Original large	358,761	275,951	23%
Rerun large	400,455	389,505	2.7%

The large-scale validation stated that "TimescaleDB out-of-order penalty surfaces at large scale" with a 23% drop attributed to decompression-recompression cycles as the hypertable grew. The rerun shows only a 2.7% drop — effectively the same negligible penalty seen at small and medium scale. The original 23% was caused by IoTDB's expanding heap (15+ GB) competing with TimescaleDB's OS page cache at the same time: out-of-order inserts into older chunks require reading those chunks from disk to decompress them, and when IoTDB had evicted those pages from the OS page cache, each OOO insert triggered a cold disk read. With exclusive access, the chunks remain in cache and decompression is cheap. The 23% claim should be struck.

TimescaleDB is not CPU-saturated at large scale.

Workload	Original avg CPU	Rerun avg CPU	Original P99	Rerun P99
READ	97.7%	77.4%	4232 ms	1850 ms
VALUE-FILTER	98.3%	77.4%	3945 ms	1804 ms

The large-scale validation concluded that TimescaleDB "runs out of compute" at this scale and that "adding more RAM or faster disks would not help — only a stronger CPU or horizontal sharding would." This was wrong. The 97.7% CPU average was the result of three databases and the Java benchmark client sharing a 6-core host simultaneously. With one database at a time, CPU stabilises at 77% and P99 latency falls by 56–54%. TimescaleDB was not at a hardware ceiling — it was starved of CPU cycles by competing processes.

InfluxDB: P99 improvements at large scale dwarf throughput gains

The pattern seen at medium scale is more pronounced at large scale. Throughput changes are modest (6–23%), but P99 latency drops are dramatic:

Test	Old pts/s	New pts/s	Δ	Old P99	New P99	P99 Δ
WRITE	1,178,162	1,256,602	+7%	170.92 ms	51.97 ms	−70%
OUT-OF-ORDER	760,888	807,863	+6%	254.05 ms	71.77 ms	−72%
BATCH-LARGE	3,742,857	4,594,226	+23%	374.60 ms	252.72 ms	−33%
LATEST-POINT	287	424	+48%	50.98 ms	29.11 ms	−43%

The 70–72% P99 drops on WRITE and OUT-OF-ORDER confirm that these spikes were I/O contention events: when IoTDB and TimescaleDB were writing aggressively to disk simultaneously, InfluxDB's TSM compaction engine was periodically starved of I/O bandwidth, producing rare but extreme latency outliers. LATEST-POINT improving +48% in throughput at large scale is a strong signal that TSM file seeks were being interrupted by disk I/O from other processes, not just the structural TSM block-scan cost that grows with stored files.

IoTDB: write surge, BATCH-SMALL slight regression

Test	Old pts/s	New pts/s	Δ	Old avg lat	New avg lat	Δ
BATCH-LARGE	27,562,372	38,336,216	+39%	6.93 ms	4.93 ms	−29%
WRITE	17,511,476	23,607,475	+35%	1.04 ms	0.76 ms	−27%
DOWNSAMPLE	65,411	109,225	+67%	1.74 ms	0.95 ms	−45%
OUT-OF-ORDER	16,326,483	18,824,365	+15%	1.09 ms	0.94 ms	−14%
READ	28,404	33,565	+18%	7.07 ms	5.98 ms	−15%
BATCH-SMALL	494,743	446,784	−10%	0.35 ms	0.40 ms	+14%

BATCH-SMALL is the only test that regresses (−10%). In the original large run, IoTDB's JVM had been active from the start of the entire multi-hour session, giving the JIT extensive warm-up time before any test ran. In the rerun the JVM starts fresh, and BATCH-SMALL is the first test — JIT is still compiling hot paths during it. The effect was less visible at medium scale because the 5× more data in medium means each test runs longer, providing some JIT warm-up during the test itself, but BATCH-SMALL still runs in only 6 s, which is insufficient. The cross-scale progression in the rerun is 28K → 81K → 446K pts/s, still showing clear JIT-driven superlinear scaling; the 494K figure from the original was pre-warmed.

All other IoTDB tests improve substantially. DOWNSAMPLE at +67% is the largest gain among analytics queries, consistent with the longer run giving the JIT more time to optimise the TsFile chunk aggregation path before this test is reached.

TimescaleDB: BATCH-LARGE still times out; point queries unchanged

BATCH-LARGE hits the 3,600 s wall again. Throughput is marginally higher (460K vs 421K pts/s) because the pgtune settings reduce WAL stall frequency, but the fundamental bottleneck — 50 hypertable partitions per write triggering simultaneous WAL flushes — is not addressable through configuration alone at this data volume.

Point queries (LATEST-POINT, DOWNSAMPLE, RANGE-QUERY) remain effectively identical between original and rerun (0.62 ms, 0.73 ms, 0.66 ms). These queries touch a fixed time window and their working set fits entirely in shared_buffers at all scales; they are genuinely insensitive to both dataset size and contention.

BATCH-SMALL P99 worsens slightly in the rerun (40.78 ms → 71.91 ms) despite stable throughput (17,787 → 17,528 pts/s). This is most likely run-to-run variance: the original large run had warmer OS state at BATCH-SMALL's start (being the first test in a long session), which smoothed checkpoint timing. The average latency (11.30 ms vs 11.14 ms) is effectively identical.

Validating large-scale-validation claims

Claim from large-scale validation	Rerun verdict
IoTDB write dominance widens at large scale — gap grows to 50× over TimescaleDB	Confirmed and strengthened — rerun: 23.6 M vs 400 K pts/s, ratio ~59×
TimescaleDB downsample is sub-millisecond across 100× data growth	Confirmed — 0.73 ms in rerun
TimescaleDB latest-point is O(1) w.r.t. dataset size	Confirmed — 0.62 ms in rerun
InfluxDB latest-point shows a clean linear progression (10.85 → 23.18 → 34.48 ms)	Partially wrong — rerun progression is 9.24 → 21.72 → 23.24 ms; the large-scale value is 32% lower than in the original, and the medium→large jump is 1.52 ms vs 11.3 ms. I/O contention inflated the original large figure; the growth trend is real but not linear
IoTDB value-filter advantage becomes sublinear at large scale (5.9× for 100× data)	Confirmed — rerun shows 19.42 ms at large vs 3.72 ms (small rerun) = 5.2× for ~100× data
TimescaleDB OOO penalty surfaces at large scale (23% drop)	Wrong — rerun shows 2.7% drop; the 23% was caused by IoTDB heap evicting TimescaleDB's page cache for OOO chunk reads
TimescaleDB becomes CPU-saturated on reads at large scale (97.7% avg CPU, P99 4,232 ms)	Wrong — rerun shows 77.4% CPU and 1,850 ms P99; CPU saturation was a resource-contention artefact
TimescaleDB batch-large hits the benchmark timeout at large scale	Confirmed — rerun also hits 3,600 s
IoTDB batch-small JIT saturation: 24K → 79K → 494K pts/s	Confirmed with correction — rerun progression is 28K → 81K → 446K pts/s; trend holds but 494K was inflated by unintentional JIT pre-warming in the original long session
IoTDB memory grows dynamically at large scale (~15.5 GB)	Confirmed — rerun reaches ~15,052 MB
InfluxDB memory is wrong at > 900 MB	Confirmed — rerun BATCH-LARGE peaks at 2,903 MB, slightly above the original 2,573 MB (the 8 GB TSM ceiling allows more caching when the host is not memory-contended)

Limitations

Cache warm-up: Read tests run sequentially on the same dataset. Later tests benefit from warmer OS page cache and DB in-memory structures. For production-quality isolation, each test should flush caches and restart containers between tests.
IoTDB memory buffering: IoTDB's write throughput is partly a consequence of deferred durability — data lives in RAM until the memtable is flushed. The numbers reflect in-memory write acknowledgement, not guaranteed persistence.
Single-node, single-machine: All three databases and the benchmark clients ran on the same host. Network latency (relevant for InfluxDB's HTTP API) is near zero, which flattens a real-world disadvantage.
JIT warm-up (IoTDB): IoTDB read performance improves over the duration of a session as the JVM JIT compiler optimises hot paths. Short runs understate steady-state read throughput; longer runs (medium/large scale) are more representative.

Test Environment

Host machine

Component	Details
CPU	Intel Core i5-11400F (6 cores / 12 threads, 2.60 GHz base, 4.40 GHz boost)
RAM	32 GB
Disk	500 GB (SSD NVME)
OS	NixOS (Linux 6.12)

Docker containers

All three containers run on the same host with a 28 GB mem_limit each. Containers start and stop sequentially — only one database is active at any given time during a benchmark session.

Container	Image
IoTDB	`apache/iotdb:1.3.0-standalone`
InfluxDB	`influxdb:2.7-alpine`
TimescaleDB	`timescale/timescaledb:latest-pg15` (PostgreSQL 15)

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
benchrunner		benchrunner
charts		charts
iot-benchmark @ ef9dcad		iot-benchmark @ ef9dcad
presentation		presentation
results		results
scripts		scripts
tcc1-latex		tcc1-latex
tcc2-latex		tcc2-latex
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
benchmark.py		benchmark.py
charts.py		charts.py
docker-compose.yml		docker-compose.yml
flake.lock		flake.lock
flake.nix		flake.nix

Folders and files

Latest commit

History

Repository files navigation

A Reproducible Benchmark of Apache IoTDB, InfluxDB, and TimescaleDB for IoT Workloads

Overview

Architecture

Tech Stack

Prerequisites

Setup

Nix

Non-Nix

Running

Options

Scale parameters

Generating Charts

Dependencies

Usage

--source values

Test Types

Write tests

Read tests

Results

Small scale

Medium scale

Large scale

Analysis

Write performance

Read performance

The latest-point throughput illusion

Memory usage

Summary

Medium-scale validation

What held up

False positives corrected

New findings from medium scale

Revised summary

Large-scale validation

What continues to hold

New findings from large scale

Final summary across all scales

Tuned configuration: small-scale re-run

Results

Analysis

TimescaleDB: substantial write improvement from pgtune

InfluxDB reads improved; BATCH-LARGE P99 dropped sharply

Memory measurements are now more meaningful

IoTDB reads show a slight regression

Net assessment

Tuned configuration: medium-scale re-run

Results

Analysis

TimescaleDB: write and read gains mirror small-scale pattern

InfluxDB: P99 drops far more than throughput

IoTDB: largest gains of any database, driven by exclusive resource access and JIT

Memory measurements

Validating medium-scale-validation claims

Tuned configuration: large-scale re-run

Results

Analysis

Critical corrections: TimescaleDB OOO and CPU saturation

InfluxDB: P99 improvements at large scale dwarf throughput gains

IoTDB: write surge, BATCH-SMALL slight regression

TimescaleDB: BATCH-LARGE still times out; point queries unchanged

Validating large-scale-validation claims

Limitations

Test Environment

Host machine

Docker containers

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

`--source` values

Packages