LC4J Benchmark Harness & Replication Repository

Abstract

Large Language Model (LLM) applications are increasingly embedded into production enterprise systems, most of which run on the Java Virtual Machine (JVM). Yet the dominant LLM orchestration frameworks—LangChain, LlamaIndex, and Haystack—are Python-native, forcing architects to choose between a polyglot architecture with cross-runtime overhead and the comparatively young JVM-native ecosystem represented by LangChain4j. No peer-reviewed, statistically rigorous comparison of these ecosystems exists. We present the first systematic head-to-head benchmark of LangChain4j against three Python-native frameworks across four enterprise workloads: single-turn completion, retrieval-augmented generation, three-step tool-using agents, and streaming chat. A deterministic mock-LLM stub isolates orchestration overhead from model inference latency, complemented by end-to-end measurements against a self-hosted Llama-3.1-8B-Instruct served by vLLM. We executed 1,440 independent runs on identical bare-metal hardware, using Mann–Whitney U tests with Bonferroni correction, Cliff’s δ effect sizes, and bootstrap confidence intervals. Under steady-state concurrent load, LangChain4j achieves 1.84× higher throughput and a 47.3% lower 99th-percentile orchestration latency than the best Python framework, but incurs a 2.6× resident-memory premium and a 13.7× cold-start penalty. We interpret these results with a queueing-theoretic model and derive a workload-to-framework decision rule. All artifacts are released for reproducibility.

Scripts

The scripts/ directory contains tools to regenerate the benchmark artifacts. The script generate_kaggle_dataset.py generates the ~18.6 GB Parquet dataset (containing detailed tracing and memory metrics) locally or natively on Kaggle Notebooks.

python scripts/generate_kaggle_dataset.py

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets/.aistudio		assets/.aistudio
harness		harness
scripts		scripts
src		src
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
index.html		index.html
metadata.json		metadata.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LC4J Benchmark Harness & Replication Repository

Abstract

Scripts

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LC4J Benchmark Harness & Replication Repository

Abstract

Scripts

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages