# jsonbench


### Purpose

To compare the throughput of various JSON implementations and to find the fastest way to process JSON log files.

See the Makefile for details.

This benchmarks reading a newline-delimited JSON file. To run it, do something like:

    make node JSON=path/to/my/json/dump.txt

pipejson contains two executables that multiplex stdin across multiple forked processes. Unfortunately, I could not get this approach above 400 MB/s consistently.
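
For reference, GNU parallel's `--pipe` mode does roughly the same stdin fan-out in userspace. A minimal sketch, assuming GNU parallel is installed and using `cxxp` as a stand-in for whatever parser binary you build from this repo:

```sh
# Not part of this repo: fan stdin out to 4 worker processes,
# handing each one 10 MB blocks of whole lines.
cat dump.json | parallel --pipe --block 10M -j 4 ./cxxp
```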

The winning approach appears to be sharding the data into multiple files and letting Linux manage the IO multiplexing.
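
A minimal sketch of that sharded setup, assuming GNU split and a parser (again `cxxp` as a stand-in) that reads newline-delimited JSON on stdin:

```sh
# Split the dump into 4 shards without breaking lines, then run one
# parser process per shard and let the kernel schedule the IO.
split -n l/4 -d dump.json shard.
for f in shard.*; do
  ./cxxp < "$f" &
done
wait
```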

lz4 appears to be the fastest compression algorithm. On my quad-core i5-3330S, I can decompress and process JSON at 450 MB/s with lz4 using the following command:

    time make -j cxxp PIPE="lz4demo -d ~/work/data-telemetry/20130305-20130306.json.lz4 stdout"

### Generating Sample Data

  1. Enable Telemetry in your browser.
  2. Close the browser.
  3. Go to saved-telemetry-pings in your profile directory and copy those files somewhere safe.
  4. Write a program to concatenate those files into a few GB of input data (see the sketch below).
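
A minimal sketch for step 4, assuming the pings sit under `saved-telemetry-pings/` and each file holds a single JSON document:

```sh
# Append one JSON document per line, looping over the saved pings
# until the output file reaches roughly 4 GB (GNU stat syntax).
OUT=dump.json
: > "$OUT"
while [ "$(stat -c%s "$OUT")" -lt $((4 * 1024 * 1024 * 1024)) ]; do
  for f in saved-telemetry-pings/*; do
    cat "$f" >> "$OUT"
    echo >> "$OUT"    # ensure each document ends with a newline
  done
done
```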
