Skip to content

t1mm3/dgen

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

dgen

dgen is a flexible data generator for testing and benchmarking purpose. It aims at producing in the desired format quickly and, hence, can be used for generating large amounts of data.

Performance

This benchmark compares optimized single-threaded naive implementations with dgen. The naive implementations generate integers in the given domain and directly write a row to stdout. Note that the naive implementations are hard-coded for this benchmark and do not allow the generation of arbitrary CSVs. lut_printf replaces the integer to string conversion by a lookup table assuming the domain is small enough (64k).

Small benchmark

Observe that single-threaded dgen is a around 3.5x faster than the naive implementations whereas the parallel version of dgen using 4 threads is another 2x faster while still providing a deterministic output.

Further information

About

High-performance CSV data generator

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages