Skip to content
C++ C Jupyter Notebook CMake Python Makefile
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
cmake added license Sep 3, 2019
common
cpc fix CMakeLists.txt to capture renamed file Sep 23, 2019
fi
hll
kll Merge pull request #62 from apache/1.0.x-incubating Sep 4, 2019
python
theta
.gitignore
.gitmodules
CMakeLists.txt
DISCLAIMER-WIP
LICENSE
MANIFEST.in
Makefile
NOTICE
README.md
config.mk
pyproject.toml
setup.py

README.md

This is a C++ version of the DataSketches core library. See Apache DataSketches home

Apache DataSketches is an open source, high-performance library of stochastic streaming algorithms commonly called "sketches" in the data sciences. Sketches are small, stateful programs that process massive data as a stream and can provide approximate answers, with mathematical guarantees, to computationally difficult queries orders-of-magnitude faster than traditional, exact methods.

This code requires C++11. It was tested with GCC 4.8.5 (standard in RedHat at the time of this writing), GCC 8.2.0 and Apple LLVM version 10.0.1 (clang-1001.0.46.4)

This includes Python bindings. For the Python interface, see the README notes in the python subdirectory.

This library was intended to be header-only, but this goal was not fully achieved yet with CPC sketch code. This work is in progress.

Building and running unit tests requires CppUnit.

Installing CppUnit on OSX: brew install cppunit

Installing CppUnit on RHEL: yum install cppunit-devel

There are currently two ways of building: using existing make files and generating make files using cmake. Exsisting make files might not work on all platforms or with all C++ compilers. Generating make files using cmake should solve this problem, but it currently requires cmake version 3.12.0 or later that might not be readily available as a package on all platforms.

Installing the latest cmake on OSX: brew install cmake

Building and running unit tests using existing make files:

$ make
$ make test

Building and running unit tests using cmake:

$ mkdir build
$ cd build
$ cmake ..
$ make
$ make test
You can’t perform that action at this time.