Skip to content
Java Sketch Characterization Code.
Java
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
src
tools
.gitattributes
.gitignore
.travis.yml
DISCLAIMER-WIP
LICENSE
NOTICE
README.md
pom.xml

README.md

Characterization

We define characterization as the task of comprehensively measuring accuracy or speed performance of our library. These characterization tests are often long running (some can run for days) and very resource intensive, which makes them unsuitable for including in unit tests. The code in this repository are some of the test suites we use to create some of the plots on our website and provide evidence for our speed and accuracy claims. This code is shared here so that others can duplicate our own characterizations.

The code here is shared "as-is" and does not pretend to have the same level of quality as the primary repositories (jave, pig, hive and vector). This code is not archived to Maven Central and will change from time-to-time as we grow these characterization suites.

Documentation

DataSketches Library Website

Build Instructions

JDK8 is Required Compiler

This DataSketches component is pure Java and you must compile using JDK 8.

Recommended Build Tool

This DataSketches component is structured as a Maven project and Maven is the recommended Build Tool.

There are two types of tests: normal unit tests and tests run by the strict profile.

To run normal unit tests:

$ mvn clean test

To run the strict profile tests:

$ mvn clean test -P strict

Dependencies

Run-time

See the pom.xml for the top-level dependencies.

Testing

See the pom.xml file for test dependencies.

Resources

Issues for datasketches-checkstyle

Forum

Dev mailing list

You can’t perform that action at this time.