Releases: apache/datasketches-hive

datasketches-hive-1.2.0

07 Mar 23:36

This maintenance release makes this Apache Hive component work with the latest versions of its dependencies, datasketches-java-3.1.0 and datasketches-memory-2.0.0.

Apache Release 1.1.0-incubating

06 Jul 21:20
  • Fixes a critical bug
  • Updates the datasketches-java dependency to 1.3.0-incubating
  • Minor licensing fixes
  • Minor code cleanup

Apache Release 1.0.0-incubating

11 Oct 20:39

This is the initial Apache release for this component.

  • The Java package structure has been changed to org.apache.datasketches
  • The file license headers have been updated with the Apache license header
  • The LICENSE, NOTICE, and DISCLAIMER-WIP files have been added and/or updated.

No other significant code changes from the prior version.

sketches-hive-0.13.0

21 Mar 18:27
  • Based on sketches-core-0.13.0
  • CPC sketch UDFs
  • KLL sketch UDFs
  • additional quantiles sketch UDFs: toString, getN, getCDF
  • additional HLL sketch UDFs: SketchToString, getEstimateAndErrorBounds

sketches-hive-0.11.0

10 Apr 00:08

Compatibility with sketches-core-0.11.0

sketches-hive-0.10.5: new core, HLL late init fix, char and varchar

09 Nov 23:29
  • based on sketches-core-0.10.3
  • support HLL sketch late init from Hive
  • support char and varchar types as HLL and Theta sketch input

sketches-hive-0.10.4: use sketches-core-0.10.2

23 Oct 21:33

This is a maintenance release to use the latest sketches-core-0.10.2

Sketches core 0.10.1, new Tuple sketch UDFs, performance improvement

15 Sep 20:11
  • Based on sketches-core-0.10.1 and memory-0.10.3
  • New Tuple sketch UDFs: ArrayOfDoublesSketchesTTestUDF, ArrayOfDoublesSketchToMeansUDF, ArrayOfDoublesSketchToVariancesUDF, ArrayOfDoublesSketchToEstimateAndErrorBoundsUDF, ArrayOfDoublesSketchToNumberOfRetainedEntriesUDF, ArrayOfDoublesSketchToQuantilesSketchUDF
  • Performance improvement: wrap() is used instead of heapify() in HLL UDFs
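
The difference between the two access styles named in the last item can be illustrated with a small analogy. The sketch below uses only the Java standard library and does not call the sketches API: the class WrapVsHeapifyDemo and its methods are hypothetical names, and "heapify" is simplified here to a plain copy, whereas a real heapify would also rebuild an object on the heap. It shows why reading serialized bytes in place tends to be cheaper than materializing them first.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

public class WrapVsHeapifyDemo {

    // "wrap"-style access (analogy only): a read-only view over the
    // caller's bytes. Nothing is copied.
    static ByteBuffer wrapView(byte[] serialized) {
        return ByteBuffer.wrap(serialized).asReadOnlyBuffer();
    }

    // "heapify"-style access (analogy only, simplified to a copy):
    // the serialized bytes are duplicated into a new heap array.
    static byte[] heapifyCopy(byte[] serialized) {
        return Arrays.copyOf(serialized, serialized.length);
    }

    public static void main(String[] args) {
        byte[] serialized = {1, 2, 3, 4};

        ByteBuffer view = wrapView(serialized);
        byte[] copy = heapifyCopy(serialized);

        // Mutating the source shows which one shares storage with it.
        serialized[0] = 9;

        System.out.println("view sees: " + view.get(0));
        System.out.println("copy sees: " + copy[0]);
    }
}
```

The view shares storage with the input, so it costs no allocation per call; the copy is independent of the input but pays for allocation and copying each time.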

HllSketch performance improvement for strings

30 Jun 22:45
  • HLL DataToSketchUDAF: Input strings are converted to char[] before being passed to HllSketch. This is substantially faster than passing strings because it avoids the UTF-8 conversion. Warning: strings are now effectively hashed with a different hash function, so a union of sketches produced by this version with sketches produced by the previous version will treat identical strings as distinct items (no overlap) and therefore produce incorrect results. We recommend upgrading to this version and recomputing from the raw data any stored sketches that were created with string inputs.
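
The incompatibility described above comes from the hash seeing different input bytes for the same string. The following is a minimal sketch using only the Java standard library. The class StringHashInputDemo is a hypothetical name, CRC32 is a stand-in and not the hash function used by HllSketch, and the little-endian byte layout chosen for the char[] is an assumption made purely for illustration.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;
import java.util.zip.CRC32;

public class StringHashInputDemo {

    // Bytes a hash would see if the string is first encoded as UTF-8.
    static byte[] utf8Bytes(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }

    // Bytes a hash would see if the raw UTF-16 code units (char[]) are
    // hashed directly. The byte order is an illustrative assumption.
    static byte[] charArrayBytes(String s) {
        char[] chars = s.toCharArray();
        ByteBuffer buf = ByteBuffer.allocate(chars.length * 2)
                                   .order(ByteOrder.LITTLE_ENDIAN);
        for (char c : chars) {
            buf.putChar(c);
        }
        return buf.array();
    }

    // Stand-in hash for the demonstration; NOT what HllSketch uses.
    static long standInHash(byte[] data) {
        CRC32 crc = new CRC32();
        crc.update(data);
        return crc.getValue();
    }

    public static void main(String[] args) {
        String item = "user-42";
        byte[] asUtf8 = utf8Bytes(item);
        byte[] asChars = charArrayBytes(item);

        System.out.println("UTF-8 input length:  " + asUtf8.length);
        System.out.println("char[] input length: " + asChars.length);
        System.out.println("hash of UTF-8 input:  " + standInHash(asUtf8));
        System.out.println("hash of char[] input: " + standInHash(asChars));
    }
}
```

Because the two byte sequences differ even for plain ASCII text, the same string generally maps to unrelated hash values under the two paths, which is why sketches built before and after the change cannot be meaningfully unioned.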

HLL sketch UDAFs and UDFs

26 Jun 23:20
  • Added DataToSketchUDAF, UnionSketchUDAF, SketchToEstimateUDF, and UnionSketchUDF for the HLL sketch