Update sketches #2965
Labels
dependencies
Updates/changes to Maven or other dependencies
enhancement
Improvement to existing functionality/feature
Describe the new feature you'd like
Building on from #2825, we should discourage use of the clearspring HyperLogLogPlus. The associated aggregator and serialisers should be deprecated, and we should encourage users to use datasketches' HllSketch instead. This should be done in code (as tracked by this ticket), as well as in our docs, tracked by gchq/gaffer-doc#93.
An additional complexity is that the datasketches code has moved from com.yahoo.datasketches:sketches-core to org.apache.datasketches:datasketches-java and has had many version updates. We should figure out whether the new module is compatible with the old module and with our aggregator/serialisers. If so, I suggest we just update the dependency; if not, we should provide another set of classes to interact with the new Apache datasketches and deprecate the Yahoo ones.
Why do you want this feature?
As described in gchq/gaffer-doc#93, datasketches' HllSketch is more performant. It is also actively maintained, whereas clearspring's HyperLogLogPlus is not.
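For reference, if the new module turns out to be compatible, the dependency move described in this ticket might look roughly like the pom.xml fragment below. This is only a sketch: the groupId and artifactId values are the ones named in this ticket, but the datasketches.version property name and its placeholder value are assumptions for illustration, not values taken from the Gaffer build.

```xml
<!-- Hypothetical pom.xml fragment; the property name and version are placeholders -->
<properties>
    <!-- Assumption: set to whichever datasketches-java release is found to be compatible -->
    <datasketches.version>REPLACE_WITH_CHOSEN_VERSION</datasketches.version>
</properties>

<dependencies>
    <!-- Old coordinates, to be removed:
    <dependency>
        <groupId>com.yahoo.datasketches</groupId>
        <artifactId>sketches-core</artifactId>
    </dependency>
    -->
    <!-- New coordinates after the move to Apache -->
    <dependency>
        <groupId>org.apache.datasketches</groupId>
        <artifactId>datasketches-java</artifactId>
        <version>${datasketches.version}</version>
    </dependency>
</dependencies>
```

Note that the Java package prefix also changed with the move (com.yahoo.sketches became org.apache.datasketches), so imports in the aggregator and serialisers would need updating even if the serialised form proves compatible.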