Fast and accurate genomic distances using HyperLogLog
-
Updated
Jan 19, 2023 - C++
Fast and accurate genomic distances using HyperLogLog
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Estimating k-mer coverage histogram of genomics data
SetSketch: Filling the Gap between MinHash and HyperLogLog
Integrates DuckDB with the high-performance Apache DataSketches library. This extension enables users to perform approximate analytics on large-scale datasets using state-of-the-art streaming algorithms, all from within DuckDB.
Real-time cardinality estimator built for the Future of Database Programming Contest. Implements histogram-based methods, HyperLogLog, MCV tracking, and optimized data structures for high-performance. Achieved first place in the competition
HyperLogLog++ Implementation with C++14
Network Anomaly Detection Using Probabilistic Data Structures
HyperLogLog en C++ y OpenMP para cálculo de similitud de genomas mediante índice de Jaccard
C++ program for estimating the entropy of incoming traffic.
Implementation of HyperLogLog algorithm to count number of unique elements in data stream.
Add a description, image, and links to the hyperloglog topic page so that developers can more easily learn about it.
To associate your repository with the hyperloglog topic, visit your repo's landing page and select "manage topics."