• The Challenge
  • The Major Sketch Families
  • Sketch Origins
  • Sketch Elements
  • Key Features
  • Large Scale Computing
  • Architecture
  • Overview Slide Deck 18Sep17
  • ATI Slide Deck 1Nov17
  • Research

  • Frequent Items Overview
  • Frequent Items Java Example
  • Frequent Items Pig UDFs
  • Frequent Items Hive UDFs
  • Frequent Items Error Table
  • Frequent Items References
  • HLL Sketch
  • HLL vs HLL++
  • HLL Sketch Java Example
  • HLL Sketch Pig UDFs
  • HLL Sketch Hive UDFs
  • HLL Map Sketch
  • Memory Package
  • Quantiles Overview
  • Quantiles Accuracy and Size
  • Quantiles Sketch Java Example
  • Quantiles Sketch Pig UDFs
  • Quantiles Sketch Hive UDFs
  • Optimal Quantile Approximation in Streams
  • Quantiles References
  • Reservoir Sampling
  • Reservoir Sampling Performance
  • Reservoir Sampling Java Example
  • Reservoir Sampling Pig UDFs
  • VarOpt Sampling
  • VarOpt Sampling Java Example
  • VarOpt Sampling Pig UDFs
  • Theta Sketch Framework
  • Theta Sketch Java Example
  • Theta Sketch Spark Example
  • The Inverse Estimate
  • Empty Sketch
  • First Estimator
  • Better Estimator
  • Rejection Rules
  • Update V(kth) Rule
  • Set Operations
  • Basic Accuracy
  • Accuracy Plots
  • Relative Error Table
  • SetOp Accuracy
  • Unions With Different k
  • Theta Sketch Size
  • Update Speed
  • Merge Speed
  • Theta Sketch Pig UDFs
  • Theta Sketch Hive UDFs
  • Integration with Druid
  • Memory Package
  • p-Sampling
  • Theta Sketch Framework (PDF)
  • Sketch Equations (PDF)
  • DataSketches (PDF)
  • Confidence Intervals Notes
  • Merging Algorithm Notes
  • Theta References
  • Tuple Sketch Overview
  • Tuple Sketch Java Example
  • Tuple Sketch Pig UDFs
  • Tuple Sketch Hive UDFs
  • Creating Command Line Executables
  • Who Uses
  • License