Repository logo
  • English
  • العربية
  • বাংলা
  • Català
  • Čeština
  • Deutsch
  • Ελληνικά
  • Español
  • Suomi
  • Français
  • Gàidhlig
  • हिंदी
  • Magyar
  • Italiano
  • Қазақ
  • Latviešu
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Srpski (lat)
  • Српски
  • Svenska
  • Türkçe
  • Yкраї́нська
  • Tiếng Việt
Log In
New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Scholalry Output
  3. Publications
  4. A framework for estimating stream expression cardinalities
 
  • Details

A framework for estimating stream expression cardinalities

Source
Leibniz International Proceedings in Informatics Lipics
ISSN
18688969
Date Issued
2016-03-01
Author(s)
Dasgupta, Anirban  
Lang, Kevin J.
Rhodes, Lee
Thaler, Justin
DOI
10.4230/LIPIcs.ICDT.2016.6
Volume
48
Abstract
Given m distributed data streams A1, . . . ,Am, we consider the problem of estimating the number of unique identifiers in streams defined by set expressions over A1, . . . ,Am. We identify a broad class of algorithms for solving this problem, and show that the estimators output by any algorithm in this class are perfectly unbiased and satisfy strong variance bounds. Our analysis unifies and generalizes a variety of earlier results in the literature. To demonstrate its generality, we describe several novel sampling algorithms in our class, and show that they achieve a novel tradeoff between accuracy, space usage, update speed, and applicability.
URI
https://d8.irins.org/handle/IITG2025/21945
Subjects
Data Stream Algorithms | Distinct Elements | Mergeability | Set Operations | Sketching
IITGN Knowledge Repository Developed and Managed by Library

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
Repository logo COAR Notify