TY - GEN
T1 - Faster update time for turnstile streaming algorithms
AU - Alman, Josh
AU - Yu, Huacheng
N1 - Publisher Copyright:
Copyright © 2020 by SIAM
PY - 2020
Y1 - 2020
N2 - In this paper, we present a new algorithm for maintaining linear sketches in turnstile streams with faster update time. As an application, we show that log n Count sketches or CountMin sketches with a constant number of columns (i.e., buckets) can be implicitly maintained in worst-case O(log0.582 n) update time using O(log n) words of space, on a standard word RAM with word-size w = Θ(log n). The exponent 0.582 ≈ 2ω/3 − 1, where ω is the current matrix multiplication exponent. Due to the numerous applications of linear sketches, our algorithm improves the update time for many streaming problems in turnstile streams, in the high success probability setting, without using more space, including `2 norm estimation, `2 heavy hitters, point query with `1 or `2 error, etc. Our algorithm generalizes, with the same update time and space, to maintaining log n linear sketches, where each sketch 1. partitions the coordinates into k < logo(1) n buckets using a c-wise independent hash function for constant c, 2. maintains the sum of coordinates for each bucket. Moreover, if arbitrary word operations are allowed, the update time can be further improved to O(log0.187 n), where 0.187 ≈ ω/2 − 1. Our update algorithm is adaptive, and it circumvents the non-adaptive cell-probe lower bounds for turnstile streaming algorithms by Larsen, Nelson and Nguyên (STOC'15). On the other hand, our result also shows that proving unconditional cell-probe lower bound for the update time seems very difficult, even if the space is restricted to be (nearly) the optimum. If ω = 2, the cell-probe update time of our algorithm would be logo(1) n. Hence, proving any higher lower bound would imply ω > 2.
AB - In this paper, we present a new algorithm for maintaining linear sketches in turnstile streams with faster update time. As an application, we show that log n Count sketches or CountMin sketches with a constant number of columns (i.e., buckets) can be implicitly maintained in worst-case O(log0.582 n) update time using O(log n) words of space, on a standard word RAM with word-size w = Θ(log n). The exponent 0.582 ≈ 2ω/3 − 1, where ω is the current matrix multiplication exponent. Due to the numerous applications of linear sketches, our algorithm improves the update time for many streaming problems in turnstile streams, in the high success probability setting, without using more space, including `2 norm estimation, `2 heavy hitters, point query with `1 or `2 error, etc. Our algorithm generalizes, with the same update time and space, to maintaining log n linear sketches, where each sketch 1. partitions the coordinates into k < logo(1) n buckets using a c-wise independent hash function for constant c, 2. maintains the sum of coordinates for each bucket. Moreover, if arbitrary word operations are allowed, the update time can be further improved to O(log0.187 n), where 0.187 ≈ ω/2 − 1. Our update algorithm is adaptive, and it circumvents the non-adaptive cell-probe lower bounds for turnstile streaming algorithms by Larsen, Nelson and Nguyên (STOC'15). On the other hand, our result also shows that proving unconditional cell-probe lower bound for the update time seems very difficult, even if the space is restricted to be (nearly) the optimum. If ω = 2, the cell-probe update time of our algorithm would be logo(1) n. Hence, proving any higher lower bound would imply ω > 2.
UR - http://www.scopus.com/inward/record.url?scp=85084033855&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85084033855&partnerID=8YFLogxK
U2 - 10.1137/1.9781611975994.110
DO - 10.1137/1.9781611975994.110
M3 - Conference contribution
AN - SCOPUS:85084033855
T3 - Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms
SP - 1803
EP - 1813
BT - 31st Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2020
A2 - Chawla, Shuchi
PB - Association for Computing Machinery
T2 - 31st Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2020
Y2 - 5 January 2020 through 8 January 2020
ER -