TY - GEN
T1 - Tight Analyses of Ordered and Unordered Linear Probing
AU - Braverman, Mark
AU - Kuszmaul, William
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Linear-probing hash tables have been classically believed to support insertions in time Θ(x2), where 1-1x is the load factor of the hash table. Recent work by Bender, Kuszmaul, and Kuszmaul (FOCS'21), however, has added a new twist to this story: in some versions of linear probing, if the maximum load factor is at most 1-1x, then the amortized expected time per insertion will never exceed x polylog x (even in workloads that operate continuously at a load factor of 1-1x). Determining the exact asymptotic value for the amortized insertion time remains open. In this paper, we settle the amortized complexity with matching upper and lower bounds of Θ(xlog1.5x). Along the way, we also obtain tight bounds for the so-called path surplus problem, a problem in combinatorial geometry that has been shown to be closely related to linear probing. We also show how to extend Bender et al.'s bounds to say something not just about ordered linear probing (the version they study) but also about classical linear probing, in the form that is most widely implemented in practice.
AB - Linear-probing hash tables have been classically believed to support insertions in time Θ(x2), where 1-1x is the load factor of the hash table. Recent work by Bender, Kuszmaul, and Kuszmaul (FOCS'21), however, has added a new twist to this story: in some versions of linear probing, if the maximum load factor is at most 1-1x, then the amortized expected time per insertion will never exceed x polylog x (even in workloads that operate continuously at a load factor of 1-1x). Determining the exact asymptotic value for the amortized insertion time remains open. In this paper, we settle the amortized complexity with matching upper and lower bounds of Θ(xlog1.5x). Along the way, we also obtain tight bounds for the so-called path surplus problem, a problem in combinatorial geometry that has been shown to be closely related to linear probing. We also show how to extend Bender et al.'s bounds to say something not just about ordered linear probing (the version they study) but also about classical linear probing, in the form that is most widely implemented in practice.
KW - data structures
KW - hash tables
KW - linear probing
KW - randomized algorithms
KW - robin hood hashing
UR - https://www.scopus.com/pages/publications/85213071421
UR - https://www.scopus.com/pages/publications/85213071421#tab=citedBy
U2 - 10.1109/FOCS61266.2024.00046
DO - 10.1109/FOCS61266.2024.00046
M3 - Conference contribution
AN - SCOPUS:85213071421
T3 - Proceedings - Annual IEEE Symposium on Foundations of Computer Science, FOCS
SP - 606
EP - 635
BT - Proceedings - 2024 IEEE 65th Annual Symposium on Foundations of Computer Science, FOCS 2024
PB - IEEE Computer Society
T2 - 65th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2024
Y2 - 27 October 2024 through 30 October 2024
ER -