TY - GEN
T1 - ALEX
T2 - 2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020
AU - Ding, Jialin
AU - Minhas, Umar Farooq
AU - Yu, Jia
AU - Wang, Chi
AU - Do, Jaeyoung
AU - Li, Yinan
AU - Zhang, Hantian
AU - Chandramouli, Badrish
AU - Gehrke, Johannes
AU - Kossmann, Donald
AU - Lomet, David
AU - Kraska, Tim
N1 - Publisher Copyright:
© 2020 Association for Computing Machinery.
PY - 2020/6/14
Y1 - 2020/6/14
N2 - Recent work on "learned indexes" has changed the way we look at the decades-old field of DBMS indexing. The key idea is that indexes can be thought of as "models" that predict the position of a key in a dataset. Indexes can, thus, be learned. The original work by Kraska et al. shows that a learned index beats a B+ tree by a factor of up to three in search time and by an order of magnitude in memory footprint. However, it is limited to static, read-only workloads. In this paper, we present a new learned index called ALEX which addresses practical issues that arise when implementing learned indexes for workloads that contain a mix of point lookups, short range queries, inserts, updates, and deletes. ALEX effectively combines the core insights from learned indexes with proven storage and indexing techniques to achieve high performance and low memory footprint. On read-only workloads, ALEX beats the learned index from Kraska et al. by up to 2.2X on performance with up to 15X smaller index size. Across the spectrum of read-write workloads, ALEX beats B+ trees by up to 4.1X while never performing worse, with up to 2000X smaller index size. We believe ALEX presents a key step towards making learned indexes practical for a broader class of database workloads with dynamic updates.
AB - Recent work on "learned indexes" has changed the way we look at the decades-old field of DBMS indexing. The key idea is that indexes can be thought of as "models" that predict the position of a key in a dataset. Indexes can, thus, be learned. The original work by Kraska et al. shows that a learned index beats a B+ tree by a factor of up to three in search time and by an order of magnitude in memory footprint. However, it is limited to static, read-only workloads. In this paper, we present a new learned index called ALEX which addresses practical issues that arise when implementing learned indexes for workloads that contain a mix of point lookups, short range queries, inserts, updates, and deletes. ALEX effectively combines the core insights from learned indexes with proven storage and indexing techniques to achieve high performance and low memory footprint. On read-only workloads, ALEX beats the learned index from Kraska et al. by up to 2.2X on performance with up to 15X smaller index size. Across the spectrum of read-write workloads, ALEX beats B+ trees by up to 4.1X while never performing worse, with up to 2000X smaller index size. We believe ALEX presents a key step towards making learned indexes practical for a broader class of database workloads with dynamic updates.
KW - access methods
KW - B+ tree
KW - learned data structures
KW - learned indexes
UR - https://www.scopus.com/pages/publications/85086254139
UR - https://www.scopus.com/inward/citedby.url?scp=85086254139&partnerID=8YFLogxK
U2 - 10.1145/3318464.3389711
DO - 10.1145/3318464.3389711
M3 - Conference contribution
AN - SCOPUS:85086254139
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 969
EP - 984
BT - SIGMOD 2020 - Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
PB - Association for Computing Machinery
Y2 - 14 June 2020 through 19 June 2020
ER -