Learning Multi-Dimensional Indexes

Vikram Nathan, Jialin Ding, Mohammad Alizadeh, Tim Kraska

Research output: Chapter in Book/Report/Conference proceedingConference contribution

179 Scopus citations

Abstract

Scanning and filtering over multi-dimensional tables are key operations in modern analytical database engines. To optimize the performance of these operations, databases often create clustered indexes over a single dimension or multi-dimensional indexes such as R-Trees, or use complex sort orders (e.g., Z-ordering). However, these schemes are often hard to tune and their performance is inconsistent across different datasets and queries. In this paper, we introduce Flood, a multi-dimensional in-memory read-optimized index that automatically adapts itself to a particular dataset and workload by jointly optimizing the index structure and data storage layout. Flood achieves up to three orders of magnitude faster performance for range scans with predicates than state-of-the-art multi-dimensional indexes or sort orders on real-world datasets and workloads. Our work serves as a building block towards an end-to-end learned database system.

Original languageEnglish (US)
Title of host publicationSIGMOD 2020 - Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
PublisherAssociation for Computing Machinery
Pages985-1000
Number of pages16
ISBN (Electronic)9781450367356
DOIs
StatePublished - Jun 14 2020
Externally publishedYes
Event2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020 - Portland, United States
Duration: Jun 14 2020Jun 19 2020

Publication series

NameProceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print)0730-8078

Conference

Conference2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020
Country/TerritoryUnited States
CityPortland
Period6/14/206/19/20

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems

Keywords

  • databases
  • in-memory
  • indexing
  • multi-dimensional
  • primary index

Fingerprint

Dive into the research topics of 'Learning Multi-Dimensional Indexes'. Together they form a unique fingerprint.

Cite this