Object storage on CRAQ: High-throughput chain replication for read-mostly workloads

Jeff Terrace, Michael J. Freedman

Research output: Chapter in Book/Report/Conference proceedingConference contribution

29 Scopus citations

Abstract

Massive storage systems typically replicate and partition data over many potentially-faulty components to provide both reliability and scalability. Yet many commercially-deployed systems, especially those designed for interactive use by customers, sacrifice stronger consistency properties in the desire for greater availability and higher throughput. This paper describes the design, implementation, and evaluation of CRAQ, a distributed object-storage system that challenges this inflexible tradeoff. Our basic approach, an improvement on Chain Replication, maintains strong consistency while greatly improving read throughput. By distributing load across all object replicas, CRAQ scales linearly with chain size without increasing consistency coordination. At the same time, it exposes non-committed operations for weaker consistency guarantees when this suffices for some applications, which is especially useful under periods of high system churn. This paper explores additional design and implementation considerations for geo-replicated CRAQ storage across multiple datacenters to provide locality-optimized operations. We also discuss multi-object atomic updates and multicast optimizations for large-object updates.

Original languageEnglish (US)
Title of host publicationProceedings of the 2009 USENIX Annual Technical Conference
PublisherUSENIX Association
Pages143-158
Number of pages16
ISBN (Electronic)9781931971683
StatePublished - Jan 1 2019
Event2009 USENIX Annual Technical Conference - San Diego, United States
Duration: Jun 14 2009Jun 19 2009

Publication series

NameProceedings of the 2009 USENIX Annual Technical Conference

Conference

Conference2009 USENIX Annual Technical Conference
Country/TerritoryUnited States
CitySan Diego
Period6/14/096/19/09

All Science Journal Classification (ASJC) codes

  • General Computer Science

Fingerprint

Dive into the research topics of 'Object storage on CRAQ: High-throughput chain replication for read-mostly workloads'. Together they form a unique fingerprint.

Cite this