Catching the microburst culprits with snappy

Xiaoqi Chen, Shir Landau Feibish, Yaron Koral, Jennifer L. Rexford, Ori Rottenstreich

Research output: Chapter in Book/Report/Conference proceedingConference contribution

48 Scopus citations

Abstract

Short-lived traffic surges, known as microbursts, can cause periods of unexpectedly high packet delay and loss on a link. Today, preventing microbursts requires deploying switches with larger packet buffers (incurring higher cost) or running the network at low utilization (sacrificing efficiency). Instead, we argue that switches should detect microbursts as they form, and take corrective action before the situation gets worse. This requires an efficient way for switches to identify the particular flows responsible for a microburst, and handle them automatically (e.g., by pacing, marking, or rerouting the packets). However, collecting fine-grained statistics about queue occupancy in real time is challenging, even with emerging programmable data planes. We present Snappy, which identifies the flows responsible for a microburst in real time. Snappy maintains multiple snapshots of the occupants of the queue over time, where each snapshot is a compact data structure that makes efficient use of data-plane memory. As each new packet arrives, Snappy updates one snapshot and also estimates the fraction of the queue occupied by the associated flow. Our simulations with data-center packet traces show that Snappy can target the flows responsible for microbursts at the sub-millisecond level.

Original languageEnglish (US)
Title of host publicationSelfDN 2018 - Proceedings of the 2018 Afternoon Workshop on Self-Driving Networks, Part of SIGCOMM 2018
PublisherAssociation for Computing Machinery, Inc
Pages22-28
Number of pages7
ISBN (Print)9781450359146
DOIs
StatePublished - Aug 7 2018
Event2018 Afternoon Workshop on Self-Driving Networks, SelfDN 2018 - Budapest, Hungary
Duration: Aug 24 2018 → …

Publication series

NameSelfDN 2018 - Proceedings of the 2018 Afternoon Workshop on Self-Driving Networks, Part of SIGCOMM 2018

Other

Other2018 Afternoon Workshop on Self-Driving Networks, SelfDN 2018
Country/TerritoryHungary
CityBudapest
Period8/24/18 → …

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Catching the microburst culprits with snappy'. Together they form a unique fingerprint.

Cite this