Blink: Fast connectivity recovery entirely in the data plane

Thomas Holterbach, Edgar Costa Molero, Maria Apostolaki, Alberto Dainotti, Stefano Vissicchio, Laurent Vanbever

Research output: Chapter in Book/Report/Conference proceedingConference contribution

87 Scopus citations

Abstract

We present Blink, a data-driven system that leverages TCPinduced signals to detect failures directly in the data plane. The key intuition behind Blink is that a TCP flow exhibits a predictable behavior upon disruption: retransmitting the same packet over and over, at epochs exponentially spaced in time. When compounded over multiple flows, this behavior creates a strong and characteristic failure signal. Blink efficiently analyzes TCP flows to: (i) select which ones to track; (ii) reliably and quickly detect major traffic disruptions; and (iii) recover connectivity-all this, completely in the data plane. We present an implementation of Blink in P4 together with an extensive evaluation on real and synthetic traffic traces. Our results indicate that Blink: (i) achieves sub-second rerouting for large fractions of Internet traffic; and (ii) prevents unnecessary traffic shifts even in the presence of noise. We further show the feasibility of Blink by running it on an actual Tofino switch.

Original languageEnglish (US)
Title of host publicationProceedings of the 16th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2019
PublisherUSENIX Association
Pages161-176
Number of pages16
ISBN (Electronic)9781931971492
StatePublished - 2019
Externally publishedYes
Event16th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2019 - Boston, United States
Duration: Feb 26 2019Feb 28 2019

Publication series

NameProceedings of the 16th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2019

Conference

Conference16th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2019
Country/TerritoryUnited States
CityBoston
Period2/26/192/28/19

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Blink: Fast connectivity recovery entirely in the data plane'. Together they form a unique fingerprint.

Cite this