Ravana: Controller fault-tolerance in software-defined networking

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

110 Scopus citations

Abstract

Software-defined networking (SDN) offers greater flexibility than traditional distributed architectures, at the risk of the controller being a single point of failure. Unfortunately, existing fault-tolerance techniques, such as replicated state machines, are insufficient to ensure correct network behavior under controller failures. The challenge is that, in addition to the application state of the controllers, the switches maintain hard state that must be handled consistently. Thus, it is necessary to incorporate switch state into the system model to correctly offer a "logically centralized" controller. We introduce Ravana, a fault-tolerant SDN controller platform that processes the control messages transactionally and exactly once (at both the controllers and the switches). Ravana maintains these guarantees in the face of both controller and switch crashes. The key insight in Ravana is that replicated state machines can be extended with lightweight switch-side mechanisms to guarantee correctness, without involving the switches in an elaborate consensus protocol. Our prototype implementation of Ravana enables unmodified controller applications to execute in a fault-tolerant fashion. Experiments show that Ravana achieves high throughput with reasonable overhead, compared to a single controller, with a failover time under 100 ms.
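
The abstract does not spell out the switch-side mechanisms, so the following is only an illustrative sketch of the general idea of exactly-once processing across a controller crash, not Ravana's actual protocol. It assumes a toy model in which a switch buffers each event until it is acknowledged and ignores commands whose ID it has already applied, and in which a plain Python list stands in for a replicated log. All class, method, and parameter names (`Switch`, `Controller`, `emit`, `ack`, `apply`, `recover`, `crash_before_ack`) are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Switch:
    """Toy switch (hypothetical): buffers events until acked, filters duplicate commands."""
    unacked_events: dict = field(default_factory=dict)  # event_id -> payload
    seen_commands: set = field(default_factory=set)     # IDs of commands already applied
    flow_table: list = field(default_factory=list)      # applied rules, in order

    def emit(self, event_id, payload):
        # Keep the event so it can be re-delivered if the controller crashes.
        self.unacked_events[event_id] = payload

    def ack(self, event_id):
        # The controller confirms the event was fully processed.
        self.unacked_events.pop(event_id, None)

    def apply(self, command_id, rule):
        # A command re-sent after failover carries the same ID and is ignored.
        if command_id in self.seen_commands:
            return False
        self.seen_commands.add(command_id)
        self.flow_table.append(rule)
        return True


class Controller:
    """Toy controller (hypothetical); shared_log stands in for a replicated log."""

    def __init__(self, shared_log, app):
        self.log = shared_log
        self.app = app  # assumed deterministic: payload -> list of rules

    def handle(self, switch, event_id, crash_before_ack=False):
        payload = switch.unacked_events[event_id]
        if event_id not in self.log:
            self.log.append(event_id)  # record the event once, before acting on it
        for i, rule in enumerate(self.app(payload)):
            # Command IDs are derived from the event, so a replay reuses them.
            switch.apply((event_id, i), rule)
        if crash_before_ack:
            return  # simulate the controller failing before it acknowledges
        switch.ack(event_id)

    def recover(self, switch):
        # A new master re-processes every event the switch still holds.
        for event_id in sorted(switch.unacked_events):
            self.handle(switch, event_id)
```

In this toy model, if the first controller crashes after installing a rule but before acknowledging the event, a replacement controller that calls `recover` re-processes the buffered event; the switch discards the duplicate command, so the rule appears in the flow table only once.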

Original language: English (US)
Title of host publication: Symposium on Software Defined Networking (SDN) Research, SOSR 2015
Publisher: Association for Computing Machinery, Inc
ISBN (Electronic): 9781450334518
State: Published - Jun 17 2015
Event: 1st ACM SIGCOMM Symposium on Software Defined Networking Research, SOSR 2015 - Santa Clara, United States
Duration: Jun 17 2015 → Jun 18 2015

Publication series

Name: Symposium on Software Defined Networking (SDN) Research, SOSR 2015

Other

Other: 1st ACM SIGCOMM Symposium on Software Defined Networking Research, SOSR 2015
Country/Territory: United States
City: Santa Clara
Period: 6/17/15 → 6/18/15

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Software

Keywords

  • Fault-Tolerance
  • OpenFlow
  • Replicated State Machines
  • Software-Defined Networking
