Profiling network performance for multi-tier data center applications

Minlan Yu, Albert Greenberg, Dave Maltz, Jennifer Rexford, Lihua Yuan, Srikanth Kandula, Changhoon Kim

Research output: Contribution to conference › Paper › peer-review

87 Scopus citations

Abstract

Network performance problems are notoriously tricky to diagnose, and this is magnified when applications are often split into multiple tiers of application components spread across thousands of servers in a data center. Problems often arise in the communication between the tiers, where either the application or the network (or both!) could be to blame. In this paper, we present SNAP, a scalable network-application profiler that guides developers in identifying and fixing performance problems. SNAP passively collects TCP statistics and socket-call logs with low computation and storage overhead, and correlates across shared resources (e.g., host, link, switch) and connections to pinpoint the location of the problem (e.g., send buffer mismanagement, TCP/application conflicts, application-generated microbursts, or network congestion). Our one-week deployment of SNAP in a production data center (with over 8,000 servers and over 700 application components) has already helped developers uncover 15 major performance problems in application software, the network stack on the server, and the underlying network.

Original language: English (US)
Pages: 57-70
Number of pages: 14
State: Published - Jan 1 2011
Event: 8th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2011 - Boston, United States
Duration: Mar 30 2011 to Apr 1 2011

Conference

Conference: 8th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2011
Country/Territory: United States
City: Boston
Period: 3/30/11 to 4/1/11

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Control and Systems Engineering
