TY - GEN
T1 - Identifying performance bottlenecks in CDNs through TCP-level monitoring
AU - Sun, Peng
AU - Yu, Minlan
AU - Freedman, Michael Joseph
AU - Rexford, Jennifer L.
PY - 2011
Y1 - 2011
N2 - Content distribution networks (CDNs) need to make decisions, such as server selection and routing, to improve performance for their clients. The performance may be limited by various factors such as packet loss in the network, a small receive buffer at the client, or constrained server CPU and disk resources. Conventional measurement techniques are not effective for distinguishing these performance problems: application-layer logs are too coarse-grained, while network-level traces are too expensive to collect all the time. We argue that passively monitoring the transport-level statistics in the server's network stack is a better approach. This paper presents a tool for monitoring and analyzing TCP statistics, and an analysis of a CoralCDN node in PlanetLab for six weeks. Our analysis shows that more than 10% of connections are server-limited at least 40% of the time, and many connections are limited by the congestion window despite no packet loss. Still, we see that clients in 377 Autonomous Systems (ASes) experience persistent packet loss. By separating network congestion from other performance problems, our analysis provides a much more accurate view of the performance of the network paths than what is possible with server logs alone.
AB - Content distribution networks (CDNs) need to make decisions, such as server selection and routing, to improve performance for their clients. The performance may be limited by various factors such as packet loss in the network, a small receive buffer at the client, or constrained server CPU and disk resources. Conventional measurement techniques are not effective for distinguishing these performance problems: application-layer logs are too coarse-grained, while network-level traces are too expensive to collect all the time. We argue that passively monitoring the transport-level statistics in the server's network stack is a better approach. This paper presents a tool for monitoring and analyzing TCP statistics, and an analysis of a CoralCDN node in PlanetLab for six weeks. Our analysis shows that more than 10% of connections are server-limited at least 40% of the time, and many connections are limited by the congestion window despite no packet loss. Still, we see that clients in 377 Autonomous Systems (ASes) experience persistent packet loss. By separating network congestion from other performance problems, our analysis provides a much more accurate view of the performance of the network paths than what is possible with server logs alone.
KW - content distribution network
KW - performance bottleneck
KW - tcp
UR - http://www.scopus.com/inward/record.url?scp=80053038839&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80053038839&partnerID=8YFLogxK
U2 - 10.1145/2018602.2018615
DO - 10.1145/2018602.2018615
M3 - Conference contribution
AN - SCOPUS:80053038839
SN - 9781450308007
T3 - Proceedings of the 1st ACM SIGCOMM Workshop on Measurements Up the Stack, W-MUST'11
SP - 49
EP - 54
BT - Proceedings of the 1st ACM SIGCOMM Workshop on Measurements Up the Stack, W-MUST'11
T2 - 1st ACM SIGCOMM Workshop on Measurements Up the Stack, W-MUST'11, Co-located with SIGCOMM 2011
Y2 - 15 August 2011 through 19 August 2011
ER -