TY - GEN
T1 - Clove
T2 - 13th International Conference on Emerging Networking EXperiments and Technologies, CoNEXT 2017
AU - Katta, Naga
AU - Ghag, Aditi
AU - Hira, Mukesh
AU - Keslassy, Isaac
AU - Bergman, Aran
AU - Kim, Changhoon
AU - Rexford, Jennifer L.
N1 - Publisher Copyright:
© 2017 Association for Computing Machinery.
PY - 2017/11/28
Y1 - 2017/11/28
N2 - Most datacenters still use Equal Cost Multi-Path (ECMP), which performs congestion-oblivious hashing of fows over multiple paths, leading to an uneven distribution of traffic. Alternatives to ECMP come with deployment challenges, as they require either changing the tenant VM network stacks (e.g., MPTCP) or replacing all of the switches (e.g., CONGA). We argue that the hypervisor provides a unique point for implementing load-balancing algorithms that are easy to deploy, while still reacting quickly to congestion. We propose Clove, a scalable load-balancer that (i) runs entirely in the hypervisor, requiring no modifications to tenant VM networking stacks or physical switches, and (ii) works on any topology and adapts quickly to topology changes and traffic shifts. Clove relies on standard ECMP in physical switches, discovers paths using a novel traceroute mechanism, uses software-based fowlet-switching, and continuously learns congestion (or path utilization) state using standard switch features. It then manipulates packet-header fields in the hypervisor switch to direct traffic over less congested paths. Clove achieves 1.5 to 7 times smaller flow-completion times at 70% network load than other load-balancing algorithms that work with existing hardware. Clove also captures some 80% of the performance gain of best-of-breed hardware-based load-balancing algorithms like CONGA that require new equipment.
AB - Most datacenters still use Equal Cost Multi-Path (ECMP), which performs congestion-oblivious hashing of fows over multiple paths, leading to an uneven distribution of traffic. Alternatives to ECMP come with deployment challenges, as they require either changing the tenant VM network stacks (e.g., MPTCP) or replacing all of the switches (e.g., CONGA). We argue that the hypervisor provides a unique point for implementing load-balancing algorithms that are easy to deploy, while still reacting quickly to congestion. We propose Clove, a scalable load-balancer that (i) runs entirely in the hypervisor, requiring no modifications to tenant VM networking stacks or physical switches, and (ii) works on any topology and adapts quickly to topology changes and traffic shifts. Clove relies on standard ECMP in physical switches, discovers paths using a novel traceroute mechanism, uses software-based fowlet-switching, and continuously learns congestion (or path utilization) state using standard switch features. It then manipulates packet-header fields in the hypervisor switch to direct traffic over less congested paths. Clove achieves 1.5 to 7 times smaller flow-completion times at 70% network load than other load-balancing algorithms that work with existing hardware. Clove also captures some 80% of the performance gain of best-of-breed hardware-based load-balancing algorithms like CONGA that require new equipment.
UR - http://www.scopus.com/inward/record.url?scp=85040218099&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85040218099&partnerID=8YFLogxK
U2 - 10.1145/3143361.3143401
DO - 10.1145/3143361.3143401
M3 - Conference contribution
AN - SCOPUS:85040218099
T3 - CoNEXT 2017 - Proceedings of the 2017 13th International Conference on emerging Networking EXperiments and Technologies
SP - 323
EP - 335
BT - CoNEXT 2017 - Proceedings of the 2017 13th International Conference on emerging Networking EXperiments and Technologies
PB - Association for Computing Machinery, Inc
Y2 - 12 December 2017 through 15 December 2017
ER -