TY - GEN
T1 - Measurement methods for fast and accurate blackhole identification with binary tomography
AU - Cunha, Ítalo
AU - Teixeira, Renata
AU - Feamster, Nick
AU - Diot, Christophe
PY - 2009
Y1 - 2009
N2 - Binary tomography-the process of identifying faulty network links through coordinated end-to-end probes-is a promising method for detecting failures that the network does not automatically mask (e.g., network "blackholes"). Because tomography is sensitive to the quality of the input, however, näive end-to-end measurements can introduce inaccuracies. This paper develops two methods for generating inputs to binary tomography algorithms that improve their inference speed and accuracy. Failure confirmation is a perpath probing technique to distinguish packet losses caused by congestion from persistent link or node failures. Aggregation strategies combine path measurements from unsynchronized monitors into a set of consistent observations. When used in conjunction with existing binary tomography algorithms, our methods identify all failures that are longer than two measurement cycles, while inducing relatively few false alarms. In two wide-area networks, our techniques decrease the number of alarms by as much as two orders of magnitude. Compared to the state of the art in binary tomography, our techniques increase the identification rate and avoid hundreds of false alarms.
AB - Binary tomography-the process of identifying faulty network links through coordinated end-to-end probes-is a promising method for detecting failures that the network does not automatically mask (e.g., network "blackholes"). Because tomography is sensitive to the quality of the input, however, näive end-to-end measurements can introduce inaccuracies. This paper develops two methods for generating inputs to binary tomography algorithms that improve their inference speed and accuracy. Failure confirmation is a perpath probing technique to distinguish packet losses caused by congestion from persistent link or node failures. Aggregation strategies combine path measurements from unsynchronized monitors into a set of consistent observations. When used in conjunction with existing binary tomography algorithms, our methods identify all failures that are longer than two measurement cycles, while inducing relatively few false alarms. In two wide-area networks, our techniques decrease the number of alarms by as much as two orders of magnitude. Compared to the state of the art in binary tomography, our techniques increase the identification rate and avoid hundreds of false alarms.
KW - Diagnosis
KW - Network tomography
KW - Troubleshooting
UR - http://www.scopus.com/inward/record.url?scp=84877759417&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84877759417&partnerID=8YFLogxK
U2 - 10.1145/1644893.1644924
DO - 10.1145/1644893.1644924
M3 - Conference contribution
AN - SCOPUS:84877759417
SN - 9781605587707
T3 - Proceedings of the ACM SIGCOMM Internet Measurement Conference, IMC
SP - 254
EP - 266
BT - IMC 2009 - Proceedings of the 2009 ACM SIGCOMM Internet Measurement Conference
T2 - 2009 9th ACM SIGCOMM Internet Measurement Conference, IMC 2009
Y2 - 4 November 2009 through 6 November 2009
ER -