TY - JOUR
T1 - Spectral redemption in clustering sparse networks
AU - Krzakala, Florent
AU - Moore, Cristopher
AU - Mossel, Elchanan
AU - Neeman, Joe
AU - Sly, Allan
AU - Zdeborová, Lenka
AU - Zhang, Pan
PY - 2013/12/24
Y1 - 2013/12/24
N2 - Spectral algorithms are classic approaches to clustering and community detection in networks. However, for sparse networks the standard versions of these algorithms are suboptimal, in some cases completely failing to detect communities evenwhen other algorithms such as belief propagation can do so. Here, we present a class of spectral algorithms based on a nonbacktracking walk on the directed edges of the graph. The spectrum of this operator is much betterbehaved than that of the adjacency matrix or other commonly used matrices,maintaining a strong separation between the bulk eigenvalues and the eigenvalues relevant to community structure even in the sparse case. We show that our algorithm is optimal for graphs generated by the stochastic block model, detecting communities all of the way down to the theoretical limit.We also show the spectrum of the nonbacktracking operator for some real-world networks, illustrating its advantages over traditional spectral clustering.
AB - Spectral algorithms are classic approaches to clustering and community detection in networks. However, for sparse networks the standard versions of these algorithms are suboptimal, in some cases completely failing to detect communities evenwhen other algorithms such as belief propagation can do so. Here, we present a class of spectral algorithms based on a nonbacktracking walk on the directed edges of the graph. The spectrum of this operator is much betterbehaved than that of the adjacency matrix or other commonly used matrices,maintaining a strong separation between the bulk eigenvalues and the eigenvalues relevant to community structure even in the sparse case. We show that our algorithm is optimal for graphs generated by the stochastic block model, detecting communities all of the way down to the theoretical limit.We also show the spectrum of the nonbacktracking operator for some real-world networks, illustrating its advantages over traditional spectral clustering.
UR - http://www.scopus.com/inward/record.url?scp=84891363833&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84891363833&partnerID=8YFLogxK
U2 - 10.1073/pnas.1312486110
DO - 10.1073/pnas.1312486110
M3 - Article
C2 - 24277835
AN - SCOPUS:84891363833
SN - 0027-8424
VL - 110
SP - 20935
EP - 20940
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 52
ER -