TY - JOUR
T1 - Finding a good neighbor, near and fast
AU - Chazelle, Bernard
N1 - Copyright: Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2008/1/1
Y1 - 2008/1/1
N2 - You haven't read it yet, but you can already tell this article is going to be one long jumble of words, numbers, and punctuation marks. Indeed, but look at it differently, as a text classifier would, and you will see a single point in high dimension, with word frequencies acting as coordinates. Or take the background on your flat panel display: a million colorful pixels teaming up to make quite a striking picture. Yes, but also one single point in 10^6-dimensional space - that is, if you think of each pixel's RGB intensity as a separate coordinate. In fact, you don't need to look hard to find complex, heterogeneous data encoded as clouds of points in high dimension. They routinely surface in applications as diverse as medical imaging, bioinformatics, astrophysics, and finance.
AB - You haven't read it yet, but you can already tell this article is going to be one long jumble of words, numbers, and punctuation marks. Indeed, but look at it differently, as a text classifier would, and you will see a single point in high dimension, with word frequencies acting as coordinates. Or take the background on your flat panel display: a million colorful pixels teaming up to make quite a striking picture. Yes, but also one single point in 10^6-dimensional space - that is, if you think of each pixel's RGB intensity as a separate coordinate. In fact, you don't need to look hard to find complex, heterogeneous data encoded as clouds of points in high dimension. They routinely surface in applications as diverse as medical imaging, bioinformatics, astrophysics, and finance.
UR - http://www.scopus.com/inward/record.url?scp=37549028856&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=37549028856&partnerID=8YFLogxK
U2 - 10.1145/1327452.1327493
DO - 10.1145/1327452.1327493
M3 - Article
AN - SCOPUS:37549028856
SN - 0001-0782
VL - 51
SP - 115
JO - Communications of the ACM
JF - Communications of the ACM
IS - 1
ER -