TY - GEN
T1 - Adaptive subgradient methods for online learning and stochastic optimization
AU - Duchi, John
AU - Hazan, Elad
AU - Singer, Yoram
PY - 2010
Y1 - 2010
N2 - We present a new family of subgradient methods that dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradient-based learning. The adaptation, in essence, allows us to find needles in haystacks in the form of very predictive yet rarely observed features. Our paradigm stems from recent advances in online learning which employ proximal functions to control the gradient steps of the algorithm. We describe and analyze an apparatus for adaptively modifying the proximal function, which significantly simplifies the task of setting a learning rate and results in regret guarantees that are provably as good as the best proximal function that can be chosen in hindsight. We corroborate our theoretical results with experiments on a text classification task, showing substantial improvements for classification with sparse datasets.
AB - We present a new family of subgradient methods that dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradient-based learning. The adaptation, in essence, allows us to find needles in haystacks in the form of very predictive yet rarely observed features. Our paradigm stems from recent advances in online learning which employ proximal functions to control the gradient steps of the algorithm. We describe and analyze an apparatus for adaptively modifying the proximal function, which significantly simplifies the task of setting a learning rate and results in regret guarantees that are provably as good as the best proximal function that can be chosen in hindsight. We corroborate our theoretical results with experiments on a text classification task, showing substantial improvements for classification with sparse datasets.
UR - http://www.scopus.com/inward/record.url?scp=80052423377&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80052423377&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:80052423377
SN - 9780982252925
T3 - COLT 2010 - The 23rd Conference on Learning Theory
SP - 257
EP - 269
BT - COLT 2010 - The 23rd Conference on Learning Theory
T2 - 23rd Conference on Learning Theory, COLT 2010
Y2 - 27 June 2010 through 29 June 2010
ER -