Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.
We present a new family of subgradient methods that dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative grad...