License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.FSTTCS.2017.2
URN: urn:nbn:de:0030-drops-83941
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2018/8394/
Go to the corresponding LIPIcs Volume Portal


Jain, Prateek ; Kakade, Sham M. ; Kidambi, Rahul ; Netrapalli, Praneeth ; Pillutla, Venkata Krishna ; Sidford, Aaron

A Markov Chain Theory Approach to Characterizing the Minimax Optimality of Stochastic Gradient Descent (for Least Squares)

pdf-format:
LIPIcs-FSTTCS-2017-2.pdf (0.4 MB)


Abstract

This work provides a simplified proof of the statistical minimax
optimality of (iterate averaged) stochastic gradient descent (SGD), for
the special case of least squares. This result is obtained by
analyzing SGD as a stochastic process and by sharply characterizing
the stationary covariance matrix of this process. The finite rate optimality characterization captures the
constant factors and addresses model mis-specification.

BibTeX - Entry

@InProceedings{jain_et_al:LIPIcs:2018:8394,
  author =	{Prateek Jain and Sham M. Kakade and Rahul Kidambi and Praneeth Netrapalli and Venkata Krishna Pillutla and Aaron Sidford},
  title =	{{A Markov Chain Theory Approach to Characterizing the Minimax Optimality  of Stochastic Gradient Descent  (for Least Squares)}},
  booktitle =	{37th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2017)},
  pages =	{2:1--2:10},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-055-2},
  ISSN =	{1868-8969},
  year =	{2018},
  volume =	{93},
  editor =	{Satya Lokam and R. Ramanujam},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2018/8394},
  URN =		{urn:nbn:de:0030-drops-83941},
  doi =		{10.4230/LIPIcs.FSTTCS.2017.2},
  annote =	{Keywords: Stochastic Gradient Descent, Minimax Optimality, Least Squares Regression}
}

Keywords: Stochastic Gradient Descent, Minimax Optimality, Least Squares Regression
Collection: 37th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2017)
Issue Date: 2018
Date of publication: 12.02.2018


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI