Hybrid deterministic-stochastic methods for data fittingM. P. Friedlander and M. Schmidt
SIAM J. on Scientific Computing, 34(3), 2012 AbstractMany structured data-fitting applications require the solution of an optimization problem involving a sum over a potentially large number of measurements. Incremental gradient algorithms offer inexpensive iterations by sampling a subset of the terms in the sum; these methods can make great progress initially, but often slow as they approach a solution. In contrast, full-gradient methods achieve steady convergence at the expense of evaluating the full objective and gradient on each iteration. We explore hybrid methods that exhibit the benefits of both approaches. Rate-of-convergence analysis shows that by controlling the sample size in an incremental-gradient algorithm, it is possible to maintain the steady convergence rates of full-gradient methods. We detail a practical quasi-Newton implementation based on this approach. Numerical experiments illustrate its potential benefits. Reproducible researchThe following code facilitates the reproduction of all figures appearing in the paper. DownloadsMost scripts use pre-computed data files to save time. When a required data file is missing it is automatically regenerated. To regenerate all experiments, simply delete the intermediate files in the batching/results directory. Getting startedDownload the zip archive in a suitable directory and unzip. Next, start Matlab, change to the resulting hybrid directory and type addpath(genpath(pwd)) to set up all required paths. # Run demo, similar Figure 5.5: >> example_batching # Te-run the experiments in the paper (assumes existing results files removed): >> expBatching_runAll # Reproduce all the plots in the paper: >> expBatching_plotAll BibTeX
@article{FriedlanderSchmidt2012,
author = {M. P. Friedlander and M. Schmidt},
title = {Hybrid deterministic-stochastic methods for data fitting},
journal = {SIAM J. Scientific Computing},
volume = {34},
number = {3},
year = {2012}
}
|