Scaling Machine Learning via Prioritized Optimization

dc.contributor.advisor: Guestrin, Carlos
dc.contributor.advisor: Fazel, Maryam
dc.contributor.author: Johnson, Tyler Bridge
dc.date.accessioned: 2019-02-22T17:01:39Z
dc.date.available: 2019-02-22T17:01:39Z
dc.date.issued: 2019-02-22
dc.date.submitted: 2018
dc.description: Thesis (Ph.D.)--University of Washington, 2018
dc.description.abstract: To learn from large datasets, modern machine learning applications rely on scalable training algorithms. Typically such algorithms employ stochastic updates, parallelism, or both. This work develops scalable algorithms via a third approach: prioritized optimization. We first propose a method for prioritizing challenging tasks when training deep models. Our robust approximate importance sampling procedure (RAIS) speeds up stochastic gradient descent by sampling minibatches non-uniformly. By approximating the ideal sampling distribution using robust optimization, RAIS provides much of the benefit of exact importance sampling with little overhead and minimal hyperparameters. In the second part of this work, we develop strategies for prioritizing optimization when solving convex problems with piecewise linear structure. Our BlitzWS working set algorithm offers unique theoretical guarantees and solves several classic machine learning problems very efficiently in practice. We also propose a closely related safe screening test, BlitzScreen, which is state-of-the-art for safe screening in multiple ways. Our final contribution is a “stingy update” rule for coordinate descent. Our StingyCD algorithm prioritizes optimization variables by eliminating provably useless computation. StingyCD requires only simple changes to CD and results in significant speed-ups in practice.
dc.embargo.terms: Open Access
dc.format.mimetype: application/pdf
dc.identifier.other: Johnson_washington_0250E_19432.pdf
dc.identifier.uri: http://hdl.handle.net/1773/43259
dc.language.iso: en_US
dc.rights: none
dc.subject: Machine learning
dc.subject: Optimization
dc.subject: Electrical engineering
dc.subject: Computer science
dc.subject.other: Electrical engineering
dc.title: Scaling Machine Learning via Prioritized Optimization
dc.type: Thesis

Files

Original bundle

Name: Johnson_washington_0250E_19432.pdf
Size: 17.4 MB
Format: Adobe Portable Document Format