Principal Curves in High Dimensions

dc.contributor.advisorMeila, Marina
dc.contributor.authorMilchgrub, Alon
dc.date.accessioned2021-10-29T16:20:00Z
dc.date.available2021-10-29T16:20:00Z
dc.date.issued2021-10-29
dc.date.submitted2021
dc.descriptionThesis (Ph.D.)--University of Washington, 2021
dc.description.abstractPrincipal curves are curves passing through the "center" of a dataset and generalize the notion of principal components to non-linear curves, thus providing more meaningful insights to the structure of the data. SCMS is an elegant and robust method for recovering the principal curves of a dataset. However, its dependence on evaluating the Hessian of the density function at every step and the need for an abundance of probes make it prohibitively computationally intensive when dealing with large and high-dimensional data. In this paper we present L-SCMS, an approximation algorithm for SCMS which reduces the computational complexity from n^2 to n, where n is the dimensionality of the data. We also present MorsePCS, an extension to SCMS that provides with additional information on the structure of the data, while reducing the number of steps required. Finally, we provide empirical evaluation of our MorsePCS, indicating that the runtime guarantees do realize in practice, while maintaining the quality of the curves produced
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherMilchgrub_washington_0250E_23302.pdf
dc.identifier.urihttp://hdl.handle.net/1773/47989
dc.language.isoen_US
dc.rightsCC BY-SA
dc.subjectL-BFGS
dc.subjectlearning
dc.subjectmanifold
dc.subjectSCMS
dc.subjectComputer science
dc.subjectStatistics
dc.subject.otherComputer science and engineering
dc.titlePrincipal Curves in High Dimensions
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Milchgrub_washington_0250E_23302.pdf
Size:
1.22 MB
Format:
Adobe Portable Document Format