Interpretable Data Phenotyping for Healthcare via Unsupervised Learning

dc.contributor.advisorTeredesai, Ankur
dc.contributor.authorAllen, Christine
dc.date.accessioned2020-08-14T03:23:33Z
dc.date.issued2020-08-14
dc.date.submitted2020
dc.descriptionThesis (Master's)--University of Washington, 2020
dc.description.abstractHealthcare applications of machine learning tend toward greater requirements for model transparency than most applications. Yet the often high dimensionality of the data presents a significant impediment to meeting this requirement, particularly as it relates to the underlying relationships contributing to an individual prediction. Thus emerged the concept of "data phenotypes", clinically relevant groupings that facilitate population statistics and reduce barriers in the development of quality machine learning models. However, the results of current phenotyping methods are often difficult to interpret, and they often require clarification from an experienced clinician to be useful. This is a problem for administration-level prediction problems in particular, for example Length of Stay prediction, because those developing the models are not commonly clinicians, and because the results of these models are often desired with a fast turnaround. With the above in mind, this thesis reviews the utility of four prominent phenotyping approaches: k-means, agglomerative clustering, non-negative matrix factorization, and non-negative tensor factorization. We propose variants of the four approaches with the goal of producing distinct feature membership. We then show that our proposals can produce easily understandable phenotypes at no detriment to prediction performance over some real healthcare tasks.
dc.embargo.lift2021-08-14T03:23:33Z
dc.embargo.termsRestrict to UW for 1 year -- then make Open Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherAllen_washington_0250O_21925.pdf
dc.identifier.urihttp://hdl.handle.net/1773/45774
dc.language.isoen_US
dc.rightsCC BY-NC
dc.subject
dc.subjectComputer science
dc.subject.otherComputer science and systems
dc.titleInterpretable Data Phenotyping for Healthcare via Unsupervised Learning
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Allen_washington_0250O_21925.pdf
Size:
1.33 MB
Format:
Adobe Portable Document Format