Towards Better Generalization: Model, Data, and Explicit Knowledge

Bagherinezhad, Hessam

Towards Better Generalization: Model, Data, and Explicit Knowledge

dc.contributor.advisor	Farhadi, Ali
dc.contributor.author	Bagherinezhad, Hessam
dc.date.accessioned	2020-10-26T20:41:09Z
dc.date.available	2020-10-26T20:41:09Z
dc.date.issued	2020-10-26
dc.date.submitted	2020
dc.description	Thesis (Ph.D.)--University of Washington, 2020
dc.description.abstract	In this dissertation, I explore three ways to make models more generalizable. 1) Through explicit knowledge extraction. Explicit knowledge enables models to correct their predictions, and in some cases to break a complex task into smaller pieces where each can be trained with less amount of data. 2) Through reducing model complexity. It is known that over- parameterized complex Convolutional Neural Networks (CNNs) often overfit to the given training set, and are therefore less generalizable. In this dissertation, I explore redesigning convolutional layers that outperform standard CNNs under few shot training scenario. 3) Through making labels more informative. I study the current data labeling paradigm, and present how labels for a simple image classification task are noisy. Noisy labels contribute to less generalizability. This is due to the fact that our over-parameterized models overfit to the noisy signal that is specific to that training set; therefore, they act poorly on an unseen test set. For explicit knowledge extraction, I first explore estimating and modeling Newtonian physics of a scene, and then explore extracting information about sizes of objects without any supervision required. For reducing model complexity, I explore redesigning Convolutional layers to reduce their complexity by sharing a dictionary of vectors among different convolutions. For label noise reduction, I explore making the training more accurate by refining the labels of a dataset with a dynamic label generator, called Label Refinery.
dc.embargo.terms	Open Access
dc.format.mimetype	application/pdf
dc.identifier.other	Bagherinezhad_washington_0250E_22235.pdf
dc.identifier.uri	http://hdl.handle.net/1773/46431
dc.language.iso	en_US
dc.rights	CC BY
dc.subject	Computer Vision
dc.subject	Machine Learning
dc.subject	Natural Language Processing
dc.subject	Statistical Modeling
dc.subject	Computer science
dc.subject	Computer engineering
dc.subject.other	Computer science and engineering
dc.title	Towards Better Generalization: Model, Data, and Explicit Knowledge
dc.type	Thesis

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Bagherinezhad_washington_0250E_22235.pdf
Size:: 24 MB
Format:: Adobe Portable Document Format

Download

Collections

Computer science and engineering