Towards Better Generalization: Model, Data, and Explicit Knowledge
| dc.contributor.advisor | Farhadi, Ali | |
| dc.contributor.author | Bagherinezhad, Hessam | |
| dc.date.accessioned | 2020-10-26T20:41:09Z | |
| dc.date.available | 2020-10-26T20:41:09Z | |
| dc.date.issued | 2020-10-26 | |
| dc.date.submitted | 2020 | |
| dc.description | Thesis (Ph.D.)--University of Washington, 2020 | |
| dc.description.abstract | In this dissertation, I explore three ways to make models more generalizable. 1) Through explicit knowledge extraction. Explicit knowledge enables models to correct their predictions, and in some cases to break a complex task into smaller pieces where each can be trained with less amount of data. 2) Through reducing model complexity. It is known that over- parameterized complex Convolutional Neural Networks (CNNs) often overfit to the given training set, and are therefore less generalizable. In this dissertation, I explore redesigning convolutional layers that outperform standard CNNs under few shot training scenario. 3) Through making labels more informative. I study the current data labeling paradigm, and present how labels for a simple image classification task are noisy. Noisy labels contribute to less generalizability. This is due to the fact that our over-parameterized models overfit to the noisy signal that is specific to that training set; therefore, they act poorly on an unseen test set. For explicit knowledge extraction, I first explore estimating and modeling Newtonian physics of a scene, and then explore extracting information about sizes of objects without any supervision required. For reducing model complexity, I explore redesigning Convolutional layers to reduce their complexity by sharing a dictionary of vectors among different convolutions. For label noise reduction, I explore making the training more accurate by refining the labels of a dataset with a dynamic label generator, called Label Refinery. | |
| dc.embargo.terms | Open Access | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.other | Bagherinezhad_washington_0250E_22235.pdf | |
| dc.identifier.uri | http://hdl.handle.net/1773/46431 | |
| dc.language.iso | en_US | |
| dc.rights | CC BY | |
| dc.subject | Computer Vision | |
| dc.subject | Machine Learning | |
| dc.subject | Natural Language Processing | |
| dc.subject | Statistical Modeling | |
| dc.subject | Computer science | |
| dc.subject | Computer engineering | |
| dc.subject.other | Computer science and engineering | |
| dc.title | Towards Better Generalization: Model, Data, and Explicit Knowledge | |
| dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Bagherinezhad_washington_0250E_22235.pdf
- Size:
- 24 MB
- Format:
- Adobe Portable Document Format
