Intelligence Through the Lens of Interaction
| dc.contributor.advisor | Farhadi, Ali | |
| dc.contributor.advisor | Mottaghi, Roozbeh | |
| dc.contributor.author | Ehsani, kiana | |
| dc.date.accessioned | 2021-10-29T16:20:07Z | |
| dc.date.issued | 2021-10-29 | |
| dc.date.submitted | 2021 | |
| dc.description | Thesis (Ph.D.)--University of Washington, 2021 | |
| dc.description.abstract | In this thesis, I will discuss the problem of acquiring visual intelligence from the interaction, focusing on two aspects of visual understanding: (1) visual perception and (2) embodied intelligence. To address the first question, I designed experiments to learn visual representations by observing animals and humans interact with the visual world. Further, I investigated the idea of learning perception from hands-on interaction -- acquiring generalizable physical understanding by predicting the forces applied in an observed video and trying to replicate the motion observed in simulation, with no additional supervision provided. To address the second question, I discuss our findings on training intelligent embodied agents using interaction from two perspectives. I designed a training paradigm that enables learning-to-learn from interactions. This training regime helps us to continue to learn from our interactions even during inference time. Moreover, I introduce a visually rich object manipulation framework, ManipulaTHOR, which opens the gate for directly training embodied agents to interact intelligently in a physically realistic environment via low-level object manipulation and navigation. | |
| dc.embargo.lift | 2026-10-03T16:20:07Z | |
| dc.embargo.terms | Restrict to UW for 5 years -- then make Open Access | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.other | Ehsani_washington_0250E_23359.pdf | |
| dc.identifier.uri | http://hdl.handle.net/1773/47997 | |
| dc.language.iso | en_US | |
| dc.rights | none | |
| dc.subject | artificial intelligence | |
| dc.subject | computer vision | |
| dc.subject | embodied AI | |
| dc.subject | interaction | |
| dc.subject | perception | |
| dc.subject | Computer science | |
| dc.subject.other | Computer science and engineering | |
| dc.title | Intelligence Through the Lens of Interaction | |
| dc.type | Thesis |
