Intelligence Through the Lens of Interaction

dc.contributor.advisorFarhadi, Ali
dc.contributor.advisorMottaghi, Roozbeh
dc.contributor.authorEhsani, kiana
dc.date.accessioned2021-10-29T16:20:07Z
dc.date.issued2021-10-29
dc.date.submitted2021
dc.descriptionThesis (Ph.D.)--University of Washington, 2021
dc.description.abstractIn this thesis, I will discuss the problem of acquiring visual intelligence from the interaction, focusing on two aspects of visual understanding: (1) visual perception and (2) embodied intelligence. To address the first question, I designed experiments to learn visual representations by observing animals and humans interact with the visual world. Further, I investigated the idea of learning perception from hands-on interaction -- acquiring generalizable physical understanding by predicting the forces applied in an observed video and trying to replicate the motion observed in simulation, with no additional supervision provided. To address the second question, I discuss our findings on training intelligent embodied agents using interaction from two perspectives. I designed a training paradigm that enables learning-to-learn from interactions. This training regime helps us to continue to learn from our interactions even during inference time. Moreover, I introduce a visually rich object manipulation framework, ManipulaTHOR, which opens the gate for directly training embodied agents to interact intelligently in a physically realistic environment via low-level object manipulation and navigation.
dc.embargo.lift2026-10-03T16:20:07Z
dc.embargo.termsRestrict to UW for 5 years -- then make Open Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherEhsani_washington_0250E_23359.pdf
dc.identifier.urihttp://hdl.handle.net/1773/47997
dc.language.isoen_US
dc.rightsnone
dc.subjectartificial intelligence
dc.subjectcomputer vision
dc.subjectembodied AI
dc.subjectinteraction
dc.subjectperception
dc.subjectComputer science
dc.subject.otherComputer science and engineering
dc.titleIntelligence Through the Lens of Interaction
dc.typeThesis

Files