Generalizable Object Tracking in Complex Real-World Scenes with Contextual Cues and Memory

dc.contributor.advisorHwang, Jenq-Neng
dc.contributor.authorYang, Cheng-Yen
dc.date.accessioned2026-04-20T15:28:23Z
dc.date.issued2026-04-20
dc.date.submitted2026
dc.descriptionThesis (Ph.D.)--University of Washington, 2026
dc.description.abstractMultiple-Object Tracking (MOT) serves as a cornerstone of computer vision, yet achieving robust data association remains a significant challenge in dynamic environments characterized by frequent occlusions. This dissertation investigates the strategic integration of spatial contextual cues and hierarchical memory to enhance tracking stability. To bridge the gap between camera and image space, we first analyze three distinct spatial perspectives: an extrinsic approach leveraging drone metadata for maritime scenarios, an intrinsic method utilizing self-calibration for multi-camera consistency, and a depth-aware modality to prioritize non-occluded objects in dense crowds. Building upon these spatial foundations, we leverage vision foundation models to introduce SAMURAI, a motion-aware zero-shot tracker, and SAMURAI++, a unified framework that reconciles tracking-by-detection and tracking-by-query paradigms. By maintaining dual-horizon memory—short-term and long-term—for each tracklet, this work achieves superior identity preservation and cross-domain generalizability without the need for task-specific fine-tuning. Collectively, these contributions demonstrate that the synergy of temporal memory and spatial context provides a robust trajectory toward generalizable object tracking in complex, real-world scenes.
dc.embargo.lift2028-04-09T15:28:23Z
dc.embargo.termsRestrict to UW for 2 years -- then make Open Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherYang_washington_0250E_29235.pdf
dc.identifier.urihttps://hdl.handle.net/1773/55498
dc.language.isoen_US
dc.rightsCC BY-NC
dc.subjectDeep Learning
dc.subjectObject Tracking
dc.subjectArtificial intelligence
dc.subjectComputer science
dc.subject.otherElectrical and computer engineering
dc.titleGeneralizable Object Tracking in Complex Real-World Scenes with Contextual Cues and Memory
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Yang_washington_0250E_29235.pdf
Size:
62.43 MB
Format:
Adobe Portable Document Format