Exploiting Image Resolution Holistically in Computer Vision Models

Loading...
Thumbnail Image

Authors

Yan, Eddie

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Modern computer vision systems are built upon a complex stack of hardware and software, from general purpose processors to specialized accelerators, and low level operator libraries to expressive deep learning frameworks. However, from this complexity arises many opportunities for optimization across the stack. From the lens of image resolution, a fundamental hyperparameter of computer vision, we propose methods for optimizing models and characterize the space of choices as introduced by the hyperparameter of resolution. In the process, we cover related topics such as object scale (as introduced by data augmentations), image storage (including methods for efficient multi-resolution storage), and deep learning kernel tuning. An understanding of these topics allows us to consider resolution with respect to deep learning choices holistically, enabling efficient inference from the metrics of computational cost, latency, accuracy, and storage bandwidth use. We describe the mechanisms by which we enable efficient inference according to these metrics, spanning kernel tuning, image data layout, and model architecture pipelines.

Description

Thesis (Ph.D.)--University of Washington, 2020

Citation

DOI