Object Recognition and Semantic Scene Labeling for RGB-D Data

Lai, Kevin Kar Wai

Object Recognition and Semantic Scene Labeling for RGB-D Data

dc.contributor.advisor	Fox, Dieter	en_US
dc.contributor.author	Lai, Kevin Kar Wai	en_US
dc.date.accessioned	2014-02-24T18:24:58Z
dc.date.available	2014-02-24T18:24:58Z
dc.date.issued	2014-02-24
dc.date.submitted	2013	en_US
dc.description	Thesis (Ph.D.)--University of Washington, 2013	en_US
dc.description.abstract	The availability of RGB-D (Kinect-like) cameras has led to an explosive growth of research on robot perception. RGB-D cameras provide high resolution (640 x 480) synchronized videos of both color (RGB) and depth (D) at 30 frames per second. This dissertation demonstrates the thesis that combining of RGB and depth at high frame rates is helpful for various recognition tasks including object recognition, object detection, and semantic scene labeling. We present the RGB-D Object Dataset, a large dataset of 250,000 RGB-D images of 300 objects in 51 categories, and 22 RGB-D videos of objects in indoor home and office environments. We introduce algorithms for object recognition in RGB-D images that perform category, instance, and pose recognition in a scalable manner. We also present HMP3D, an unsupervised feature learning approach for 3D point cloud data, and demonstrate that HMP3D can be used to learn hierarchies of features from different attributes including color, gradient, shape, and surface normal orientation. Finally, we present a scene labeling approach for scenes constructed from RGB-D videos. The approach uses features learned from both individual RGB-D images and 3D point clouds constructed from entire video sequences. Through these applications, this thesis demonstrates the importance of designing new features and algorithms that specifically utilize the advantages of RGB-D cameras over traditional cameras and range sensors.	en_US
dc.embargo.terms	No embargo	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.other	Lai_washington_0250E_12640.pdf	en_US
dc.identifier.uri	http://hdl.handle.net/1773/25070
dc.language.iso	en_US	en_US
dc.rights	Copyright is held by the individual authors.	en_US
dc.subject	object categorization; object detection; object recognition; RGB-D cameras; scene understanding; semantic scene labeling	en_US
dc.subject.other	Computer science	en_US
dc.subject.other	Robotics	en_US
dc.subject.other	Artificial intelligence	en_US
dc.subject.other	computer science and engineering	en_US
dc.title	Object Recognition and Semantic Scene Labeling for RGB-D Data	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Lai_washington_0250E_12640.pdf
Size:: 20.78 MB
Format:: Adobe Portable Document Format

Download

Collections

Computer science and engineering