Automated Parallelization to Improve Usability and Efficiency of Distributed Neural Network Training

dc.contributor.advisorFukuda, Munehiro
dc.contributor.authorGrabaskas, Nathaniel J
dc.date.accessioned2018-04-24T22:16:44Z
dc.date.available2018-04-24T22:16:44Z
dc.date.issued2018-04-24
dc.date.submitted2018
dc.descriptionThesis (Master's)--University of Washington, 2018
dc.description.abstractThe recent success of Deep Neural Networks (DNNs) [1] has triggered a race to build larger and larger DNNs [2]; however, a known limitation is the training speed [3]. To solve this speed problem, distributed neural network training has become an increasingly large area of research [4], [5]. Usability, the complexity for a machine learning or data scientist to implement distributed neural network training, is an aspect rarely considered, yet critical. There is strong evidence growing complexity has a direct impact on development effort, maintainability, and fault proneness of software [6]–[8]. We investigated, if automation can greatly reduce the implementation complexity of distributing neural network training across multiple devices without loss of computational efficiency when compared to manual parallelization. Experiments were conducted using Convolutional Neural Networks (CNN) and Multi-Layer Perceptron (MLP) networks to perform image classification on CIFAR-10 and MNIST datasets. Hardware consisted of an embedded, four node NVIDIA Jetson TX1 cluster. Torch Automatic Distributed Neural Network (TorchAD-NN) reduces the implementation complexity of data parallel neural network training by more than 90% and providing components, with near zero implementation complexity, to easily parallelize all or only select fully-connected neural layers.
dc.embargo.termsOpen Access
dc.format.mimetypeapplication/pdf
dc.identifier.otherGrabaskas_washington_0250O_18319.pdf
dc.identifier.urihttp://hdl.handle.net/1773/41725
dc.language.isoen_US
dc.rightsCC BY
dc.subjectAutomated
dc.subjectDistributed
dc.subjectNeural Networks
dc.subjectParallel
dc.subjectComputer science
dc.subject.otherComputing and software systems
dc.titleAutomated Parallelization to Improve Usability and Efficiency of Distributed Neural Network Training
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Grabaskas_washington_0250O_18319.pdf
Size:
1.36 MB
Format:
Adobe Portable Document Format