Repository logo
 

Object Detection Networks at the Edge: Hardware Optimization and Intelligent Transportation Systems Applications

dc.contributor.advisorKo, Seok-Bum
dc.contributor.committeeMemberKlymyshyn, David
dc.contributor.committeeMemberChen, Li
dc.contributor.committeeMemberChung, Chi
dc.contributor.committeeMemberTeng, Daniel
dc.contributor.committeeMemberWu, Fangxiang
dc.creatorYepez Rodriguez, Juan F
dc.creator.orcid0000-0001-6766-1127
dc.date.accessioned2021-11-17T17:11:57Z
dc.date.available2023-11-17T06:05:10Z
dc.date.created2021-10
dc.date.issued2021-11-17
dc.date.submittedOctober 2021
dc.date.updated2021-11-17T17:11:58Z
dc.description.abstractIn recent years, deep learning (DL) and especially Convolutional Neural Networks (CNNs) have become a key component of many computer vision systems and applications due to their demonstrated capability to accurately process visual information. Object detection is one of the most important and challenging problems capable of being solved by DL; in general, higher object detection accuracy can be achieved by DL compared to other techniques. However, DL tends to require expensive GPUs or cloud-based services (the latter requiring a high internet bandwidth, latency, and other associated costs), making DL applications traditionally very expensive to implement in practice. This thesis emphasizes the optimization of DL computation for object detection and proposes designs of real-time Intelligent Transportation Systems (ITS) applications at the edge using hardware accelerators. To optimize DL computation, a novel stride 2 Winograd method is proposed for deep neural network (DNN) inference optimization. The proposed method provides new algorithms that trade expensive multiplications for cheap additions, thereby increasing efficiency by vastly decreasing computational complexity. The proposed algorithms support 1D, 2D, and 3D input for CNNs. Additionally, a novel Processing Element (PE) is proposed to process stride one and two convolution in the same FPGA module. These algorithms, implemented using a GPU and an FPGA, are demonstrated to provide better efficiency compared to regular convolution implementations for a variety of kernels. Additionally, three ITS applications are proposed. The first application is a License Plate Localization (LPL) system constructed using an architecture comprised of bottleneck depth-separable convolutions with inverted residuals. The second proposed application is a novel two stage real-time deep CNN recognition system for decals issued by the Commercial Vehicle Safety Alliance (CVSA). The third proposed application is a novel three stage real-time deep learning-based edge system for hazardous materials (HAZMATs) recognition. The designed custom object detection architectures for ITS applications are capable of highly accurate real-time prediction on edge computing devices (Intel, Google, and/or NVIDIA), thus providing enormous cost and performance advantages compared to current implementations.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10388/13677
dc.subjectObject Detection, convolutional neural networks, edge device, hardware optimization, intelligent transportation systems
dc.titleObject Detection Networks at the Edge: Hardware Optimization and Intelligent Transportation Systems Applications
dc.typeThesis
dc.type.materialtext
local.embargo.terms2023-11-17
thesis.degree.departmentElectrical and Computer Engineering
thesis.degree.disciplineElectrical Engineering
thesis.degree.grantorUniversity of Saskatchewan
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy (Ph.D.)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
YEPEZRODRIGUEZ-DISSERTATION-2021.pdf
Size:
4.18 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.27 KB
Format:
Plain Text
Description: