Repository logo
 

Superscalar RISC-V Processor with SIMD Vector Extension

dc.contributor.advisorKo, Seok-bum
dc.contributor.committeeMemberDinh, Anh
dc.contributor.committeeMemberMakaroff, Dwight
dc.contributor.committeeMemberBui, Francis
dc.creatorHe, Jiongrui
dc.date.accessioned2020-09-22T22:58:06Z
dc.date.available2021-09-22T06:05:10Z
dc.date.created2020-08
dc.date.issued2020-09-22
dc.date.submittedAugust 2020
dc.date.updated2020-09-22T22:58:06Z
dc.description.abstractWith the increasing number of digital products in the market, the need for robust and highly configurable processors rises. The demand is convened by the stable and extensible open-sourced RISC-V instruction set architecture. RISC-V processors are becoming popular in many fields of applications and research. This thesis presents a dual-issue superscalar RISC-V processor design with dynamic execution. The proposed design employs the global sharing scheme for branch prediction and Tomasulo algorithm for out-of-order execution. The processor is capable of speculative execution with five checkpoints. Data flow in the instruction dispatch and commit stages is optimized to achieve higher instruction throughput. The superscalar processor is extended with a customized vector instruction set of single-instruction-multiple-data computations to specifically improve the performance on machine learning tasks. According to the definition of the proposed vector instruction set, the scratchpad memory and element-wise arithmetic units are implemented in the vector co-processor. Different test programs are evaluated on the fully-tested superscalar processor. Compared to the reference work, the proposed design improves 18.9% on average instruction throughput and 4.92% on average prediction hit rate, with 16.9% higher operating clock frequency synthesized on the Intel Arria 10 FPGA board. The forward propagation of a convolution neural network model is evaluated by the standalone superscalar processor and the integration of the vector co-processor. The vector program with software-level optimizations achieves 9.53× improvement on instruction throughput and 10.18× improvement on real-time throughput. Moreover, the integration also provides 2.22× energy efficiency compared with the superscalar processor along.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10388/13040
dc.subjectprocessor
dc.subjectCNN
dc.subjectcomputer architecture
dc.titleSuperscalar RISC-V Processor with SIMD Vector Extension
dc.typeThesis
dc.type.materialtext
local.embargo.terms2021-09-22
thesis.degree.departmentElectrical and Computer Engineering
thesis.degree.disciplineElectrical Engineering
thesis.degree.grantorUniversity of Saskatchewan
thesis.degree.levelMasters
thesis.degree.nameMaster of Science (M.Sc.)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
HE-THESIS-2020.pdf
Size:
2.67 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.27 KB
Format:
Plain Text
Description: