Repository logo
 

Recognition of phonemes In a continuous speech stream by means of PARCOR parameters In LPC vocoder

dc.contributor.advisorTakaya, Kunioen_US
dc.contributor.advisorChen, X. B. (Daniel)en_US
dc.contributor.committeeMemberKo, Seok-Bumen_US
dc.contributor.committeeMemberKarki, Rajeshen_US
dc.contributor.committeeMemberGander, Roberten_US
dc.creatorCui, Yingen_US
dc.date.accessioned2007-01-12T08:44:18Zen_US
dc.date.accessioned2013-01-04T04:23:49Z
dc.date.available2007-01-15T08:00:00Zen_US
dc.date.available2013-01-04T04:23:49Z
dc.date.created2007-01en_US
dc.date.issued2007-01-15en_US
dc.date.submittedJanuary 2007en_US
dc.description.abstractLinear Predictive Coding (LPC) has been used to compress and encode speech signals for digital transmission at a low bit rate. The Partial Correlation (PARCOR) parameter associated with LPC that represents a vocal tract model based on a lattice filter structure is considered for speech recognition. For the same purpose, the use of FIR coefficients and the frequency response of AR model were previously investigated. In this thesis, we investigate the mechanics of the speech production process in human beings and discuss the place and manner of articulation for each of the major phoneme classes of American English. Then we characterize some typical vowel and consonant phonemes by using the eighth order PARCOR parameter associated with LPC.This thesis explores a method to detect phonemes from a continuous stream of speech. The system being developed slides a time window of 16 ms and calculates PARCOR parameters continuously, feeding them to a phoneme classifier. The phoneme classifier is a supervised classifier that requires training. The training uses TIMIT speech database, which contains the recordings of 630 speakers of 8 major dialects of American English. The training data are grouped into the vowel group including phoneme [ae], [iy] and [uw] and the consonant group including [sh] and [f]. After the training, the decision rule is derived. We design two classifiers in this thesis, one is a vowel classifier and the other one is a consonant classifier, both of them use the maximum likelihood decision rule to classify unknown phonemes. The results of classification of vowel and consonant in a one-syllable word are shown in the thesis. The correct classification rate is 65:22% for the vowel group. The correct classification rate is 93:51% for the consonant group. The results indicate that PARCOR parameters have the potential capability to characterize the phoneme.en_US
dc.identifier.urihttp://hdl.handle.net/10388/etd-01122007-084418en_US
dc.language.isoen_USen_US
dc.subjectLPCen_US
dc.subjectPARCORen_US
dc.titleRecognition of phonemes In a continuous speech stream by means of PARCOR parameters In LPC vocoderen_US
dc.type.genreThesisen_US
dc.type.materialtexten_US
thesis.degree.departmentElectrical Engineeringen_US
thesis.degree.disciplineElectrical Engineeringen_US
thesis.degree.grantorUniversity of Saskatchewanen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Science (M.Sc.)en_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis0115.pdf
Size:
1.18 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
905 B
Format:
Plain Text
Description: