Repository logo

Recognition of phonemes In a continuous speech stream by means of PARCOR parameters In LPC vocoder

dc.contributor.advisorTakaya, Kunioen_US
dc.contributor.advisorChen, X. B. (Daniel)en_US
dc.contributor.committeeMemberKo, Seok-Bumen_US
dc.contributor.committeeMemberKarki, Rajeshen_US
dc.contributor.committeeMemberGander, Roberten_US
dc.creatorCui, Yingen_US 2007en_US
dc.description.abstractLinear Predictive Coding (LPC) has been used to compress and encode speech signals for digital transmission at a low bit rate. The Partial Correlation (PARCOR) parameter associated with LPC that represents a vocal tract model based on a lattice filter structure is considered for speech recognition. For the same purpose, the use of FIR coefficients and the frequency response of AR model were previously investigated. In this thesis, we investigate the mechanics of the speech production process in human beings and discuss the place and manner of articulation for each of the major phoneme classes of American English. Then we characterize some typical vowel and consonant phonemes by using the eighth order PARCOR parameter associated with LPC.This thesis explores a method to detect phonemes from a continuous stream of speech. The system being developed slides a time window of 16 ms and calculates PARCOR parameters continuously, feeding them to a phoneme classifier. The phoneme classifier is a supervised classifier that requires training. The training uses TIMIT speech database, which contains the recordings of 630 speakers of 8 major dialects of American English. The training data are grouped into the vowel group including phoneme [ae], [iy] and [uw] and the consonant group including [sh] and [f]. After the training, the decision rule is derived. We design two classifiers in this thesis, one is a vowel classifier and the other one is a consonant classifier, both of them use the maximum likelihood decision rule to classify unknown phonemes. The results of classification of vowel and consonant in a one-syllable word are shown in the thesis. The correct classification rate is 65:22% for the vowel group. The correct classification rate is 93:51% for the consonant group. The results indicate that PARCOR parameters have the potential capability to characterize the phoneme.en_US
dc.titleRecognition of phonemes In a continuous speech stream by means of PARCOR parameters In LPC vocoderen_US
dc.type.materialtexten_US Engineeringen_US Engineeringen_US of Saskatchewanen_US of Science (M.Sc.)en_US


Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
1.18 MB
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
905 B
Plain Text