Recognition of phonemes In a continuous speech stream by means of PARCOR parameters In LPC vocoder

Cui, Ying

Recognition of phonemes In a continuous speech stream by means of PARCOR parameters In LPC vocoder

dc.contributor.advisor	Takaya, Kunio	en_US
dc.contributor.advisor	Chen, X. B. (Daniel)	en_US
dc.contributor.committeeMember	Ko, Seok-Bum	en_US
dc.contributor.committeeMember	Karki, Rajesh	en_US
dc.contributor.committeeMember	Gander, Robert	en_US
dc.creator	Cui, Ying	en_US
dc.date.accessioned	2007-01-12T08:44:18Z	en_US
dc.date.accessioned	2013-01-04T04:23:49Z
dc.date.available	2007-01-15T08:00:00Z	en_US
dc.date.available	2013-01-04T04:23:49Z
dc.date.created	2007-01	en_US
dc.date.issued	2007-01-15	en_US
dc.date.submitted	January 2007	en_US
dc.description.abstract	Linear Predictive Coding (LPC) has been used to compress and encode speech signals for digital transmission at a low bit rate. The Partial Correlation (PARCOR) parameter associated with LPC that represents a vocal tract model based on a lattice filter structure is considered for speech recognition. For the same purpose, the use of FIR coefficients and the frequency response of AR model were previously investigated. In this thesis, we investigate the mechanics of the speech production process in human beings and discuss the place and manner of articulation for each of the major phoneme classes of American English. Then we characterize some typical vowel and consonant phonemes by using the eighth order PARCOR parameter associated with LPC.This thesis explores a method to detect phonemes from a continuous stream of speech. The system being developed slides a time window of 16 ms and calculates PARCOR parameters continuously, feeding them to a phoneme classifier. The phoneme classifier is a supervised classifier that requires training. The training uses TIMIT speech database, which contains the recordings of 630 speakers of 8 major dialects of American English. The training data are grouped into the vowel group including phoneme [ae], [iy] and [uw] and the consonant group including [sh] and [f]. After the training, the decision rule is derived. We design two classifiers in this thesis, one is a vowel classifier and the other one is a consonant classifier, both of them use the maximum likelihood decision rule to classify unknown phonemes. The results of classification of vowel and consonant in a one-syllable word are shown in the thesis. The correct classification rate is 65:22% for the vowel group. The correct classification rate is 93:51% for the consonant group. The results indicate that PARCOR parameters have the potential capability to characterize the phoneme.	en_US
dc.identifier.uri	http://hdl.handle.net/10388/etd-01122007-084418	en_US
dc.language.iso	en_US	en_US
dc.subject	LPC	en_US
dc.subject	PARCOR	en_US
dc.title	Recognition of phonemes In a continuous speech stream by means of PARCOR parameters In LPC vocoder	en_US
dc.type.genre	Thesis	en_US
dc.type.material	text	en_US
thesis.degree.department	Electrical Engineering	en_US
thesis.degree.discipline	Electrical Engineering	en_US
thesis.degree.grantor	University of Saskatchewan	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science (M.Sc.)	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: thesis0115.pdf
Size:: 1.18 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 905 B
Format:: Plain Text
Description:

Download

Collections

Graduate Theses and Dissertations