City University of Hong Kong

CityU Institutional Repository >
3_CityU Electronic Theses and Dissertations >
ETD - Dept. of Electronic Engineering  >
EE - Master of Philosophy  >

Please use this identifier to cite or link to this item:

Title: Robust feature extraction techniques for speech recognition
Authors: Wong, Ka-fai
Department: Dept. of Electronic Engineering
Degree: Master of Philosophy
Issue Date: 1994
Publisher: City Polytechnic of Hong Kong
Subjects: Speech processing systems
Notes: 105, [7] leaves : ill. ; 30 cm.
Bibliography: leaves [4]-[6] (2nd group)
CityU Call Number: TK7882.S65 W65 1994
Thesis (M.Phil.)--City Polytechnic of Hong Kong, 1994
Type: Thesis
Abstract: The performance of conventional LPC-based speech recognition systems degrades rapidly under noisy environment. Recent research works concentrate in three areas to deal with this problem. They are the enhancement preprocessing, robust modeling, and reduction of mismatch in noise conditions between the testing and reference templates. In this thesis, all these three strategies are addressed. Second-order Volterra Prediction filter (SVP) is proposed as a new nonlinear modeling technique for speech. It exploits the inherent nonlinear characteristics inside speech by introducing quadratic kernel into the prediction analysis. It is applied as a preprocessor module since' it produces substantial signal-to-noise ratio (SNR) improvement in speech enhancement for speech of low input SNRs. Two-Sided Linear Prediction coding (TSLP) is proposed as a robust feature extraction technique. It utilizes both the past and future samples in prediction with symmetric or asymmetric weightings. The special pole structure produces sharp and enhanced spectral peaks which make TSLP successful in describing the characteristics of noisy speech. As a result, better spectral matching is achieved and the performance of noisy speech recognition can be improved The reduced rank subspace approach is adopted to minimize the mismatch in noise conditions for testing and training patterns. It partitions the vector space of noisy speech into signal subspace and noise subspace based on some rank determination methods. This approach can integrate with the TSLP model. More robust features can then be extracted by making use of the noise-resisted signal subspace. The new feature extraction techniques are compared to several benchmark algorithms using an isolated word speaker dependent Cantonese speech recognition system under noisy environment. Simulation results show that the new techniques provide better recognition rates than other benchmark algorithms.
Online Catalog Link:
Appears in Collections:EE - Master of Philosophy

Files in This Item:

File Description SizeFormat
fulltext.html159 BHTMLView/Open
abstract.html159 BHTMLView/Open

Items in CityU IR are protected by copyright, with all rights reserved, unless otherwise indicated.


Valid XHTML 1.0!
DSpace Software © 2013 CityU Library - Send feedback to Library Systems
Privacy Policy · Copyright · Disclaimer