Visual speech for recognition system

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/602

Title:	Visual speech for recognition system
Authors:	Tang, Tsz Kin
Department:	Department of Computer Engineering and Information Technology
Issue Date:	2002
Supervisor:	Dr. Leung S H. Assessor: Dr. Leung C S
Abstract:	When performing speech recognition, we have to face the problem of noise around. The lip reading system is mainly a support for speech recognition. In this project, a system is developed to extract the user lip information from an image grabber, and use such information to provide a prediction of possible words that the user speak. In order to extract the required lips information, several parts are implemented to achieve the target. They are Lip Position location, Lip Clustering, and Recognition. The Lip Position initialization is used to find out the lip position from a grabbed image. It can help to reduce the complexity of Lip Clustering. After that, image is segment into lips and non-lips color. The method is Fuzzy clustering. The image is then fitted in a lips model to find the contour. Finally, it is the part of recognition. The Hidden Markov Model (HMM) was used to train and perform the recognition. Some parameter from the extracted contour is chosen for training the model. The result shows accurate recognition on words with special lips shape change. But for the words with similar lips movement will give wrongly recognition.
Appears in Collections:	Computer Engineering & Information Technology - Undergraduate Final Year Projects

Files in This Item:

File	Size	Format
fulltext.html	164 B	HTML	View/Open