<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
  <channel>
    <title>DSpace Collection:</title>
    <link>http://dspace.cityu.edu.hk:80/handle/2031/740</link>
    <description />
    <pubDate>Fri, 31 May 2013 18:28:09 GMT</pubDate>
    <dc:date>2013-05-31T18:28:09Z</dc:date>
    <item>
      <title>The development of an automatic real-time lipreading system</title>
      <link>http://dspace.cityu.edu.hk:80/handle/2031/4729</link>
      <description>Title: The development of an automatic real-time lipreading system
Authors: Wang, Shilin (王士林)
Abstract: Using visual information in automatic speech recognition has aroused the&#xD;
interest of many researchers in recent years because the visual information will help&#xD;
enhancing the robustness of the system. This thesis presents the studies of developing&#xD;
a real-time system for automatic speech recognition solely based on the visual cues&#xD;
from the lip shape and movement. Lip segmentation, modeling, visual feature&#xD;
extraction and recognition are the major issues of the system and new algorithms for&#xD;
these issues are presented in this thesis.&#xD;
For lip segmentation, most of the widely used methods are based on the color&#xD;
or intensity information. However, for the lip images with low contrast, these methods&#xD;
will not achieve satisfactory results. In our study, both the color information and&#xD;
spatial location are integrated into a fuzzy clustering framework. The lip image is&#xD;
represented in CIELAB and CIELUV color space, where the luminance and&#xD;
chromaticity information is separated and the distance between any two points in the&#xD;
color space is proportional to their perceptual color difference. By integrating the&#xD;
spatial information, the proposed algorithm can differentiate the pixels with similar&#xD;
color but located in different regions. From the experimental results, the proposed&#xD;
algorithm outperforms other lip segmentation techniques especially for images with&#xD;
low color contrast. An extension of the algorithm has also been developed to solve the&#xD;
lip segmentation problem with the presence of beards, which is regarded as a difficult&#xD;
problem for lip region segmentation.&#xD;
For the lip contour modeling and extraction, accuracy, robustness and&#xD;
efficiency are the primary concerns. A 16-point model is employed to describe the lip&#xD;
contour. Some geometric constraints are applied to ensure that the extracted lip&#xD;
contour is physically meaningful. Based on the membership distribution derived from&#xD;
the lip segmentation procedure, a region-based cost function is defined, which is&#xD;
much more robust than the edge-based and intensity-based cost functions. A point-driven&#xD;
optimization procedure with some fast implementation techniques is used for&#xD;
model fitting and thus the lip contour can be obtained in an efficient manner.&#xD;
A visual feature set containing the geometric parameters, lip shape descriptors,&#xD;
and inner mouth information is obtained from the lip model for visual speech&#xD;
recognition purpose. A spline representation is employed to translate the discrete-&#xD;
sampled visual features into the continuous domain. The spline coefficients in the&#xD;
same word class are constrained to have the same mean and covariance matrix and&#xD;
can be estimated from the training data by the EM algorithm. In the speaker&#xD;
independent recognition task, a multi-model approach is proposed to overcome the&#xD;
difficulty due to the large variation caused by different speakers. By comparing with&#xD;
the HMM, the proposed method gives better result especially when only limited&#xD;
training data is available.&#xD;
An automatic lipreading system has been implemented and running on a 1.9&#xD;
GHz PC. An accuracy of 96% for the speaker dependent recognition and 88% for the&#xD;
speaker independent recognition have been achieved. With the efficient&#xD;
implementation of all the algorithms, the system is able to process images at a rate&#xD;
higher than 25 frames/sec, leaving room for additional tasks in real-time applications.
Notes: xi, 134 leaves : ill. (some col.) ; 30 cm.; Thesis (Ph.D.)--City University of Hong Kong, 2004; Includes bibliographical references (leaves 120-127); CityU Call Number: TK7895.S65 W36 2004</description>
      <pubDate>Thu, 01 Jan 2004 00:00:00 GMT</pubDate>
      <guid isPermaLink="false">http://dspace.cityu.edu.hk:80/handle/2031/4729</guid>
      <dc:date>2004-01-01T00:00:00Z</dc:date>
    </item>
    <item>
      <title>Algorithmic approaches for IP address lookup and packet classification</title>
      <link>http://dspace.cityu.edu.hk:80/handle/2031/4356</link>
      <description>Title: Algorithmic approaches for IP address lookup and packet classification
Authors: Liu, Cutson Ngai-hang (廖毅恒)
Abstract: The basic function of an Internet router is to forward packets towards their destinations. As the applications in the Internet are evolving towards multimedia-oriented and QoS-aware, more complex analysis of the incoming packet is performed by the routers in addition to the IP address lookup. Routers are required to classify packets based on multiple header fields into equivalence classes called flows. The challenges for the IP address lookup and packet classification are driven by the following factors: (i) the rapid growth of communication bandwidth, (ii) the expected growth of the routing table and filter database, (iii) the deployment of IPv6, and (iv) fast incremental update. This thesis presents algorithmic approaches to solve these problems. The first contribution of this work is the design of an efficient hardware solution for IP address lookup. The problem is modeled as a searching problem on a binary-trie. The binary-trie is partitioned into fixed size non-overlapping sub-trees. Each sub-tree is represented using a bit-vector and can be searched in parallel for the best matching prefix. The address lookup is implemented using hardware pipeline with a throughput of one lookup per memory access. A distinguishing feature of our design is that routing table entries are not replicated in the data structure. Hence, table update can be done in constant time with only a few memory accesses. Our design can be extended to IPv6. The second contribution of this work is the design of a two-stage approach for packet classification. The first stage determines the best matching source and destination prefix pair and the second stage determines the highest priority matching filter by comparing the remaining fields against a short list of candidate rules in parallel. The first stage 2- dimenional search problem is reduced to a l-dimenional search problem by a process called filter decomposition. The decomposed filters are organized as a height-balanced search tree. The search operation is speeded up by parallel processing techniques to achieve a throughput of one packet per memory cycle. Our design is scalable to larger rule databases and IPv6. It also allows the incremental updates the data structures.
Notes: CityU Call Number: TK5105.875.I57 L5847 2005; Includes bibliographical references (leaves 86-90); Thesis (Ph.D.)--City University of Hong Kong, 2005; ix, 90 leaves : ill. ; 30 cm.</description>
      <pubDate>Sat, 01 Jan 2005 00:00:00 GMT</pubDate>
      <guid isPermaLink="false">http://dspace.cityu.edu.hk:80/handle/2031/4356</guid>
      <dc:date>2005-01-01T00:00:00Z</dc:date>
    </item>
  </channel>
</rss>

