Please use this identifier to cite or link to this item:
http://dspace.cityu.edu.hk/handle/2031/7897
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yao, Ting (姚霆) | en_US |
dc.date.accessioned | 2015-09-25T02:08:08Z | |
dc.date.accessioned | 2017-09-19T09:19:39Z | |
dc.date.accessioned | 2019-02-12T08:41:21Z | - |
dc.date.available | 2015-09-25T02:08:08Z | |
dc.date.available | 2017-09-19T09:19:39Z | |
dc.date.available | 2019-02-12T08:41:21Z | - |
dc.date.issued | 2014 | en_US |
dc.identifier.other | cs2015-004 | en_US |
dc.identifier.uri | http://144.214.8.231/handle/2031/7897 | - |
dc.description | CityU Call Number: QA76.575 .Y37 2014 | en_US |
dc.description | xiii, 147 leaves : ill. 30 cm. | en_US |
dc.description | Thesis (Ph.D.)--City University of Hong Kong, 2014. | en_US |
dc.description | Includes bibliographical references (leaves 134-145) | en_US |
dc.description.abstract | This thesis investigates the problem of multimedia search under the umbrella of knowledge transfer by considering three cases: 1) how to exploit visual patterns from the initial ranked list to boost search precision, 2) how to leverage the external knowledge as a prior to help the search, and 3) how to explore the largely available click-through data (i.e., crowdsourcing human intelligence) for annotation and search. A common practice for improving search performance is to rerank the initial visual documents returned from a search engine by seeking consensus from various visual features. We propose a new reranking algorithm, named circular reranking, that reinforces the mutual exchange of information across multiple modalities for improving search performance, following the philosophy that strong performing modality could learn from weaker ones, while weak modality does benefit from interacting with stronger ones. Technically, circular reranking conducts multiple runs of random walks through exchanging the ranking scores among different features in a cyclic manner. Moreover, we study several properties of circular reranking, including how and which order of information propagation should be configured to fully exploit the potential of modalities for reranking. For the transfer of external knowledge, we first systematically analyze the different factors that lead to the success and failure of transferring classifiers. A simple yet innovative and practical model is proposed for predicting the transfer from the clues such as the distribution shift of data, concept category and concept contextual relationship. Next, we develop the semi-supervised domain adaptation with subspace learning and transfer RankBoost algorithms for one-to-one domain adaptation and multiple-to-one domain adaptation, respectively. The former aims to jointly explore invariant low-dimensional structures across domains to correct data distribution mismatch and leverage available unlabeled target examples to exploit the underlying intrinsic information in the target domain. The later extends the generic RankBoost learning framework for transferring knowledge from multiple sources. To investigate the use of click-through data, we devise a novel video similarity measurement based on polynomial semantic indexing. Two mappings to project queries and video documents into a common latent space are learnt by minimizing the margin ranking loss of the observed query-video pairs on the click-through bipartite. Then the dot product in the latent space is taken as the similarity function between videos and the video similarity is further applied for three major tasks in video tagging: tag assignment, ranking, and enrichment. Later, to bridge the user intention gap and allow direct comparison of text queries and visual images, click-through-based cross-view learning approach is presented for image search. The objective is formalized as a latent space learning by jointly minimizing the distance between the mappings of query and image in the latent space and preserving the inherent structure in each original space. We evaluate all the proposed techniques on several large-scale real-world image and video datasets. Experimental evaluations demonstrate promising results of our techniques, and their advantages to be applied to various multimedia search applications. | en_US |
dc.publisher | City University of Hong Kong | en_US |
dc.rights | This work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner. | en_US |
dc.rights | Access is unrestricted. | en_US |
dc.subject.lcsh | Multimedia systems. | en_US |
dc.subject.lcsh | Database searching. | en_US |
dc.title | Multimedia search by self, external, and crowdsourcing knowledge | en_US |
dc.title.alternative | Duo mei ti sou suo : cong yuan shi dui xiang, xiang guan zi yuan dao qun ti zhi hui fen xi | en_US |
dc.title.alternative | 多媒體搜索 : 從原始對象, 相關資源到群體智慧分析 | en_US |
dc.type | thesis | en_US |
dc.contributor.department | Department of Computer Science | en_US |
dc.degree.discipline | Computer Science | en_US |
dc.degree.level | doctoral | en_US |
dc.degree.name | Doctor of Philosophy | en_US |
dc.description.award | Won the 2015 SIGMM Outstanding Ph.D. Thesis Award. | en_US |
dc.description.fulltext | Award winning work is available. | en_US |
dc.identifier.catalog | http://lib.cityu.edu.hk/record=b4693549 | en_US |
Appears in Collections: | Student Works With External Awards |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
abstract.html | 132 B | HTML | View/Open | |
fulltext.html | 132 B | HTML | View/Open | |
award_news.html | 123 B | HTML | View/Open |
Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.