Title: Finding word senses in tagging system
Authors: Hung, Yuk Man (洪彧文)
Department: Department of Computer Science
Issue Date: 2011
Supervisor: Dr. Poon, C K; First Reader: Dr. Ngo, C W; Second Reader: Dr. Wang, L
Subjects: Semantics -- Data processing.
Computational linguistics.
Description: Nominated as OAPS (Outstanding Academic Papers by Students) paper by Department in 2011-12.
Citation: Hung, Y. M. (2011). Finding word senses in tagging system (Outstanding Academic Papers by Students (OAPS)). Retrieved from City University of Hong Kong, CityU Institutional Repository.
Abstract: Tagging is popular on blogs and social websites because tags let users search for and describe a file without relying on predefined categories. However, people tend to assign tags liberally, which leaves many similar, obsolete, and ambiguous tags in the system and can severely reduce search efficiency. Polysemous and synonymous tags also hurt search accuracy. Latent semantic indexing (LSI) can disambiguate word senses, but it is slow on large datasets. Combining random projection with LSI has been proposed to speed it up; however, results on a context-based document system show that performance improves only on smaller datasets. Given this observation, combining random projection with LSI should yield an improvement in a tag-based system, since its dataset should be smaller than that of a context-based system. To the best of my knowledge, this approach has not previously been tested in a tag-based system. In this project, the results show that LSI with random projection reduces running time by 34% with about 70% accuracy. I also attempt to apply the adaptive folding-up algorithm to update the SVD dynamically, but it does not always give highly accurate results.
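Illustration: the abstract does not give implementation details, so the following is only a minimal sketch of the general idea of applying random projection before LSI, not the project's actual method. The toy tag-by-resource matrix, the function names, and the parameters `proj_dim` and `rank` are all assumptions made for this example. It projects the matrix to fewer columns with a Gaussian random matrix, runs a truncated SVD on the smaller result, and compares tags by cosine similarity in the reduced space.

```python
import numpy as np


def lsi_with_random_projection(A, proj_dim, rank, seed=0):
    """Return a (tags x rank) embedding of the tags-by-resources matrix A.

    Hypothetical helper: random projection first, then a truncated SVD (LSI).
    """
    rng = np.random.default_rng(seed)
    n_resources = A.shape[1]
    # Gaussian random projection; the 1/sqrt(proj_dim) scaling preserves
    # squared row lengths in expectation.
    R = rng.standard_normal((n_resources, proj_dim)) / np.sqrt(proj_dim)
    B = A @ R  # tags x proj_dim, cheaper to decompose than A
    U, s, _ = np.linalg.svd(B, full_matrices=False)
    # Keep the top `rank` singular directions as the latent tag space.
    return U[:, :rank] * s[:rank]


def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


# Toy data (assumed): rows are tags, columns are tagged resources.
tags = ["jaguar", "car", "auto", "cat"]
A = np.array(
    [
        [1, 1, 0, 1, 1, 0],  # "jaguar": used on both car and animal resources
        [1, 1, 1, 0, 0, 0],  # "car"
        [1, 1, 1, 0, 0, 0],  # "auto": same usage pattern as "car"
        [0, 0, 0, 1, 1, 1],  # "cat"
    ],
    dtype=float,
)

emb = lsi_with_random_projection(A, proj_dim=4, rank=2)
for i in range(len(tags)):
    for j in range(i + 1, len(tags)):
        print(f"{tags[i]:>6} ~ {tags[j]:<6} {cosine(emb[i], emb[j]):+.3f}")
```

Tags with identical usage patterns, such as "car" and "auto" here, receive identical embeddings, so their similarity is 1 up to rounding. Similarities between other pairs depend on the random seed and the chosen dimensions, which is one reason accuracy has to be measured against running time.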
Appears in Collections: Computer Science - Undergraduate Final Year Projects
OAPS - Dept. of Computer Science

Files in This Item:
fulltext.html (Size: 148 B; Format: HTML)

Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.