Skip navigation
Run Run Shaw Library City University of Hong KongRun Run Shaw Library

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/6429
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHung, Yuk Man (洪彧文)en_US
dc.date.accessioned2011-12-20T09:05:42Z
dc.date.accessioned2017-09-19T08:26:50Z
dc.date.accessioned2019-01-22T03:40:33Z-
dc.date.available2011-12-20T09:05:42Z
dc.date.available2017-09-19T08:26:50Z
dc.date.available2019-01-22T03:40:33Z-
dc.date.issued2011en_US
dc.identifier.citationHung, Y. M. (2011). Finding word senses in tagging system (Outstanding Academic Papers by Students (OAPS)). Retrieved from City University of Hong Kong, CityU Institutional Repository.
dc.identifier.other2011cshym471en_US
dc.identifier.urihttp://144.214.8.231/handle/2031/6429-
dc.descriptionNominated as OAPS (Outstanding Academic Papers by Students) paper by Department in 2011-12.
dc.description.abstractTagging is popular in blog and social website because a tag can search and describe a file instead of predefined category. However, people tend to give the tags liberally resulting in many similar, obsolete and ambiguous tags within the system. Searching efficiency can be severely reduced. Some situations also affect searching accuracy, such as the polysemous or synonymous tags. Although latent semantic indexing (LSI) can disambiguate the word senses but in slow computation time as the large dataset. An approach of combining random projection with LSI is proposed to speed up the performance. However, the result of a context-based document system shows that the performance is only improved in a smaller dataset. For this observation, combining random projection with LSI in tag-based system should be able to gain improvement as the dataset in tag-based system should be smaller than a context-based system. Also, to the best of my knowledge, this approach has not been experimented in tag-based system. In this project, the results show that LSI with random projection is able to reduce 34% running time with about 70% accuracy. Also, I attempt to apply adaptive folding-up algorithm to update SVD dynamically, but it does not always retrieve high accuracy result.en_US
dc.rightsThis work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner.en_US
dc.rightsAccess is unrestricted.en_US
dc.subjectSemantics -- Data processing.
dc.subjectComputational linguistics.
dc.titleFinding word senses in tagging systemen_US
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.description.supervisorSupervisor: Dr. Poon, C K; First Reader: Dr. Ngo, C W; Second Reader: Dr. Wang, Len_US
Appears in Collections:Computer Science - Undergraduate Final Year Projects 
OAPS - Dept. of Computer Science 

Files in This Item:
File SizeFormat 
fulltext.html148 BHTMLView/Open
Show simple item record


Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.

Send feedback to Library Systems
Privacy Policy | Copyright | Disclaimer