Please use this identifier to cite or link to this item:
http://dspace.cityu.edu.hk/handle/2031/6429
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Hung, Yuk Man (洪彧文) | en_US |
dc.date.accessioned | 2011-12-20T09:05:42Z | |
dc.date.accessioned | 2017-09-19T08:26:50Z | |
dc.date.accessioned | 2019-01-22T03:40:33Z | - |
dc.date.available | 2011-12-20T09:05:42Z | |
dc.date.available | 2017-09-19T08:26:50Z | |
dc.date.available | 2019-01-22T03:40:33Z | - |
dc.date.issued | 2011 | en_US |
dc.identifier.citation | Hung, Y. M. (2011). Finding word senses in tagging system (Outstanding Academic Papers by Students (OAPS)). Retrieved from City University of Hong Kong, CityU Institutional Repository. | |
dc.identifier.other | 2011cshym471 | en_US |
dc.identifier.uri | http://144.214.8.231/handle/2031/6429 | - |
dc.description | Nominated as OAPS (Outstanding Academic Papers by Students) paper by Department in 2011-12. | |
dc.description.abstract | Tagging is popular in blog and social website because a tag can search and describe a file instead of predefined category. However, people tend to give the tags liberally resulting in many similar, obsolete and ambiguous tags within the system. Searching efficiency can be severely reduced. Some situations also affect searching accuracy, such as the polysemous or synonymous tags. Although latent semantic indexing (LSI) can disambiguate the word senses but in slow computation time as the large dataset. An approach of combining random projection with LSI is proposed to speed up the performance. However, the result of a context-based document system shows that the performance is only improved in a smaller dataset. For this observation, combining random projection with LSI in tag-based system should be able to gain improvement as the dataset in tag-based system should be smaller than a context-based system. Also, to the best of my knowledge, this approach has not been experimented in tag-based system. In this project, the results show that LSI with random projection is able to reduce 34% running time with about 70% accuracy. Also, I attempt to apply adaptive folding-up algorithm to update SVD dynamically, but it does not always retrieve high accuracy result. | en_US |
dc.rights | This work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner. | en_US |
dc.rights | Access is unrestricted. | en_US |
dc.subject | Semantics -- Data processing. | |
dc.subject | Computational linguistics. | |
dc.title | Finding word senses in tagging system | en_US |
dc.contributor.department | Department of Computer Science | en_US |
dc.description.supervisor | Supervisor: Dr. Poon, C K; First Reader: Dr. Ngo, C W; Second Reader: Dr. Wang, L | en_US |
Appears in Collections: | Computer Science - Undergraduate Final Year Projects OAPS - Dept. of Computer Science |
Files in This Item:
File | Size | Format | |
---|---|---|---|
fulltext.html | 148 B | HTML | View/Open |
Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.