Skip navigation
Run Run Shaw Library City University of Hong KongRun Run Shaw Library

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/537
Full metadata record
DC FieldValueLanguage
dc.contributor.authorChang, Matthew Chor Ming
dc.date.accessioned2006-01-20T06:26:16Z
dc.date.accessioned2017-09-19T08:51:31Z
dc.date.accessioned2019-02-12T06:53:44Z-
dc.date.available2006-01-20T06:26:16Z
dc.date.available2017-09-19T08:51:31Z
dc.date.available2019-02-12T06:53:44Z-
dc.date.issued2003
dc.identifier.other2003csccm818
dc.identifier.urihttp://144.214.8.231/handle/2031/537-
dc.description.abstractIn this project, I study the problem of email clustering. I have implemented an email client system, CEMail system, for addressing and testing an email clustering algorithm with specially consideration of the email characteristics. I have first found out the characteristics of email characteristics which I can distinguish from other kinds of document clustering. The characteristics help to design an algorithm with a high accurate rate for specially classifying emails. I then apply the notion of resemblance to measure the similarity between emails. Then the emails are classified by using k-nearest neighbor classification model. I will show the efficiency on the supervised learning rate, the comparison with Naïve Bayesian classification method, the elimination of heavy pre-processing, and the high accurate rate by using the approach. I have realized the approach by implementing an application called CEMail system in Java. I have also tested it by clustering the incoming email corpus which is received from various companies for a period of time. I found the accurate rate of clustering is as high as 95%.
dc.format.extent164 bytes
dc.format.mimetypetext/html
dc.rightsThis work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner.
dc.rightsAccess is restricted to CityU users.
dc.titleDocument clustering in email client systemen
dc.contributor.departmentDepartment of Computer Scienceen
dc.description.supervisorDr. C.K. Poon. First Reader: Dr. Y.T. Yu. Second Reader: Pro. Horace IP
Appears in Collections:Computer Science - Undergraduate Final Year Projects 

Files in This Item:
File SizeFormat 
fulltext.html164 BHTMLView/Open
Show simple item record


Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.

Send feedback to Library Systems
Privacy Policy | Copyright | Disclaimer