Please use this identifier to cite or link to this item:
http://dspace.cityu.edu.hk/handle/2031/537
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chang, Matthew Chor Ming | |
dc.date.accessioned | 2006-01-20T06:26:16Z | |
dc.date.accessioned | 2017-09-19T08:51:31Z | |
dc.date.accessioned | 2019-02-12T06:53:44Z | - |
dc.date.available | 2006-01-20T06:26:16Z | |
dc.date.available | 2017-09-19T08:51:31Z | |
dc.date.available | 2019-02-12T06:53:44Z | - |
dc.date.issued | 2003 | |
dc.identifier.other | 2003csccm818 | |
dc.identifier.uri | http://144.214.8.231/handle/2031/537 | - |
dc.description.abstract | In this project, I study the problem of email clustering. I have implemented an email client, the CEMail system, to develop and test an email clustering algorithm that takes the special characteristics of email into account. I first identify the characteristics that distinguish email from other kinds of documents; these characteristics guide the design of an algorithm that classifies emails with high accuracy. I then apply the notion of resemblance to measure the similarity between emails, and classify the emails using the k-nearest neighbor classification model. I show the efficiency of the approach in terms of the supervised learning rate, compare it with the Naïve Bayesian classification method, and show that it eliminates heavy pre-processing while achieving high accuracy. I have realized the approach in Java as an application called the CEMail system, and tested it by clustering a corpus of incoming emails received from various companies over a period of time. The clustering accuracy was as high as 95%. | |
dc.format.extent | 164 bytes | |
dc.format.mimetype | text/html | |
dc.rights | This work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner. | |
dc.rights | Access is restricted to CityU users. | |
dc.title | Document clustering in email client system | en |
dc.contributor.department | Department of Computer Science | en |
dc.description.supervisor | Dr. C.K. Poon. First Reader: Dr. Y.T. Yu. Second Reader: Prof. Horace IP | |
Appears in Collections: | Computer Science - Undergraduate Final Year Projects |
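
The abstract above combines a resemblance measure with k-nearest neighbor classification. The following is a minimal Java sketch of that general idea, not the CEMail implementation, which this record does not describe. It assumes resemblance means the Jaccard overlap of word shingles (one common definition), and the class name `ResemblanceKnn`, the `Labelled` holder, and the parameters `w` (shingle width) and `k` (neighbor count) are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class ResemblanceKnn {

    // Hypothetical holder for a training email and its folder label.
    static final class Labelled {
        final String text;
        final String label;
        Labelled(String text, String label) {
            this.text = text;
            this.label = label;
        }
    }

    // Set of w-word shingles of the lower-cased text (assumed tokenization: whitespace).
    static Set<String> shingles(String text, int w) {
        Set<String> out = new HashSet<>();
        String trimmed = text.toLowerCase().trim();
        if (trimmed.isEmpty()) {
            return out;
        }
        String[] tokens = trimmed.split("\\s+");
        for (int i = 0; i + w <= tokens.length; i++) {
            out.add(String.join(" ", Arrays.copyOfRange(tokens, i, i + w)));
        }
        return out;
    }

    // Assumed resemblance: |S(a) ∩ S(b)| / |S(a) ∪ S(b)|, in [0, 1].
    static double resemblance(String a, String b, int w) {
        Set<String> sa = shingles(a, w);
        Set<String> sb = shingles(b, w);
        if (sa.isEmpty() && sb.isEmpty()) {
            return 0.0;
        }
        Set<String> inter = new HashSet<>(sa);
        inter.retainAll(sb);
        int union = sa.size() + sb.size() - inter.size();
        return (double) inter.size() / union;
    }

    // Majority vote among the k training emails most resembling the input.
    // Ties go to the label whose vote count reached the maximum first,
    // i.e. the one supported by nearer neighbors.
    static String classify(String email, List<Labelled> training, int k, int w) {
        List<Labelled> sorted = new ArrayList<>(training);
        sorted.sort(Comparator
                .comparingDouble((Labelled t) -> resemblance(email, t.text, w))
                .reversed());
        Map<String, Integer> votes = new HashMap<>();
        String best = null;
        for (Labelled t : sorted.subList(0, Math.min(k, sorted.size()))) {
            int v = votes.merge(t.label, 1, Integer::sum);
            if (best == null || v > votes.get(best)) {
                best = t.label;
            }
        }
        return best;
    }
}
```

Calling `ResemblanceKnn.classify(text, training, 3, 1)` returns the majority label among the three most similar training emails, or `null` when the training list is empty. Because resemblance is computed directly on raw token shingles, the sketch needs no stemming or feature-weighting step, which is in the spirit of the abstract's remark about avoiding heavy pre-processing.
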
Files in This Item:
File | Size | Format | |
---|---|---|---|
fulltext.html | 164 B | HTML | View/Open |
Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.