Skip navigation
Run Run Shaw Library City University of Hong KongRun Run Shaw Library

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/8216
Full metadata record
DC FieldValueLanguage
dc.contributor.authorXu, Yisi (許逸思)en_US
dc.date.accessioned2016-01-07T01:24:09Z
dc.date.accessioned2017-09-19T08:28:45Z
dc.date.accessioned2019-01-22T03:47:44Z-
dc.date.available2016-01-07T01:24:09Z
dc.date.available2017-09-19T08:28:45Z
dc.date.available2019-01-22T03:47:44Z-
dc.date.issued2015en_US
dc.identifier.citationXu, Y. (2015). Research studies on possible improvements to the Aho-Corasick string matching algorithm (Outstanding Academic Papers by Students (OAPS)). Retrieved from City University of Hong Kong, CityU Institutional Repository.en_US
dc.identifier.other2015eexy110en_US
dc.identifier.otheree2015-4382-xy110en_US
dc.identifier.urihttp://144.214.8.231/handle/2031/8216-
dc.descriptionNominated as OAPS (Outstanding Academic Papers by Students) paper by Department in 2015-16.en_US
dc.descriptionConference paper developed from this OAPS paper: Xu, Y., & Pao, D. (2015). Space-time tradeoff in the Aho-Corasick string matching algorithm. In 2015 IEEE Conference on Communications and Network Security (pp. 713-714). IEEE. doi: 10.1109/CNS.2015.7346899.en_US
dc.description.abstractFour data structures (AC-basic, AC-expanded, AC-bitVec and AC-compressed) are implemented in this project in order to realize Aho-Corasick (AC) algorithm. Statistics on space and time requirements of these four implementations on scanning various input files are collected and compared. In AC-expanded, a two-dimensional array is used to store the fully expanded transition rule table, which takes up the most memory space among the four. AC-basic uses linked-lists to save memory space, but the processing speed is lowered by this practice. AC-bitVec transforms the linked-lists to bit vectors to lower the time and memory requirement. AC-compressed is a reasonable improvement over the above three, in which the transition rule table is compressed by perfect hashing and elimination of transition edges. Comparing the results obtained by running these four versions, the processing speed of AC-compressed reaches 3.9 to 7.8 times that of AC-basic, while it is 34% to 83% of that of AC-expanded. Regarding memory space requirement, AC-compressed needs only 3.8% to 6.3% the space of AC-expanded, and 1.7 to 2.6 times of that of AC-basic. Thus, the proposed AC-compressed implementation can be served as a satisfying trade-off between memory and speed over the basic version and expanded version.en_US
dc.rightsThis work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner.en_US
dc.rightsAccess is unrestricted.en_US
dc.subjectMatching theory.en_US
dc.subjectComputer algorithms.en_US
dc.titleResearch Studies on Possible Improvements to the Aho-Corasick String Matching Algorithmen_US
dc.contributor.departmentDepartment of Electronic Engineeringen_US
dc.description.courseEE4382 Projecten_US
dc.description.programmeBachelor of Engineering (Honours) in Information Engineeringen_US
dc.description.supervisorSupervisor: Dr. PAO, Derek C W; Assessor: Dr. SO, H Cen_US
Appears in Collections:Electrical Engineering - Undergraduate Final Year Projects 
OAPS - Dept. of Electrical Engineering 

Files in This Item:
File Description SizeFormat 
fulltext.html145 BHTMLView/Open
conference_paper.html126 BHTMLView/Open
authorpage-Xu_Yisi.htm159 BHTMLView/Open
Show simple item record


Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.

Send feedback to Library Systems
Privacy Policy | Copyright | Disclaimer