Skip navigation
Run Run Shaw Library City University of Hong KongRun Run Shaw Library

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/9083
Full metadata record
DC FieldValueLanguage
dc.contributor.authorChan, Wing Sheungen_US
dc.date.accessioned2019-01-29T04:58:48Z
dc.date.accessioned2019-02-12T06:54:06Z-
dc.date.available2019-01-29T04:58:48Z
dc.date.available2019-02-12T06:54:06Z-
dc.date.issued2018en_US
dc.identifier.other2018cscws179en_US
dc.identifier.urihttp://144.214.8.231/handle/2031/9083-
dc.description.abstractDSE is a public examination for students to get into university. It plays a vital role in determining someone's career path and study path. Therefore, this project aims to analyze the most commonly examined topics for future public examination. By revealing the knowledge points and frequently examined formats, it is hoped that students can revise in a more efficient and effective way. There are total 30 years of Mathematics pastpapers to be analyzed. Throughout the project, three stages are involved, which is text mining, classifier building for topic classification and analysis, and question/topic prediction. Python is used for implementing the machine learning algorithm as it contains many essential packages on building the project. For the first two stages of project, Bag of Words model is introduced. Other techniques, for example, preprocessing the documents by removing stopwods and stemming; standardizing the mathematic equation format to increase the classifier's understanding of words; finding the relatively important keywords and knowledge points of each topic; customizing token patterns to leave distinguishable features in feature vector. To evaluate the system, two classifiers (logistic regression and multinomial Naïve Bayes) are employed with various parameters setting of distinct vectorizers. For the last stage, 2018 HKDSE mathematics paper is predicted. It reveals commonly examined question structure. Not just the implementation and result, limitations and future improvements of this project are also discussed.en_US
dc.titleExamination Paper Question Analysisen_US
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.description.supervisorSupervisor: Prof. Wang, Jianping; First Reader: Dr. Wong, Ka Chun Raymond; Second Reader: Dr. Yu, Yuen Taken_US
Appears in Collections:Computer Science - Undergraduate Final Year Projects 

Files in This Item:
File SizeFormat 
fulltext.html148 BHTMLView/Open
Show simple item record


Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.

Send feedback to Library Systems
Privacy Policy | Copyright | Disclaimer