Skip navigation
Run Run Shaw Library City University of Hong KongRun Run Shaw Library

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/8303
Full metadata record
DC FieldValueLanguage
dc.contributor.authorChi, Fung Cheung (池鳳翔)en_US
dc.date.accessioned2016-03-07T09:12:47Z
dc.date.accessioned2017-09-19T08:26:52Z
dc.date.accessioned2019-01-22T03:40:35Z-
dc.date.available2016-03-07T09:12:47Z
dc.date.available2017-09-19T08:26:52Z
dc.date.available2019-01-22T03:40:35Z-
dc.date.issued2015en_US
dc.identifier.citationChi, F. C. (2015). ActiveCrowd: Integrating active learning with Amazon Mechanical Turk (Outstanding Academic Papers by Students (OAPS)). Retrieved from City University of Hong Kong, CityU Institutional Repository.en_US
dc.identifier.othercs2015-4514-cfc035en_US
dc.identifier.othercs2015-005en_US
dc.identifier.urihttp://144.214.8.231/handle/2031/8303-
dc.description.abstractMachine learning is a technique that builds classification and prediction models through learning from samples. It is proven to be useful in scientific research such as DNA pattern recognition and climate modeling. It is also adopted in many real life applications, including spam filtering, image searching and optical character recognition (OCR). Theoretically, the more samples being provided to a learning model, the more accurate the model can be. However, supervised learning requires that samples be provided along with their labels, which can be expensive to obtain in terms of the human power required for labeling tasks. It greatly hinders the adoption of machine learning in resource limited environment. Meanwhile, crowdsourcing allows requestors to obtain scalable workforce and services from a large crowd of people. Amazon Mechanical Turk (MTurk) is one popular online crowdsourcing platform which enables requestors to publish requests to more than 500 thousands registered workers. It has potential to solve the problem of sample labeling, but so far no integration of machine learning and crowdsourcing is implemented in a way that can serve general machine learning purposes. In this project, a machine learning framework named ActiveCrowd was designed and implemented to allow anyone who has basic programming knowledge to build machine learning model for general purposes. The framework adopted active learning technique and integrated scikit-learn, which is a superior machine learning library written in Python and published under BSD license, with Amazon Mechanical Turk as the label annotator in a low cost and efficient manner. The framework is able to reduce the implementation effort required for building machine learning models and makes the supervised learning process completely automated.en_US
dc.rightsThis work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner.en_US
dc.rightsAccess is unrestricted.en_US
dc.titleActiveCrowd: integrating active learning with Amazon Mechanical Turken_US
dc.typeResearch Projecten_US
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.description.awardWon the Second Runner-up in the Final Year Project Competition 2014-2015 organized by the IEEE (Hong Kong) Computational Intelligence Chapter; the Third Prize in the Challenge Cup 2015 - Hong Kong University Students Extra-Curriculum Technology Contest under the category of Information Technology; and the Merit Award in Crossover 2015 Pan-Pearl River Delta Region Universities IT Project Competition (Hong Kong SAR Region).en_US
dc.description.courseCS4514 Projecten_US
dc.description.programmeBachelor of Science (Honours) in Computer Scienceen_US
dc.description.supervisorDr. Nutanong, Saranaen_US
Appears in Collections:OAPS - Dept. of Computer Science 
Student Works With External Awards 

Files in This Item:
File Description SizeFormat 
fulltext.html154 BHTMLView/Open
authorpage-Chi_Fung_Cheung.htm167 BHTMLView/Open
Show simple item record


Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.

Send feedback to Library Systems
Privacy Policy | Copyright | Disclaimer