Skip navigation
Run Run Shaw Library City University of Hong KongRun Run Shaw Library

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/8379
Full metadata record
DC FieldValueLanguage
dc.contributor.authorChi, Fung Cheungen_US
dc.date.accessioned2016-06-06T06:41:36Z
dc.date.accessioned2017-09-19T08:51:02Z
dc.date.accessioned2019-02-12T06:53:13Z-
dc.date.available2016-06-06T06:41:36Z
dc.date.available2017-09-19T08:51:02Z
dc.date.available2019-02-12T06:53:13Z-
dc.date.issued2015en_US
dc.identifier.other2015cscfc035en_US
dc.identifier.urihttp://144.214.8.231/handle/2031/8379-
dc.description.abstractMachine learning is a technique that builds classification and prediction models through learning from samples. It is proven to be useful in scientific research such as DNA pattern recognition and climate modeling. It is also adopted in many real life applications, including spam filtering, image searching and optical character recognition (OCR). Theoretically, the more samples being provided to a learning model, the more accurate the model can be. However, supervised learning requires that samples be provided along with their labels, which can be expensive to obtain in terms of the human power required for labeling tasks. It greatly hinders the adoption of machine learning in resource limited environment. Meanwhile, crowdsourcing allows requestors to obtain scalable workforce and services from a large crowd of people. Amazon Mechanical Turk (MTurk) is one popular online crowdsourcing platform which enables requestors to publish requests to more than 500 thousands registered workers. It has potential to solve the problem of sample labeling, but so far no integration of machine learning and crowdsourcing is implemented in a way that can serve general machine learning purposes. In this project, a machine learning framework named ActiveCrowd was designed and implemented to allow anyone who has basic programming knowledge to build machine learning model for general purposes. The framework adopted active learning technique and integrated scikit-learn, which is a superior machine learning library written in Python and published under BSD license, with Amazon Mechanical Turk as the label annotator in a low cost and efficient manner. The framework is able to reduce the implementation effort required for building machine learning models and makes the supervised learning process completely automated.en_US
dc.rightsThis work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner.en_US
dc.rightsAccess is restricted to CityU users.en_US
dc.titleActiveCrowd: Integrating Active Learning with Amazon Mechanical Turken_US
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.description.supervisorDr. Nutanong, Saranaen_US
Appears in Collections:Computer Science - Undergraduate Final Year Projects 

Files in This Item:
File SizeFormat 
fulltext.html146 BHTMLView/Open
Show simple item record


Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.

Send feedback to Library Systems
Privacy Policy | Copyright | Disclaimer