Please use this identifier to cite or link to this item:
http://dspace.cityu.edu.hk/handle/2031/8956
Title: | Data mining and data clustering |
Authors: | Chung, Uen Yan |
Department: | Department of Electronic Engineering |
Issue Date: | 2018 |
Supervisor: | Supervisor: Prof. Chow, Tommy W S; Assessor: Dr. Tang, Wallace K S |
Abstract: | US movie industry, which is renowned as Hollywood, is the oldest and largest film industry in the world. Being the top amongst the world, United State has the largest market share in terms of revenue. Predicting the success of a movie, would be the primary task for marketers and theatres, which helps to plan marketing strategy after releasing a movie. Given that the online data is enormous and constantly updated nowadays, information would be available from online database. IMDB, which stands for "Internet Movie Database", is one of the popular database that gathered information of film and TV programs. In order to evaluate the approach to predict how successful a movie is, data of over 1000 US movies released between 2011 and 2016 would be collected from IMDB by data mining techniques. In this project, the movie gross would be the target data that can indicate the success of a movie. The extracted data, which includes MPAA rating, genre and user rating, would be used to predict movie gross by three classification methods - Naïve Bayes Classification, logistic regression and support vector machine. |
Appears in Collections: | Electrical Engineering - Undergraduate Final Year Projects |
Files in This Item:
File | Size | Format | |
---|---|---|---|
fulltext.html | 148 B | HTML | View/Open |
Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.