City University of Hong Kong

CityU Institutional Repository >
3_CityU Electronic Theses and Dissertations >
ETD - Dept. of Computer Science  >
CS - Master of Philosophy  >

Please use this identifier to cite or link to this item:

Title: Automatic extraction of learning object metadata (LOM) from HTML web pages
Other Titles: Zi dong hua ti qu wang ye zhong de xue xi wu jian yuan shu ju
Authors: Tang, Wai Yuen (鄧威遠)
Department: Dept. of Computer Science
Degree: Master of Philosophy
Issue Date: 2007
Publisher: City University of Hong Kong
Subjects: HTML (Document markup language)
Web sites
Notes: CityU Call Number: Z666.7.T36 2007
Includes bibliographical references (leaves 126-130)
Thesis (M.Phil.)--City University of Hong Kong, 2007
vii, 168 leaves : ill. (some col.) ; 30 cm.
Type: Thesis
Abstract: Obtaining learning resources from the Internet is common nowadays. However, locating relevant learning resources on the Internet for learning is difficult due to the loose structure of the web. Even with the help of search engines and keywords searching, the search results are too huge for manual selection and they are usually of poor relevancy. There is also no way to specify attributes, like keywords, author, type of media, etc. of a learning object for searching, not to mention its level of interactivity and difficulty. It is difficult to have a standard way for describing learning objects and allowing users to use these descriptions during searching. In order to solve these problems, learning technology standard such as IEEE LOM is emerged to provide a standard metadata set for describing learning resources, helping users to identify relevant learning objects easily. However, there are too many attributes in the standard which will make the metadata difficult to create and thus users reluctant to use. This thesis discusses the problems of locating relevant learning resources on the Internet; discusses the results of literature reviews on search engines and learning technology standards; analyzes the IEEE LOM and HTML information; introduces the methods for automating the extraction of LOM from HTML web pages; explains the design and implementation of the automatic extraction framework of LOM; reports the experiment for testing and evaluating the heuristic rules used in the automatic extraction framework of LOM; concludes the research project and explains the future extension of the automatic extraction framework of LOM.
Online Catalog Link:
Appears in Collections:CS - Master of Philosophy

Files in This Item:

File Description SizeFormat
fulltext.html159 BHTMLView/Open
abstract.html159 BHTMLView/Open

Items in CityU IR are protected by copyright, with all rights reserved, unless otherwise indicated.


Valid XHTML 1.0!
DSpace Software © 2013 CityU Library - Send feedback to Library Systems
Privacy Policy · Copyright · Disclaimer