Please use this identifier to cite or link to this item:
http://dspace.cityu.edu.hk/handle/2031/3651
Title: | Sketching and sampling techniques for colored range queries |
Authors: | Chow, Yacoob Wai Chung |
Department: | Department of Computer Science |
Issue Date: | 2006 |
Supervisor: | Dr. Poon C K. First Reader: Dr. Liu W Y. Second Reader: Prof. Jia Xiaohua |
Abstract: | The large amount of memory required by storing information is always a problem of many data administrator. Sometimes, data points are associated with categorical attributes and queries are interested in information about the categories of the points. We used colors to represent the categorical information. A colored range query asks for the information of the colors of points lying within a query region. To reduce storage and improve query response time, we apply synopsis data structure to summarize a very large colored vector to answer the colored range queries. This project will show the feasibility of answering colored range queries by synopsis data structures and discuss various implementation problems and solutions. L0 sketch and coordinated sample will be selected as the synopsis data structure for colored range count query in this project. We first test their performance on estimating the number of distinct non-zero elements in a vector; this is the original usage of them. Then we will evaluate their performances on answering the range queries in terms of approximate accuracy, memory usage and speed to compare which synopsis data structure is better. We also tried some variations on the L0 sketch to measure the effects of different setting. |
Appears in Collections: | Computer Science - Undergraduate Final Year Projects |
Files in This Item:
File | Size | Format | |
---|---|---|---|
fulltext.html | 164 B | HTML | View/Open |
Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.