Skip navigation
Run Run Shaw Library City University of Hong KongRun Run Shaw Library

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/9444
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHui, Ka Hoen_US
dc.date.accessioned2021-11-16T05:56:59Z-
dc.date.available2021-11-16T05:56:59Z-
dc.date.issued2021en_US
dc.identifier.other2021eehkh734en_US
dc.identifier.urihttp://dspace.cityu.edu.hk/handle/2031/9444-
dc.description.abstractGraphs are a data structure made of edges and nodes and both of them can be labelled by sequence, so that it is commonly used to represent a set of sequences, especially for some complex sequence like genome which makes up with a sequence of different proteins. Different sets of genome sequences may have some similar parts, by making use of the properties of the graph, a large graph can represent many combinations of genome sequences and each path in the graph spells a concatenated sequence, which reduces the redundancy part between each sequence. Besides, having an efficient way to accomplish align sequencing reads is very important for analysing genome assembly. In the project. I compared different data structures and algorithms related to graphs and familiar with different kinds of graph structures such as directed acyclic graphs and cyclic graphs. After that, I implemented a data structure to have a sequence analysis on a large graph such as retrieving the sequence from the graph by using a path. I have also developed two algorithms that can accomplish the align sequencing reads which including is sequence-to-graph matching and semi-global sequence-to-graph alignment by considering the time complexity. For the sequence-to-Graph Matching, the algorithm that I developed provide a time complexity O(|M||E|) to solve the problem, and for the sequence-to-Graph Alignment, the second algorithm I developed provide a time complexity O((|V|+|M|)|E|) to retrieve a path. (where |M| is the length of the searching sequence, |E| and |V| are the number of edges and nodes in the graph respectively.)en_US
dc.rightsThis work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner.en_US
dc.rightsAccess is restricted to CityU users.en_US
dc.titleDevelop an efficient path finding algorithm for very large graphsen_US
dc.contributor.departmentDepartment of Electrical Engineeringen_US
dc.description.supervisorSupervisor: Dr. Sun, Yanni; Assessor: Prof. Chen, Guanrongen_US
Appears in Collections:Electrical Engineering - Undergraduate Final Year Projects 

Files in This Item:
File SizeFormat 
fulltext.html148 BHTMLView/Open
Show simple item record


Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.

Send feedback to Library Systems
Privacy Policy | Copyright | Disclaimer