RNA sequence assembly using base pair bond structure

dc.contributor.advisorEvans, Patricia
dc.contributor.authorFannoush, Nabil
dc.date.accessioned2023-03-01T16:27:55Z
dc.date.available2023-03-01T16:27:55Z
dc.date.issued2016
dc.date.updated2023-03-01T15:02:21Z
dc.description.abstractThe process of determining nucleic acid (DNA and RNA) sequences is affected by limitations in technology that limit the length of sequence that can be read at once. Longer sequences are thus read in pieces, and so an important part of the process is to re-assemble these pieces, using overlapping pieces from multiple copies. Sequence assembly is a complex computational problem, particularly if the target sequence is one that is hitherto unknown and so there is no reference that helps to assemble the sequence pieces correctly. The advent of a new generation of high throughput technology that can quickly read shorter sequences makes the assembly an even more critical step because of the increased likelihood of missing regions in the final re-assembled sequence. Current sequence assembly algorithms are designed to seek a best result, from a series of characters perspective, in the form of a shortest possible ‘superstring’. However, DNA, RNA and other biological sequences are more than just strings, and so a shortest possible string does not equate to how the pieces were originally ordered, which is ultimately the goal of sequence assembly. This work proposes an alternative approach of including known structure properties and conditions that govern the building and design of biological sequences to help the re-assembly of RNA sequences, and includes tests whose results show noticeable increase in accuracy of the sequence assembly result in comparison to past approaches and that using structure also enables some shorter sequences to be assembled from a single copy.
dc.description.copyright© Nabil Fannoush, 2016
dc.formattext/xml
dc.format.extentviii, 97 pages
dc.format.mediumelectronic
dc.identifier.urihttps://unbscholar.lib.unb.ca/handle/1882/13903
dc.language.isoen_CA
dc.publisherUniversity of New Brunswick
dc.rightshttp://purl.org/coar/access_right/c_abf2
dc.subject.disciplineComputer Science
dc.titleRNA sequence assembly using base pair bond structure
dc.typemaster thesis
thesis.degree.disciplineComputer Science
thesis.degree.fullnameMaster of Computer Science
thesis.degree.grantorUniversity of New Brunswick
thesis.degree.levelmasters
thesis.degree.nameM.C.S.

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
item.pdf
Size:
949.36 KB
Format:
Adobe Portable Document Format