Drexel University Home Pagewww.drexel.edu DREXEL UNIVERSITY LIBRARIES HOMEPAGE >>

iDEA: Drexel E-repository and Archives > Drexel Theses and Dissertations > Drexel Theses and Dissertations > New approaches to improving organisms detection and gene prediction in metagenomes

Please use this identifier to cite or link to this item: http://hdl.handle.net/1860/3541

Title: New approaches to improving organisms detection and gene prediction in metagenomes
Authors: Yok, Non G.
Keywords: Electric engineering;Molecular microbiology;Microorganisms--Molecular aspects;Gene prediction
Issue Date: 11-Jul-2011
Abstract: This thesis investigates microorganism detection and gene prediction on the DNA level from environmental samples such as air, water, soil, and the human body. In applications such as homeland security, it is important to detect pathogens and their expressed genes in the air or ground. Chemical solutions have difficulty detecting trace amounts and are costly. Technologies such as DNA microarrays and next-generation sequencing are becoming cost-efficient, thus it is of interest to use these technologies to discover organisms and genes present in a sample, especially when in \fine" amounts. The thesis will impact organism and gene discovery, which will aid pharmaceutical and bio-fuel discovery, as well as new methods in forensics and homeland security. To enable microarrays to detect more organisms utilizing fewer probes, we discuss the advantage of using specially-designed compressive sensing microarrays (CSM), which is a DNA-based sensor array that operates using group testing and compressive sensing (CS) principles. We improve compressive sensing DNA microarrays by introducing probe picking algorithm that iteratively searches for the most optimal probe. For predicting reads from next-generation sequencing of DNA, we introduce a new hybrid algorithm, Homology-Abinitio, which combines ab-initio annotation, using improved models, with sequence similarity comparisons. After benchmarking well-known gene prediction programs for metagenomic reads { GeneMark, MetaGeneAnnotator (MGA) and Orphelia { we then combined them to gain better accuracy (f-measure), especially for short reads. In addition, we show how using metatran-scriptomic data can significantly improve gene prediction specificity. Each previous algorithm and Homology-Abinitio is tested with the most diverse test set to-date, including genes from 96 species.
URI: http://hdl.handle.net/1860/3541
Appears in Collections:Drexel Theses and Dissertations

Files in This Item:

File Description SizeFormat
Yok_Non.pdf2.37 MBAdobe PDFView/Open
View Statistics

Items in iDEA are protected by copyright, with all rights reserved, unless otherwise indicated.


Valid XHTML 1.0! iDEA Software Copyright © 2002-2010  Duraspace - Feedback