Thèses sur le sujet « Gene network reconstruction »
Créez une référence correcte selon les styles APA, MLA, Chicago, Harvard et plusieurs autres
Consultez les 31 meilleures thèses pour votre recherche sur le sujet « Gene network reconstruction ».
À côté de chaque source dans la liste de références il y a un bouton « Ajouter à la bibliographie ». Cliquez sur ce bouton, et nous générerons automatiquement la référence bibliographique pour la source choisie selon votre style de citation préféré : APA, MLA, Harvard, Vancouver, Chicago, etc.
Vous pouvez aussi télécharger le texte intégral de la publication scolaire au format pdf et consulter son résumé en ligne lorsque ces informations sont inclues dans les métadonnées.
Parcourez les thèses sur diverses disciplines et organisez correctement votre bibliographie.
ACERBI, ENZO. « Continuos time Bayesian networks for gene networks reconstruction ». Doctoral thesis, Università degli Studi di Milano-Bicocca, 2014. http://hdl.handle.net/10281/52709.
Texte intégralFichtenholtz, Alexander Michael. « In silico bacterial gene regulatory network reconstruction from sequence ». Thesis, Boston University, 2012. https://hdl.handle.net/2144/32880.
Texte intégralPLEASE NOTE: Boston University Libraries did not receive an Authorization To Manage form for this thesis or dissertation. It is therefore not openly accessible, though it may be available by request. If you are the author or principal advisor of this work and would like to request open access for it, please contact us at open-help@bu.edu. Thank you.
DNA sequencing techniques have evolved to the point where one can sequence millions of bases per minute, while our capacity to use this information has been left behind. One particularly notorious example is in the area of gene regulatory networks. A molecular study of gene regulation proceeds one protein at a time, requiring bench scientists months of work purifying transcription factors and performing DNA footprinting studies. Massive scale options like ChIP-Seq and microarrays are a step up, but still require considerable resources in terms of manpower and materials. While computational biologists have developed methods to predict protein function from sequence, gene locations from sequence, and even metabolic networks from sequence, the space of regulatory network reconstruction from sequence remains virtually untouched. Part of the reason comes from the fact that the components of a regulatory interaction, such as transcription factors and binding sites, are difficult to detect. The other, more prominent reason, is that there exists no "recognition code" to determine which transcription factors will bind which sites. I've created a pipeline to reconstruct regulatory networks starting from an unannotated complete genomic sequence for a prokaryotic organism. The pipeline predicts necessary information, such as gene locations and transcription factor sequences, using custom tools and third party software. The core step is to determine the likelihood of interaction between a TF and a binding site using a black box style recognition code developed by applying machine learning methods to databases of prokaryotic regulatory interactions. I show how one can use this pipeline to reconstruct the virtually unknown regulatory network of Bacillus anthracis.
2031-01-01
Li, Song. « Integrate qualitative biological knowledge for gene regulatory network reconstruction with dynamic Bayesian networks ». [Ames, Iowa : Iowa State University], 2007.
Trouver le texte intégralSteiger, Edgar [Verfasser]. « Efficient Sparse-Group Bayesian Feature Selection for Gene Network Reconstruction / Edgar Steiger ». Berlin : Freie Universität Berlin, 2018. http://d-nb.info/1170876633/34.
Texte intégralKröger, Stefan. « Bioinformatic analyses for T helper cell subtypes discrimination and gene regulatory network reconstruction ». Doctoral thesis, Humboldt-Universität zu Berlin, 2017. http://dx.doi.org/10.18452/18122.
Texte intégralWithin the last two decades high-throughput gene expression screening technologies have led to a rapid accumulation of experimental data. The amounts of information available have enabled researchers to contrast and combine multiple experiments by synthesis, one of such approaches is called meta-analysis. In this thesis, we build a large gene expression data set based on publicly available studies for further research on T cell subtype discrimination and the reconstruction of T cell specific gene regulatory events. T cells are immune cells which have the ability to differentiate into subtypes with distinct functions, initiating and contributing to a variety of immune processes. To date, an unsolved problem in understanding the immune system is how T cells obtain a specific subtype differentiation program, which relates to subtype-specific gene regulatory mechanisms. We present an assembled expression data set which describes a specific T cell subset, regulatory T (Treg) cells, which can be further categorized into natural Treg (nTreg) and induced Treg (iTreg) cells. In our analysis we have addressed specific challenges in regulatory T cell research: (i) discriminating between different Treg cell subtypes for characterization and functional analysis, and (ii) reconstructing T cell subtype specific gene regulatory mechanisms which determine the differences in subtype-specific roles for the immune system. Our meta-analysis strategy combines more than one hundred microarray experiments. This data set is applied to a machine learning based strategy of extracting surface protein markers to enable Treg cell subtype discrimination. We identified a set of 41 genes which distinguish between nTregs and iTregs based on gene expression profile only. Evaluation of six of these genes confirmed their discriminative power which indicates that our approach is suitable to extract candidates for robust discrimination between experiment classes. Next, we identify gene regulatory interactions using existing reconstruction algorithms aiming to extend the number of known gene-gene interactions for Treg cells. We applied eleven GRN reconstruction tools based on expression data only and compared their performance. Taken together, our results suggest that the available methods are not yet sufficient to extend the current knowledge by inferring so far unreported Treg specific interactions. Finally, we present an approach of integrating multiple data sets based on different high-throughput technologies to reconstruct a subtype-specific GRN. We constructed a Th2 cell specific gene regulatory network of 100 genes. While 89 of these are known to be related to Th2 cell differentiation, we were able to attribute 11 new candidate genes with a function in Th2 cell differentiation. We show that our approach to data integration does, in principle, allow for the reconstruction of a complex network. Future availability of more and more consistent data may enable the use of the concept of GRN reconstruction to improve understanding causes and mechanisms of cellular differentiation in the immune system and beyond and, ultimately, their dysfunctions and diseases.
Chen, Wei, et 陈玮. « A factor analysis approach to transcription regulatory network reconstruction using gene expression data ». Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2012. http://hub.hku.hk/bib/B49617783.
Texte intégralpublished_or_final_version
Electrical and Electronic Engineering
Doctoral
Doctor of Philosophy
Henderson, David Allen. « Reconstruction of metabolic pathways by the exploration of gene expression data with factor analysis ». Diss., Virginia Tech, 2001. http://hdl.handle.net/10919/30089.
Texte intégralPh. D.
Kröger, Stefan [Verfasser], Ulf [Gutachter] Leser, Joachim [Gutachter] Selbig et Nils [Gutachter] Blüthgen. « Bioinformatic analyses for T helper cell subtypes discrimination and gene regulatory network reconstruction / Stefan Kröger ; Gutachter : Ulf Leser, Joachim Selbig, Nils Blüthgen ». Berlin : Humboldt-Universität zu Berlin, 2017. http://d-nb.info/118933108X/34.
Texte intégralAravena, Duarte Andrés Octavio. « Probabilistic and constraint based modelling to determine regulation events from heterogeneous biological data ». Phd thesis, Université Rennes 1, 2013. http://tel.archives-ouvertes.fr/tel-00988255.
Texte intégralMolnar, Istvan, David Lopez, Jennifer Wisecaver, Timothy Devarenne, Taylor Weiss, Matteo Pellegrini et Jeremiah Hackett. « Bio-crude transcriptomics : Gene discovery and metabolic network reconstruction for the biosynthesis of the terpenome of the hydrocarbon oil-producing green alga, Botryococcus braunii race B (Showa)* ». BioMed Central, 2012. http://hdl.handle.net/10150/610020.
Texte intégralWerhli, Adriano Velasque. « Reconstruction of gene regulatory networks from postgenomic data ». Thesis, University of Edinburgh, 2007. http://hdl.handle.net/1842/3198.
Texte intégralDeng, Wenping. « Algorithms for Reconstruction of Gene Regulatory Networks from High-Throughput Gene Expression Data ». Thesis, Michigan Technological University, 2019. http://pqdtopen.proquest.com/#viewpdf?dispub=13420080.
Texte intégralUnderstanding gene interactions in complex living systems is one of the central tasks in system biology. With the availability of microarray and RNA-Seq technologies, a multitude of gene expression datasets has been generated towards novel biological knowledge discovery through statistical analysis and reconstruction of gene regulatory networks (GRN). Reconstruction of GRNs can reveal the interrelationships among genes and identify the hierarchies of genes and hubs in networks. The new algorithms I developed in this dissertation are specifically focused on the reconstruction of GRNs with increased accuracy from microarray and RNA-Seq high-throughput gene expression data sets.
The first algorithm (Chapter 2) focuses on modeling the transcriptional regulatory relationships between transcription factors (TF) and pathway genes. Multiple linear regression and its regularized version, such as Ridge regression and LASSO, are common tools that are usually used to model the relationship between predictor variables and dependent variable. To deal with the outliers in gene expression data, the group effect of TFs in regulation and to improve the statistical efficiency, it is proposed to use Huber function as loss function and Berhu function as penalty function to model the relationships between a pathway gene and many or all TFs. A proximal gradient descent algorithm was developed to solve the corresponding optimization problem. This algorithm is much faster than the general convex optimization solver CVX. Then this Huber-Berhu regression was embedded into partial least square (PLS) framework to deal with the high dimension and multicollinearity property of gene expression data. The result showed this method can identify the true regulatory TFs for each pathway gene with high efficiency.
The second algorithm (Chapter 3) focuses on building multilayered hierarchical gene regulatory networks (ML-hGRNs). A backward elimination random forest (BWERF) algorithm was developed for constructing an ML-hGRN operating above a biological pathway or a biological process. The algorithm first divided construction of ML-hGRN into multiple regression tasks; each involves a regression between a pathway gene and all TFs. Random forest models with backward elimination were used to determine the importance of each TF to a pathway gene. Then the importance of a TF to the whole pathway was computed by aggregating all the importance values of the TF to the individual pathway gene. Next, an expectation maximization algorithm was used to cut the TFs to form the first layer of direct regulatory relationships. The upper layers of GRN were constructed in the same way only replacing the pathway genes by the newly cut TFs. Both simulated and real gene expression data were used to test the algorithms and demonstrated the accuracy and efficiency of the method.
The third algorithm (Chapter 4) focuses on Joint Reconstruction of Multiple Gene Regulatory Networks (JRmGRN) using gene expression data from multiple tissues or conditions. In the formulation, shared hub genes across different tissues or conditions were assumed. Under the framework of the Gaussian graphical model, JRmGRN method constructs the GRNs through maximizing a penalized log-likelihood function. It was formulated as a convex optimization problem, and then solved it with an alternating direction method of multipliers (ADMM) algorithm. Both simulated and real gene expression data manifested JRmGRN had better performance than existing methods.
Kentzoglanakis, Kyriakos. « Reconstructing gene regulatory networks : a swarm intelligence framework ». Thesis, University of Portsmouth, 2010. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.523619.
Texte intégralGroß, Torsten. « Network Inference from Perturbation Data : Robustness, Identifiability and Experimental Design ». Doctoral thesis, Humboldt-Universität zu Berlin, 2021. http://dx.doi.org/10.18452/22355.
Texte intégral'Omics' technologies provide extensive quantifications of components of biological systems but rarely characterize the interactions between them. To fill this gap, various network reconstruction methods have been developed over the past twenty years. Using perturbation data, these methods can deduce functional mechanisms in gene regulation, signal transduction, intra-cellular communication and many other cellular processes. Nevertheless, this reverse engineering problem remains essentially unsolved because inferred networks are often based on inapt assumptions, lack interpretability as well as a rigorous description of identifiability. To overcome these shortcoming, this thesis first presents a novel inference method which is based on a simple response logic. The underlying assumptions are so mild that the approach is suitable for a wide range of applications while also outperforming existing methods in standard benchmark data sets. For MAPK and PI3K signalling pathways in an adenocarcinoma cell line, it derived plausible network hypotheses, which explain distinct sensitivities of PI3K mutants to targeted inhibitors. Second, an intuitive maximum-flow problem is shown to describe identifiability of network interactions. This analytical result allows to devise identifiable effective network models in underdetermined settings and to optimize the design of costly perturbation experiments. Benchmarked on a database of human pathways, full network identifiability is obtained with less than a third of the perturbations that are needed in random experimental designs. Finally, the thesis presents mathematical advances within Modular Response Analysis (MRA), which is a popular framework to quantify network interaction strengths. It is shown that MRA can be approximated as an analytically solvable total least squares problem. This insight drastically reduces computational complexity, which allows to model much bigger networks and to handle novel large-scale perturbation data.
Pournara, Iosifina-Vasiliki. « Reconstructing gene networks by passive and active Bayesian learning ». Thesis, Birkbeck (University of London), 2005. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.417001.
Texte intégralThomas, Spencer Angus. « Synthesis, analysis and reconstruction of gene regulatory networks using evolutionary algorithms ». Thesis, University of Surrey, 2014. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.659110.
Texte intégralMalysheva, Valeriya. « Reconstruction of gene regulatory networks defining the cell fate transition processes ». Thesis, Strasbourg, 2016. http://www.theses.fr/2016STRAJ084/document.
Texte intégralThe cell fate acquisition is a highly complex phenomenon that involves a plethora of intrinsic and extrinsic instructive signals. However, despite the important progress in identification of key regulatory factors of this process, the mechanistic links between transcription factors, epigenome and chromatin structure which coordinate the regulation of cell differentiation and deregulation of gene networks during cell transformation are largely unknown. To address these questions for two model systems of cell fate transitions, namely the neuronal and endodermal cell differentiation induced by the morphogen retinoic acid and the stepwise tumorigenesis of primary human cells, we conducted integrative transcriptome, epigenome and chromatin architecture studies. Through extensive integration with thousands of available genomic data sets, we deciphered the gene regulatory networks of these processes and revealed new insights in the molecular circuitry of cell fate acquisition. The understanding of regulatory mechanisms that underlie the cell fate decision processes not only brings the fundamental understanding of cause-and-consequence relationships inside the cell, but also open the doors to the directed trans-differentiation
Ghanbari, Mahsa [Verfasser]. « Association measures and prior information in the reconstruction of gene networks / Mahsa Ghanbari ». Berlin : Freie Universität Berlin, 2016. http://d-nb.info/1104733757/34.
Texte intégralLiu, Jinhua [Verfasser]. « Bioinformatic Reconstruction of Gene Regulatory Networks Controlling EMT and Mesoderm Formation / Jinhua Liu ». Berlin : Freie Universität Berlin, 2020. http://d-nb.info/1218530537/34.
Texte intégralHasegawa, Takanori. « Reconstructing Biological Systems Incorporating Multi-Source Biological Data via Data Assimilation Techniques ». 京都大学 (Kyoto University), 2015. http://hdl.handle.net/2433/195985.
Texte intégralAit-Hamlat, Adel. « Reconstruction de réseaux de gènes à partir de données d'expression par déconvolution centrée autour des hubs ». Electronic Thesis or Diss., Sorbonne université, 2019. http://www.theses.fr/2019SORUS011.
Texte intégralGene regulatory networks (GRNs) are graphs in which nodes are genes and edges represent causal relationships from regulator genes, towards their downstream targets. One important topological property of GRNs is that a small number of their nodes have a large number of connections whereas the majority of the genes have few connections. The highly connected nodes are called hubs ; they allow any two nodes to be connected by relatively short paths in sparse networks. HubNeD (Hub-centered network deconvolution) is a novel method that exploits topological properties of GRNs to reconstruct them from steady state expression profiles. It works in three steps : firstly, a clustering step extracts genes that are considered solely regulated by grouping them in highly homogeneous co-regulation communities. Secondly, hub are inferred from the remaining genes, by analyzing the similarities of their correlation profiles to the genes in the co-regulations communities. Thirdly, an adjacency matrix is computed by a hub-centered deconvolution of the Pearson correlation scores. This last step penalizes direct connections between non-hubs, thus reducing the rate of false positives. The original strategy of preceding GRN reconstruction by a hub selection step, allows HubNeD to habe the highest performances on expression datasets associated with the two well established experimentally curated GRNs of E. Coli and Saccharomyces cerevisiae
江益志. « Integrating gene expression, SNP markers, and gene-gene interaction towards network reconstruction ». Thesis, 2010. http://ndltd.ncl.edu.tw/handle/97695667099035745199.
Texte intégralLai, Jhih-Siang, et 賴至祥. « Graph-Based Clustering Approaches for Gene Network Reconstruction ». Thesis, 2009. http://ndltd.ncl.edu.tw/handle/17481425002494051273.
Texte intégral國立臺灣大學
醫學工程學研究所
97
To understand regulatory relationships between genes in real life. Biologists often use RNA interference (RNAi) or knockout genes to observe the response in the real life system. Informationists try to reconstruct regulatory relationship between genes from mRNA expression profile by algorithms or mathematic models. There are several phases involved in gene regulation such as transcription, post-transcriptional modifications, translation, mRNA degradation and post-translational modifications .Time is essential for all these phases to be completed and many researches analyze regulation via these features. In this study, we use two methods to reconstruct regulatory relationships between genes. One is a graph partition algorithm named Normalized Cuts for partitioning off genes into functional gene network. The other method, PARE (Pattern Recognition Approach), an algorithm based on time-lagged non-linear feature of the profile, is to infer regulation between genes. In addition, we use yeast microarray to construct gene regulatory networks and check results from KEGG pathway database, BIOGRID interaction database and MIPS database. Comparing our F score result with Dynamic Bayesian Network developed by Kim, et al., it shows that our method performs better than theirs. Finally, we apply our method to a real case in yeast microarray in which yox1 and yhp1 are both deleted and we analyze its mRNA expression time profile. Although mechanisms between phases in cell cycle are not clear, yox1 and yhp1 are two genes known controlling duration of a cell in G1 phase by negative feedback. We successfully find networks associated with cell cycle and one of the networks is associated with cell mitosis. In the future, we hope to decipher more mechanisms between phases in cell cycle.
« Computational models for efficient reconstruction of gene regulatory network ». Thesis, 2011. http://library.cuhk.edu.hk/record=b6075380.
Texte intégralThesis (Ph.D.)--Chinese University of Hong Kong, 2011.
Includes bibliographical references (leaves 129-148).
Electronic reproduction. Hong Kong : Chinese University of Hong Kong, [2012] System requirements: Adobe Acrobat Reader. Available via World Wide Web.
Abstract also in Chinese.
Lin, Chung-Hsun, et 林仲訓. « Using Microarray Time Series Data and Gene Ontology for Gene Clustering and Network Reconstruction ». Thesis, 2013. http://ndltd.ncl.edu.tw/handle/83763564848362542147.
Texte intégral國立中山大學
資訊管理學系研究所
101
In recent years, using microarray time series data to reconstruct gene regulatory network, has become a very popular way. However, the number of these genes are usually very large. We want to rebuild before these genes do a proper clustering, which is the each other interaction between the genes will be divided in the same group. The way we use here is to combine multiple data sources. On the one hand avoid being affected by the impact of a single data source. When there is only one data source, data quality will have a great impact. On the other hand, we hope to have some clustering performance improvement, and improve the subsequent reconstruction of the accuracy of gene regulatory network. In our study, we combine two different types of data sources. One of source is microarray time series data, the other is the Gene Ontology. We quantify Gene Ontology, and combine with time series data. Finally, we use the partition clustering algorithm to cluster, and use Boolnet to reconstruct gene regulatory network. After our experiment, we can obtain more great performance when we use microarray time series data and Gene Ontology simultaneously. In the following reconstruction, when the clustering result is better, we can get a better reconstruction of gene regulatory network. Therefore, our method for clustering of gene is effective and feasible.
Richard, Guilhem. « Affecting the macrophage response to infection by integrating signaling and gene-regulatory networks ». Thesis, 2014. https://hdl.handle.net/2144/14270.
Texte intégralPadolina, Joanna Melinda. « Phylogenetic reconstruction of Phalaenopsis (Orchidaceae) using nuclear and chloroplast DNA sequence data and using Phalaenopsis as a natural system for assessing methods to reconstruct hybrid evolution in phylogenetic analyses ». 2006. http://hdl.handle.net/2152/20172.
Texte intégraltext
Chang, Chih-Jung, et 張志榮. « Gene Networks Reconstruction based on Structural Equation Modeling ». Thesis, 2007. http://ndltd.ncl.edu.tw/handle/21699798502766537048.
Texte intégral臺灣大學
醫學工程學研究所
96
With the continual progress of human genome researches, more and more genes have been found to be closely related to human diseases. Accordingly, exploration of genetic functions has become one of major foci in biotechnology researches. It is well known that each gene does not work alone. Instead, it may involve enormous complicated interactions among genes in a biological process. Because of the complexity of physiological and biochemical processes in biology, the relations between the genes and most diseases are not clear currently. Therefore, the ultimate goal of gene networks reconstruction is to analyze the regulatory mechanisms among genes and understand how genes involve in biological processes. Limited by the high cost of microarrays, most biological experiments can not offer a large number of observations for gene network reconstruction. To overcome this limitation, a new gene network model:linear dynamic factor model, which is based on structural equation modeling, is proposed in this study. Besides observed variables, linear dynamic factor model also incorporates hidden factors to depict regulations from proteins and other molecules that are not included in the gene networks but have influence on the gene networks. We simulated data from a 6-gene network with different observations to see the influence of the number of observations on the performance of the algorithm. We also applied the algorithm to microarray data to reconstruct the gene networks from focal adhesion pathway、SGS1 and its synthetic sick or lethal(SSL) partners and G2/M DNA damage checkpoint of Saccharomyces cerevisiae. For the simulated data with 14 observations, the performance of the algorithm is well;for the simulated data with 52 observations, the performance of the algorithm is better than that of the simulated data with 14 observations. For the microarray data, the sensitivity or true positive rate can be in the neighborhood of 50%.
« Reconstructing gene regulatory networks with new datasets ». 2013. http://library.cuhk.edu.hk/record=b5549309.
Texte intégral在我兩年碩士的研究中,我引入了一個新的概念微核糖核酸及其目標對向聚類(MTB) 運用了ceRNA 的假設,還提出算法,成功從微核糖核酸與信使核糖核酸的相互數據中找出一系列的MTB' 還利用GENCODE 項目上大量的微核糖核酸及信使核糖核酸的表達數據去驗証MTB 的概念。一方面,我從大量的表達數據中成功推斷出微核糖核酸與信使核糖核酸之間的相反關連、信使核糖核酸之間的正面關運和微核糖核酸之間的正面關連;另一方面,這些關連進一步肯定ceRNA 假設的真實性。此外,我提出一個從大量基因組中找出基因功能分析的方法,並在大量的MTB 的基因組中找出重要的基因註解。最後,我提出另一個MTB 概念的應用一新算法來預測微核糖核酸與信使核糖核酸的相互影響。總括而吉, MTB 概念從複雜且混亂的微核糖核酸與信使核糖核酸網絡中定義簡單且穩固的模姐,提供一個系統生物學分析微核糖核酸調節能力的方法。
The competing Endogenous RNA (ceRNA) hypothesis has become one of the hottest topics in bioinformatics research recently. Four papers related to the ceRNA hypothesis were published simultaneously in Cell in 2011, a top journal in life sciences. For most papers related to the ceRNA hypothesis, the corresponding studies have successfully validated the hypothesis with different individual examples, without a large-scale and comprehensive analysis.
In my Master of Philosophy study, a novel concept, called mi-RNA Target Bicluster (MTB), is introduced to model the ceRNA hypothesis. The MTBs are identified computationally from validated and/or predicted miRNA-mRNA interaction pairs. The MTB models were tested with the mRNAs and miRNAs expression data from the GENCODE Project. Statistically significant miRNA-mRNA anti-correlation, mRNA-mRNA correlation and miRNA-miRNA correlation in expression data are found, verifying the correlation relations among mRNAs and miRNAs stated in the ceRNA hypothesis with large-scale data support. Moreover, a novel large-scale functional enrichment analysis is performed, and the mRNAs selected by the MTBs are found to be biologically relevant. Besides, some new target prediction algorithms are suggested, as another application of the MTBs, are suggested. Overall, the concept of MTB defines simple and robust modules from the complex and noisy miRNA-mRNA network, suggesting ways for system biology analyses in miRNA-mediated regulations.
Detailed summary in vernacular field only.
Detailed summary in vernacular field only.
Yip, Kit Sang Danny.
Thesis (M.Phil.)--Chinese University of Hong Kong, 2013.
Includes bibliographical references (leaves [117]-126).
Electronic reproduction. Hong Kong : Chinese University of Hong Kong, [2012] System requirements: Adobe Acrobat Reader. Available via World Wide Web.
Abstracts also in Chinese.
Abstract --- p.i
Acknowledgement --- p.iv
Chapter 1 --- Introduction --- p.1
Chapter 1.1 --- Contributions --- p.1
Chapter 1.2 --- Thesis Outline --- p.2
Chapter 2 --- Background --- p.3
Chapter 2.1 --- Bioinformatics --- p.3
Chapter 2.2 --- Biological Background --- p.7
Chapter 2.2.1 --- The Central Dogma of Molecular Biology . --- p.7
Chapter 2.2.2 --- RNAs --- p.8
Chapter 2.2.3 --- Competing Endogenous RNA (ceRNA) hypothesis --- p.9
Chapter 2.2.4 --- Biological Considerations in Functional Enrichment Analysis --- p.11
Chapter 2.3 --- Computational Background --- p.12
Chapter 2.3.1 --- miRNA Genomic Annotation Prediction --- p.13
Chapter 2.3.2 --- miRNA Target Interaction Prediction --- p.14
Chapter 2.3.3 --- Applying Computational Algorithms on Related Problems --- p.16
Chapter 2.3.4 --- Algorithms in Functional Enrichment Analysis --- p.16
Chapter 2.4 --- Experiments and Data --- p.17
Chapter 2.4.1 --- miRNA Target Interactions --- p.17
Chapter 2.4.2 --- Expression Data --- p.18
Chapter 2.4.3 --- Annotation Datasets --- p.19
Chapter 2.5 --- Research Motivations --- p.20
Chapter 3 --- Definitions of miRNA Target Biclusters (MTB) --- p.22
Chapter 3.1 --- Representations --- p.22
Chapter 3.1.1 --- Binary Association Matrix Representation --- p.23
Chapter 3.1.2 --- Bipartite Graph Representation --- p.23
Chapter 3.1.3 --- Mathematical Representation --- p.24
Chapter 3.2 --- Concept of MTB --- p.24
Chapter 3.2.1 --- MTB Restrictive Type (Type R) --- p.27
Chapter 3.2.2 --- MTB Restrictive Type on miRNA (Type Rmi) --- p.31
Chapter 3.2.3 --- MTB Restrictive Type on mRNA (Type Rm) --- p.34
Chapter 3.2.4 --- MTB Restrictive and General Type (Type Rgen) --- p.37
Chapter 3.2.5 --- MTB Loose Type (Type L) --- p.44
Chapter 3.2.6 --- MTB Loose Type but restricts on miRNA (Type Lmi) --- p.47
Chapter 3.2.7 --- MTB Loose Type but restricts on mRNA (Type Lm) --- p.50
Chapter 3.2.8 --- MTB Loose and General Type (Type Lgen) --- p.53
Chapter 3.2.9 --- A General Definition on all Eight Types --- p.58
Chapter 3.2.10 --- Discussions --- p.60
Chapter 4 --- MTB Workflow in Checking Correlation Relations --- p.61
Chapter 4.1 --- MTB Workflow in Checking Correlation Relations --- p.61
Chapter 4.1.1 --- MTB Identification --- p.62
Chapter 4.1.2 --- Correlation Coefficients --- p.63
Chapter 4.1.3 --- Scoring Scheme --- p.64
Chapter 4.1.4 --- Background Construction --- p.65
Chapter 4.1.5 --- Wilcoxon Rank-sum Test --- p.66
Chapter 4.1.6 --- Preliminary Studies --- p.67
Chapter 4.2 --- miRNA-mRNA Anti-correlation in Expression Data --- p.68
Chapter 4.2.1 --- Interaction Datasets --- p.69
Chapter 4.2.2 --- Expression Datasets --- p.72
Chapter 4.2.3 --- Independence of the Choices of Datasets --- p.73
Chapter 4.2.4 --- Independence of the Types of MTBs --- p.76
Chapter 4.2.5 --- Independence of the Choices of Correlation Coefficients --- p.78
Chapter 4.2.6 --- Dependence on the Way to Score --- p.79
Chapter 4.2.7 --- Independence of theWay to Construct Background --- p.81
Chapter 4.2.8 --- Independence of Natural Bias in Datasets --- p.82
Chapter 4.3 --- mRNA-mRNA Correlation in Expression Data --- p.84
Chapter 4.3.1 --- Variations in the Analysis --- p.85
Chapter 4.3.2 --- Discussions --- p.87
Chapter 4.4 --- miRNA-miRNA Correlation in Expression Data --- p.88
Chapter 4.4.1 --- Variations in the Analysis --- p.89
Chapter 4.4.2 --- Discussions --- p.92
Chapter 5 --- Target Prediction Aided by MTB --- p.94
Chapter 5.1 --- Workflow in Target Prediction --- p.94
Chapter 5.2 --- Contingency Table Approach --- p.96
Chapter 5.2.1 --- One-tailed Hypothesis Testing --- p.97
Chapter 5.3 --- Ranked List Approach --- p.98
Chapter 5.3.1 --- Wilcoxon Signed Rank Test --- p.99
Chapter 5.4 --- Results and Discussions --- p.99
Chapter 6 --- Large-scale Functional Enrichment Analysis --- p.102
Chapter 6.1 --- Principles in Functional Enrichment Analysis --- p.102
Chapter 6.1.1 --- Annotation Files --- p.104
Chapter 6.1.2 --- Functional Enrichment Analysis on a gene --- p.set105
Chapter 6.1.3 --- Functional Enrichment Analysis on many gene sets --- p.106
Chapter 6.2 --- Results and Discussions --- p.107
Chapter 7 --- Future Perspectives and Conclusions --- p.112
Chapter 7.1 --- Applying MTB definition on other problems --- p.112
Chapter 7.2 --- Matrix Definitions and Optimization Problems --- p.113
Chapter 7.3 --- Non-binary association matrix problem settings --- p.114
Chapter 7.4 --- Limitations --- p.114
Chapter 7.5 --- Conclusions --- p.116
Bibliography --- p.117
Chapter A --- Publications --- p.127
Chapter A.1 --- Publications --- p.127
Whitehead, Dion [Verfasser]. « Reconstructing gene function and gene regulatory networks in prokaryotes / by Dion Whitehead ». 2005. http://d-nb.info/977558460/34.
Texte intégralSu, Chu-Hsien, et 蘇矩賢. « Reconstruction of Interaction Networks of Escherichia coli through Literature Mining of Gene-Gene Relations ». Thesis, 2014. http://ndltd.ncl.edu.tw/handle/80389978697198390403.
Texte intégral國立陽明大學
生物醫學資訊研究所
102
Previous studies of reconstructing interaction networks indicated that network-based methods represent the relationships between genes and gene products of the target organisms. In this study we used E. coli K-12 MG1655, an important model organism, as the target organism for reconstructing interaction networks. Interactions of E. coli large molecules are usually obtained from databases and literature. Most interactions in the databases are lacking of literature supports. It is challenging to retrieve traceable literature citations for these interactions of E. coli manually. We applied text mining methods to extract interactions from 310,378 abstracts of E. coli researches in PubMed databases and provide sentence-level annotations of the interactions. F-scores of 0.81, 0.86, and 0.93 were achieved for identification of gene regulations, physical interactions and signal transductions by text mining in random sampling evaluations. 1,084 interactions were identified after text mining extraction. We found that 394 of the 1,084 interactions were newly identified interactions comparing to collected interactions from the E. coli databases. These 394 newly identified interactions provided new insights and bridged the gaps in the interaction networks of E. coli. The precision of 52% was achieved for the identifications of interactions through text mining. We provided sentence-level annotations for 12% of collected interactions in the E. coli databases. We performed functional enrichment analysis of the genes involved in the newly identified interaction extracted by text mining. The enriched functional categories are DNA replication and repair, biofilm formation, and cell motility associated with RpoS-centered stress responses of E. coli. After combing interactions collected from the databases and extracted through text mining, we reconstructed integrated networks of E. coli. From the integrated networks, we found that the newly identified interactions filled the gaps between separated components of the interaction networks based on collected interactions from the databases. The newly identified interactions also led to the organizational changes of hierarchical structure of E. coli’s gene regulatory networks.