Tesis sobre el tema "Annotation de génome"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte los 28 mejores tesis para su investigación sobre el tema "Annotation de génome".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Explore tesis sobre una amplia variedad de disciplinas y organice su bibliografía correctamente.
Daccord, Nicolas. "Analyse bioinformatique du génome et de l’épigénome du pommier". Thesis, Angers, 2018. http://www.theses.fr/2018ANGE0034/document.
Texto completoApple is one of the most consumed fruits in the world. Using the latest sequencing (PacBio) and optical mapping (BioNano) technologies, we have generated a high-quality de novo assembly of the apple (Malus domestica Borkh.) genome. We performed a gene annotation as well as a transposable element annotation to allow this assembly to be used as a reference genome. The highcontiguity of the assembly allowed to exhaustively detect the transposable elements, which represented over half the assembly, thus providing an unprecedented opportunity to investigate the uncharacterized regions of a tree genome. We also found that the apple genome is entirely duplicated as showed by the synteny links between chromosomes. Using Whole Genome Bisulfite Sequencing (WGBS) and the previously generated assembly, we produced genome-wide DNA methylation maps and showed a general correlation between DNA methylation next to genes and gene expression. Moreover, we identified several Differentially Methylated Regions (DMRs) between apple fruits and leaf methylomes associated to candidate genes that could be involved in agronomically relevant traits such as apple fruit development. Finally, we developped a complete and easyto- use pipeline which aim is to handle the complete treatment of WGBS data, from the reads mapping to the DMRs computing. It can handle datasets having a low number of biological replicates
Cleynen, Alice. "Approches statistiques en segmentation : application à la ré-annotation de génome". Phd thesis, Université Paris Sud - Paris XI, 2013. http://tel.archives-ouvertes.fr/tel-00913851.
Texto completoSevin, Emeric. "Annotation des petits éléments dans les génomes procaryotes : nouveaux outils informatiques et application à Sinorhizobium meliloti". Rennes 1, 2007. http://www.theses.fr/2007REN1S160.
Texto completoMany computational tools have been developed along with the automation of annotation. In spite of this and of the multiplication of sequenced prokaryotic genomes, small elements essential to our global understanding of systems remain poorly annotated. These comprise transcriptional regulation elements (promoters/terminators), non coding RNAs (ncRNAs), and peptidic genes, coding for proteins smaller shorter than 100 aa. Scanning the intergenic regions of Sinorhizobium meliloti with comparative genomics methods allowed us to identify 67 candidates, of which 14 were proven to be true ncRNAs. This analysis moreover revealed a sparse annotation of genes belonging to toxin-antitoxin (TA) systems. We thus developed a novel computational tool, based on their genomic features, for identifying them in prokaryotic genomes. Our results suggest that TA systems might be even more numerous than initially described. Finally we are currently validating a visualization tool for under- or over-represented n-mers in intergenic regions to help in the detection of regulatory elements
Beyne, Emmanuelle. "Règles de cohérence pour l'annotation génomique : développement et mise en œuvre in silico et in vivo". Bordeaux 1, 2008. https://tel.archives-ouvertes.fr/tel-00350902.
Texto completoBocs, Stéphanie. "(Ré)annotation de génomes procaryotes complets - Exploration de groupes de gènes chez les bactéries". Phd thesis, Université Pierre et Marie Curie - Paris VI, 2004. http://tel.archives-ouvertes.fr/tel-00008296.
Texto completoFlutre, Timothée. "L' annotation des éléments transposables par la compréhension de leur diversification". Paris 7, 2010. http://www.theses.fr/2010PA077239.
Texto completoTransposable elements are DNA sequences that can move and duplicata within genomes. They hence have a major impact on genome structure but also on the expression of neighbouring genes, notably via epigenetiç mechanisms. However, except for some model organisms for which reference sequences are available, the annotation of transposable elements corresponds to a bottleneck in the analysis of genomic sequences. Therefore, I started by comparing existing computer programs used in de novo approaches of transposable element identification. In this aim, I designed a test protocol on the genomes of Drosophila melanogaster and Arabidopsis thaliana. As a result, I proposed a de novo approach combining several tools, thus enabling the automatic recovery of a great number of reference sequences. Moreover, I showed that our approach highlighted the structural variations present within well-known families, thus reflecting the diversification of such families during their evolution. This approach was implemented in a package (REPET) making possible the analysis of transposable elements in numerous genomes from plants, insects and fungi among others. This work lead to a roadmap describing, from a practical point of view, how to annotate the transposable element content of any newly sequenced genome. Finally, I propose several perspectives, notably the simulation of the data required for the improvement of the detection algorithms, a way complementary to the modeling of transposable element dynamics
Morlot, Jean-Baptiste. "Annotation of the human genome through the unsupervised analysis of high-dimensional genomic data". Thesis, Paris 6, 2017. http://www.theses.fr/2017PA066641/document.
Texto completoThe human body has more than 200 different cell types each containing an identical copy of the genome but expressing a different set of genes. The control of gene expression is ensured by a set of regulatory mechanisms acting at different scales of time and space. Several diseases are caused by a disturbance of this system, notably some cancers, and many therapeutic applications, such as regenerative medicine, rely on understanding the mechanisms of gene regulation. This thesis proposes, in a first part, an annotation algorithm (GABI) to identify recurrent patterns in the high-throughput sequencing data. The particularity of this algorithm is to take into account the variability observed in experimental replicates by optimizing the rate of false positive and false negative, increasing significantly the annotation reliability compared to the state of the art. The annotation provides simplified and robust information from a large dataset. Applied to a database of regulators activity in hematopoiesis, we propose original results, in agreement with previous studies. The second part of this work focuses on the 3D organization of the genome, intimately linked to gene expression. This structure is now accessible thanks to 3D reconstruction algorithm from contact data between chromosomes. We offer improvements to the currently most efficient algorithm of the domain, ShRec3D, allowing to adjust the reconstruction according to the user needs
Claudel-Renard, Clotilde. "Inférence fonctionnelle et prédiction de voies métaboliques : application à la bactérie fixatrice d'azote Sinorhizobium meliloti". Toulouse 3, 2003. http://www.theses.fr/2003TOU30170.
Texto completoComplete genomes of bacteria are sequenced in growing number. At the same time, programs for systematic analysis of genes and proteins expression in different conditions are set up. The understanding of organisms functions requires the annotation of genes functions and the integration of that data into a functional diagram. Metabolic pathways constitute classes of functions allowing to tackle the integration issue. They are well identified in numerous organisms and are available to experimentation. In a first time, we have developed an automated method for enzyme function detection. This method, named PRIAM (PRofils pour l'Identification Automatique du Métabolisme), is based on the classification of enzymes in the ENZYME database and relies on sets of position-specific scoring matrices ("profiles"), automatically tailored for each ENZYME entry. Then, the method allows to identify enzymes in a complete genome and to visualize the results on KEGG graphs. In a second time, that method has been applied on the complete genome of the nitrogen fixing bacterium Sinorhizobium meliloti, in order to facilitate the interpretation of specific metabolic pathways of that symbiotic organism
Scalzitti, Nicolas. "Nouvelle stratégie d'annotation des génomes par l'utilisation d'algorithmes d'intelligence artificielle". Electronic Thesis or Diss., Strasbourg, 2021. http://www.theses.fr/2021STRAJ040.
Texto completoHigh-throughput genome sequencing projects produce a huge amount of raw biological data on a daily basis. However, the data are difficult to exploit if they are not annotated. Unfortunately, the currently available genome annotation programs are still too prone to prediction errors, making genome annotation one of the major challenges in bioinformatics. In this context, my thesis is organized around three connected topics: i) improving the prediction of eukaryotic protein-coding genes by focusing on splice sites ii) exploiting artificial intelligence algorithms (convolutional neural networks and evolutionary algorithms), iii) training with high quality data from a wide range of eukaryotic organisms (from humans to protists). The strategy developed consists in combining the processed and validated data set (G3PO) with the developed programs (Spliceator and SpliceSLEIA) in order to improve gene prediction by decreasing the error rate so that they no longer propagated in public databases. This work should contribute to a better understanding of organisms and their biological mechanisms
Blandin, Gaëlle. "Évolution des génomes de levures : analyse de la redondance et génomique comparative". Paris 7, 2001. http://www.theses.fr/2001PA077171.
Texto completoChelkha, Nisrine. "Etudes du génome de diverses amibes et de leurs interactions avec des virus géants et d’autres microorganismes". Thesis, Aix-Marseille, 2019. http://theses.univ-amu.fr.lama.univ-amu.fr/191122_CHELKHA_490trpga652cfvqep580b712k_TH.pdf.
Texto completoAcanthamoeba spp. are ubiquitous phagocytic protists in soil and water. Giant viruses, including mimiviruses, marseillevirus or pandoraviruses, are among Acanthamoeba-resistant microorganisms. They have a sympatric lifestyle with other microorganisms within amoebae, and many lateral sequence transfers have been described that involved Acanthamoeba castellanii. In addition, some Acanthamoeba species appear to have different levels of susceptibility to giant viruses.The objective of this work was to explore for the first time the gene repertoire of the genomes of 14 different species of Acanthamoeba, as well as that of the two species Vermamoeba vermiformis CDC-19 and Willaertia magna c2c maky, and to study the genetic relationship of these amoebae with giant viruses.This work provided important genetic information for several species of amoebae with different susceptibility to giant viral infection. An additional work aimed to discuss the defense strategies observed or predicted for giant viruses and their hosts. The search for sequence homologies in other giant viruses genomes revealed the presence of several sequences suspected to be involved in defense mechanisms. However, no significant evidence of integration of known genomes or virophages sequences has been identified in all available genomes of Acanthamoeba species.Finally, we have designed three PCR primer systems targeting a conserved gene that encodes an alanine-tRNA ligase, for an effective and specific identification of Acanthamoeba. This work opens up several potential leads for future work on the interactions between amoebae and giant viruses
Hubans, Christine. "Méthode ab initio de prédiction d'opérons chez les procaryotes et validations biologiques chez les Bordetelles". Lille 1, 2006. https://pepite-depot.univ-lille.fr/RESTREINT/Th_Num/2006/50376_2006_246.pdf.
Texto completoRal, Jean-Philippe. "Multiplicité et redondance des enzymes du métabolisme de l'amidon dans la lignée verte". Lille 1, 2004. https://pepite-depot.univ-lille.fr/RESTREINT/Th_Num/2004/50376-2004-23.pdf.
Texto completoBland, Céline. "Innovations pour l'annotation protéogénomique à grande échelle du vivant". Thesis, Montpellier 1, 2013. http://www.theses.fr/2013MON13508.
Texto completoProteogenomics is a recent field at the junction of genomics and proteomics which consists of refining the annotation of the genome of model organisms with the help of high-throughput proteomic data. Structural and functional errors are still frequent and have been reported on several occasions. Innovative methodologies to prevent such errors are essential. N-terminomics enables experimental validation of initiation codons and certification of the annotation data. With this objective in mind, two innovative strategies have been developed combining: i) selective N-terminal labeling of proteins, ii) multienzymatic digestion in parallel, and iii) specific enrichment of most N-terminal labeled peptides using either successive liquid chromatography steps or immunocapture directed towards the N-terminal label. Efficiency of these methodologies has been demonstrated using Roseobacter denitrificans as bacterial model organism. After enrichment with chromatography, 480 proteins were validated and 46 re-annotated. Several start sites for translation initiation were detected and homology driven annotation was challenged in some cases. After immunocapture, 269 proteins were characterized of which 40% were identified specifically after enrichment. Three novel genes were also annotated for the first time. Complementary results obtained after tandem mass spectrometry analysis allows easier data interpretation to reveal real start sites of translation initiation of proteins and to identify novel expressed products. In this way, the re-annotation process may become automatic and systematic to improve protein databases
Lehmann, Nathalie. "Development of bioinformatics tools for single-cell transcriptomics applied to the search for signatures of symmetric versus asymmetric division mode in neural progenitors". Electronic Thesis or Diss., Université Paris sciences et lettres, 2021. http://www.theses.fr/2021UPSLE070.
Texto completoIn recent years, single-cell RNA-seq (scRNA-seq) has fostered the characterization of cell heterogeneity at a remarkable high resolution. Despite their democratization, the analysis of scRNA-seq remains a challenge, particularly for organisms whose genomic annotations are partial. During my PhD, I observed that the chick genomic annotations are often incomplete, thus resulting in a loss of a large number of sequencing reads. I investigated how an enriched annotation affects the biological results and conclusions from these analyses. We developed a novel approach based on the re-annotation of the genome with scRNA-seq data and long reads bulk RNA-seq. This computational biology project capitalises on a tight collaboration with the experimental team of Xavier Morin (IBENS). The main biological focus is the search for signatures of symmetric versus asymmetric division mode in neural progenitors. In order to identify the key transcriptional switches that occur during the neurogenic transition, I have implemented bioanalysis approaches dedicated to the search for gene signatures from scRNA-seq data
Beyne, Emmanuelle. "Règles de cohérence pour l'annotation génomique : développement et mise en oeuvre in silico et in vivo". Phd thesis, Université Sciences et Technologies - Bordeaux I, 2008. http://tel.archives-ouvertes.fr/tel-00350902.
Texto completoAndréani, Julien. "Etude des grands virus à ADN nucléo-cytoplasmique : isolements et caractérisations". Thesis, Aix-Marseille, 2018. http://www.theses.fr/2018AIXM0695.
Texto completoMost viruses are known for their ability to cause symptomatic diseases in humans andother animals. Some of them are large DNA viruses named Nucleo-cytoplasmic Large DNAviruses (NCLDV), known for infecting eukaryotic cells. At the beginning of the 21st centuryfour families were defined by Iyer et al. as having a common origin (monophyletic group):Asfarviridae, Phycodnaviridae, Irido-Ascoviridae and Poxviridae.In 2003, the description of Acanthamoeba polyphaga Mimivirus broke this paradigmin the virus world. Because of their particles size (450 nm), their genome size (up to 1Mb),and their gene contents, their discovery changed the traditional definition of viruses (Lwoff).Since 2013 and the successive isolations of Pandoravirus, Pithovirus and Mollivirus; theseviruses have been characterized as possessing various novel properties.Their discoveries have been possible thanks to the co-culture method using protistnotably Acanthamoeba genus cells. This method went through multiple improvements and isemployed by different teams in different ways. In our case and in order to enhance thismethod we combined strategies applied in our co-culture. Indeed, this method consists inusing flow cytometry to detect lysis of protist cells (after all steps of co-culture enrichment).In addition, the flow cytometry was used with a DNA marker in order to identity viruses anddiscriminate viral populations. Then, we were able, using a FACS sorter device, to separatedifferent viral populations from our supernatants.Altogether these techniques have permitted the isolation of new viruses
Djebali, Sarah. "Un cadre formel pour l'annotation experte des gènes eucaryotes". Phd thesis, Université d'Evry-Val d'Essonne, 2006. http://tel.archives-ouvertes.fr/tel-00112099.
Texto completoLe fait que les règles heuristiques utilisées par l'expert soient de différentes natures, soient susceptibles d'évoluer au cours du temps et s'appliquent à des ressources de types différents, a fait ressortir le besoin d'un cadre formel à la fois flexible, intuitif et générique pour l'annotation de gènes. Nous avons donc également développé DACMLang, un langage dédié à l'annotation de gènes fondé sur les DACMs.
Smith, Adam Alexander Thil. "Exploitation automatisée des contextes métabolique et génomique pour l'annotation fonctionnelle des génomes prokaryotes". Thesis, Evry-Val d'Essonne, 2012. http://www.theses.fr/2012EVRY0002/document.
Texto completoThe subject of this thesis concerns the development of bioinformatic strategies exploiting genomic and metabolic contextual information in order to generate functional annotations for prokaryote genes. Two main projects were involved during this work: the first focuses on sequence-orphan enzymatic activities. Today, roughly 27% of activities defined by International Union of Biochemistry and Molecular Biology are sequence-orphans. For these, traditional bioinformatic approaches cannot propose candidate genes. It is thus imperative to use alternative, context-based approaches in such cases. The CanOE strategy fishing Candidate genes for Orphan Enzymes) was developed and added to the MicroScope bioinformatics platform in this aim. It integrates genomic and metabolic information across thousands of prokaryote genomes in order to locate promising gene candidates for orphan activities. The mirror project focuses on protein families of unknown function. A collaborative project has been set up at the Genoscope in hope of formalising functional exploration strategies for prokaryote protein families. A pilot version was created on the “DUF849” Pfam family, for which a single activity had recently been elucidated. Strategies for proposing novel functions and activities and creating isofunctional sub-families were researched, so as to guide biochemical experimentations and to analyse their results
Descorps-Declere, Stéphane. "Modélisation du processus d'annotation par une architecture blackboard". Paris 6, 2006. http://www.theses.fr/2006PA066165.
Texto completoBecause of the huge increase in sequencing projects of complete genomes, it is essential today to have powerful computer softwares able to help biologists towards a better use of analysis methods. This is particularly important because there exists many methods of analysis as well as a great amount of data. In order to solve this issue, we propose in this thesis to develop of a new bioinformatic model suitable for the annotation of complete genomes. From studying the "state of the art", we concluded that we could approach the modelisation of the annotation process of two different ways. This enabled us to propose a new model based on this analysis. We then decided to re-formalize the annotation process; it gave us the opportunity to propose an adequate software architecture in order to handle the process thus reformulated. The thesis is then a double contribution : it is both a modelisation of the annotation process, and a technical proposal based on a blackboard architecture
Carpentier, Mathilde. "Méthodes de détection des similarités structurales : caractérisation des motifs conservés dans les familles de structures pour l' annotation des génomes". Paris 6, 2005. http://www.theses.fr/2005PA066571.
Texto completoArnaiz, Olivier. "Annotation des génomes de paramécies Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression The Paramecium Germline Genome Provides a Niche for Intragenic Parasitic DNA: Evolutionary Dynamics of Internal Eliminated Sequences ParTIES: a toolbox for Paramecium interspersed DNA elimination studies". Thesis, université Paris-Saclay, 2020. http://www.theses.fr/2020UPASL046.
Texto completoThe next generation sequencing technologies (NGS) have revolutionized genomics.Genomes of numerous organisms have been sequenced and assembled.Deciphering the encoded information (annotation) has more than ever become crucial. In this thesis manuscript, I focus on the impact of NGS on the annotation of Paramecium genomes, in particular the annotation of genes and transposable elements (TE).Paramecia is a unicellular eukaryote possesses two types of nuclei. A germline nucleus (MIC) transmits the genetic information to the next sexual generation, and a somatic nucleus (MAC) is responsible for gene expression.Special genes caracteristics of paramecia stimulated me to develop a workflow dedicated to their annotation, using RNA-seq data.At each sexual cycle, the parental MAC is lost and a new MAC develops from a copy of the MIC, through programmed genome rearrangements, notably the elimination of DNA corresponding to TE and short unique copy sequences called IES (Internal Eliminated Sequence).I developed the ParTIES software, using DNA-Seq data, to identify the ~45,000 IESs in the germline genome of Paramecium tetraurelia and to show that the IESs are remnants of TE.A series of three whole genome duplications (WGD) in the evolutionary history of the lineage, ancient but still visible, allow us to describe the dynamics of the invasion and evolution of TE that decay to become IES
Carvunis, Anne-Ruxandra. "Des protéines et de leurs interactions aux principes évolutifs des systèmes biologiques". Thesis, Grenoble, 2011. http://www.theses.fr/2011GRENS001/document.
Texto completoDarwin exposed to the world that living species continuously evolve. Yet the molecular mechanisms of evolution remain under intense research. Systems biology proposes that dynamic molecular networks underlie relationships between genotype, environment and phenotype, but the organization of these networks is mysterious. Combining established concepts from evolutionary and systems biology with protein interaction mapping and the study of genome annotation methodologies, I have developed new bioinformatics approaches that partially unveiled the composition and organization of cellular systems for three eukaryotic organisms: the baker’s yeast, the nematode Caenorhabditis elegans and the plant Arabidopsis thaliana. My analyses led to insights into the evolution of biological systems. First, I propose that the translation of peptides from intergenic regions could lead to de novo birth of new protein-coding genes. Second, I show that the evolution of proteins originating from gene duplications and of their physical interaction repertoires are tightly interrelated. Lastly, I uncover signatures of the ancestral host-pathogen co-evolution in the topology of a host protein interaction network. My PhD work supports the thesis that molecular systems also evolve in a Darwinian fashion
Carvunis, Anne-ruxandra. "Des protéines et de leurs interactions aux principes évolutifs des systèmes biologiques". Phd thesis, Université de Grenoble, 2011. http://tel.archives-ouvertes.fr/tel-00586614.
Texto completoScott-Boyer, Marie Pier. "Annotation des ARN non codants du génome de Candida albicans par méthode bioinformatique". Thèse, 2009. http://hdl.handle.net/1866/2978.
Texto completoBioinformatics is a multidisciplinary field that uses biology, computer science, physics and mathematics to solve problems in biology. One of the topics of bioinformatics is the analysis of genomic sequences and prediction of genes from non-coding RNA (ncRNA). The non-coding RNAs are RNA molecules that are transcribed but not translated into protein and have a function in the cell. The use of biochemistry and molecular biology techniques in order to find non-coding RNA genes is rather difficult and relatively expensive. Thus, the prediction of genes by bioinformatics methods is an important issue. This research describes a computer analysis to search for new ncRNA in the pathogen Candida albicans and an experimental validation. The strategy used was to combine several algorithms and to validate a subset of computer predictions with a microarray experience covering 1979 regions of the genome. We have identified 62 new transcripts in Candida albicans. We have also developed an analytical method for tiling array and attempted to improve the prediction of ncRNAs this with a method based on the search of RNA motifs in the sequences.
Sarrasin, Matthew. "An approach to improved microbial eukaryotic genome annotation". Thèse, 2017. http://hdl.handle.net/1866/20415.
Texto completoMoreira, Sandrine. "Décodage de l'expression de gènes cryptiques". Thèse, 2016. http://hdl.handle.net/1866/18548.
Texto completoThanks to new high throughput sequencing technologies and automatic annotation pipelines, proceeding from an eppendorf tube to a genbank file can be achieved in a single mouse click or so, for some species. Others, however, fiercely resist bioinformaticians with their confounding genomic complexity. Diplonemids are one of them. My thesis is centered on the discovery of new strategies for encrypting genetic information in eukaryotes, and the identification of molecular decoding processes. Diplonemids are a group of poorly studied marine protists. Unexpectedly, metagenomic studies have recently ranked this group as one of the most diverse in the oceans. Yet, their most distinctive feature is their multipartite mitochondrial genome with genes in pieces, and encryption by nucleotide deletions and substitutions. Genes are decrypted at the RNA level through three processes: (i) trans-splicing, (ii) polyuridylation at the junction of gene pieces and (iii) substitutions of A-to-I and C-to-T. Such a diverse arsenal of mitochondrial post-transcriptional processes is highly exceptional. Using a bioinformatics approach, I have reconstructed the mitochondrial transcriptome from RNA-seq libraries. We have identified six new genes including one that presents alternative trans-splicing isoforms. In total, there are 216 uridines added in 14 genes with up to 29 U insertions, and 114 positions edited by deamination (A-to-I or C-to-T) among seven genes (nad4, nad7, rns, y1, y2, y3, y5). In order to identify the machinery that processes mitochondrial RNAs, the nuclear genome has been sequenced. I have then assembled and annotated the genome. This machinery is probably unique and complex because no cis signal or trans actor typical for known splicing machineries have been found. I have identified promising protein candidates that are worth to be tested experimentally, notably RNA ligases, numerous members of the PPR family involved in plants RNA editing and deaminases. During my thesis, we have identified new types of post-transcriptional RNA processing in diplonemid mitochondria and identified new promising candidates for the machinery. A system capable of joining precisely or editing RNAs could find biotechnological applications. From an evolutionary perspective, the discovery of new molecular systems gives insight into the process of gene recruitment, adaptation to new functions and establishment of complex molecular machineries.
Kannan, Sivakumar. "Molecular protein function prediction using sequence similarity-based and similarity-free approaches". Thèse, 2007. http://hdl.handle.net/1866/15681.
Texto completo