Dissertations / Theses on the topic 'Analyse des séquences biologiques'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Analyse des séquences biologiques.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
El, Zant El Kadhi Nahla. "Recherche de motifs relationnels dans des séquences : application aux séquences biologiques." Paris 13, 2005. http://www.theses.fr/2005PA132037.
Full textKhodji, Hiba. "Apprentissage profond et transfert de connaissances pour la détection d'erreurs dans les séquences biologiques." Electronic Thesis or Diss., Strasbourg, 2023. http://www.theses.fr/2023STRAD058.
Full textThe widespread use of high throughput technologies in the biomedical field is producing massive amounts of data, notably the new generation of genome sequencing technologies. Multiple Sequence Alignment (MSA) serves as a fundamental tool for the analysis of this data, with applications including genome annotation, protein structure and function prediction, or understanding evolutionary relationships, etc. However, the accuracy of MSA is often compromised due to factors such as unreliable alignment algorithms, inaccurate gene prediction, or incomplete genome sequencing. This thesis addresses the issue of data quality assessment by leveraging deep learning techniques. We propose novel models based on convolutional neural networks for the identification of errors in visual representations of MSAs. Our primary objective is to assist domain experts in their research studies, where the accuracy of MSAs is crucial. Therefore, we focused on providing reliable explanations for our model predictions by harnessing the potential of explainable artificial intelligence (XAI). Particularly, we leveraged visual explanations as a foundation for a transfer learning framework that aims essentially to improve a model's ability to focus on underlying features in an input. Finally, we proposed novel evaluation metrics designed to assess this ability. Initial findings suggest that our approach achieves a good balance between model complexity, performance, and explainability, and could be leveraged in domains where data availability is limited and the need for comprehensive result explanation is paramount
Diop, Awa. "Analyse des séquences des génomes bactériens en tant que source d'information taxonomique." Thesis, Aix-Marseille, 2018. http://www.theses.fr/2018AIXM0276/document.
Full textRapid identification and precise microbial classification are crucial in medical microbiology for human and animal health monitoring, appropriate clinical diagnosis and selection of optimal therapeutic and control measures. Indeed, the universal used for the definition of species are not applicable to many bacterial genera. This is particularly true of species of the genus Rickettsia which are strictly intracellular alpha-proteobacteria that express few phenotypic characteristics. Given the availability of genomic sequences of nearly 100 rickettsial genomes, we wanted to evaluate a range of taxonomic parameters based on genomic sequence analysis, to develop guidelines for the classification of Rickettsia isolates at the genus and species levels. By comparing the degree of similarity of the sequences of 78 genomes from Rickettsia species and 61 genomes from 3 closely related genera using several genomic parameters, we have shown that genome-based taxonomic tools are simple to use and fast, and allow for a reliable and reproducible taxonomic classification of isolates within species of the genus Rickettsia, with specific thresholds. The obtained results enabled us to develop guidelines for classifying rickettsial isolates at the genus and species levels. Using taxono-genomics, we have also been able to describe 17 new human-associated bacterial species on the basis of a combination of genomic analysis and phenotypic properties. The use of genomic tools is therefore perfectly adapted to taxonomic classification and can dramatically change our vision of taxonomy and bacterial evolution in the future
Balaguer, Patrick. "Détection non isotopique de sondes nucléiques : application à la réaction d'hybridation et d'amplification (PCR) : [Polymerase Catalysed Reaction]." Montpellier 2, 1989. http://www.theses.fr/1989MON20050.
Full textBenhamida, Sabria. "Mots interdits dans les séquences biologiques." Marne-la-Vallée, 2000. http://www.theses.fr/2000MARN0086.
Full textGîrdea, Marta. "De nouvelles méthodes pour l'alignement des séquences biologiques." Phd thesis, Université des Sciences et Technologie de Lille - Lille I, 2010. http://tel.archives-ouvertes.fr/tel-00833311.
Full textDugnolle, Patrick. "Outils mathématiques appliqués à l'analyse stoechiométrique d'une séquence vidéo-microscopique de cicatrisation in vitro en contraste de phase." Université Joseph Fourier (Grenoble), 2000. http://www.theses.fr/2000GRE10031.
Full textRonfard, Remi. "Analyse automatique de film - Des séquences d'images aux séquences d'actions." Habilitation à diriger des recherches, Université de Grenoble, 2009. http://tel.archives-ouvertes.fr/tel-00450230.
Full textCenac, Peggy. "Etude statistique de séquences biologiques et convergence de martingales." Phd thesis, Université Paul Sabatier - Toulouse III, 2006. http://tel.archives-ouvertes.fr/tel-00134328.
Full textque les méthodes de comptage de mots classiques ? A
partir d'une caractérisation basée sur la CGR, on propose une nouvelle famille de
tests donnant l'ordre d'une chaîne de Markov homogène.
On définit ensuite une construction d'arbres digitaux de recherche,
inspirés par la CGR, en insérant successivement les préfixes retournés d'une chaîne de Markov. On montre que les longueurs des branches critiques se comportent, au premier ordre, comme si les
séquences insérées étaient indépendantes entre elles.
La dernière partie est consacrée à l'étude de la convergence presque sûre des moments normalisés de tout ordre de martingales vectorielles dans le théorème de la limite centrale
presque sûr. Les résultats sont appliqués aux erreurs d'estimation et de prédiction dans les régressions linéaires et les processus de branchement.
Tran, Tuan Tu. "Comparaisons de séquences biologiques sur architecture massivement multi-cœurs." Phd thesis, Université des Sciences et Technologie de Lille - Lille I, 2012. http://tel.archives-ouvertes.fr/tel-00832663.
Full textFayyaz, Movaghar Afshin. "Signification statistique du score local d'alignement de séquences biologiques." Toulouse 2, 2007. http://www.theses.fr/2007TOU20083.
Full textThis thesis concerns the problem of assessing the statistical significance of the outcomes from the similarities of biological sequences. In this thesis, we are interested in optimizing local alignment. A local alignment expresses the similarities by giving the parts of sequences which are similar. If the methods (like BLAST) are appropriate for long sequences, in this thesis, we propose two new p-values stood on the exact means as follows that do not rely on asymptotics: 1. P-value relied on h-tuple approach: our method stands on combining an adapted scoring scheme that includes the gaps and an approximate distribution of the ungapped local score of two independent sequences. The new scoring scheme is defined on h-tuples of the sequences using the gapped global score. The influence of h and the accuracy of the p-value are numerically studied and compared with obtained p-value of BLAST. The numerical experiments emphasize that our approximate p-values outperform the BLAST ones for both simulated and real short sequences. 2. New p-value by improving Greedy Extension Model: this p-value stands on an approximation of Poisson whose parameter is to estimate. In this thesis, we estimate this parameter from the exact methods that results more accurate p-value. We illustrate our purpose by simulations
Cénac, Peggy. "Étude statistique de séquences biologiques et convergence de martingales." Toulouse 3, 2006. http://www.theses.fr/2006TOU30065.
Full textThe Chaos Game Representation is a dynamical system which maps a sequence of letters taken from a finite alphabet onto an empirical measure on a set. We show how the CGR can be used to characterize the order of an homogeneous Markov chain and to define a new family of tests. Then we propose a construction of Digital Search Trees, inspired from the CGR, by successively inserting all the returned prefixes of a Markov chain. We give the asymptotic behavior of the critical lengths of paths, which turns out to be, at first order, the same one as in the case of DST built from independent Markov chains. A last part deals with properties of almost sure convergence of vectorial martingales. Under suitable regularity conditions on the growing process, we establish the convergence of normalized moments of all orders in the almost sure central limit theorem. The results are applied to the cumulated errors of estimation and prediction in linear regression models and branching processes
Tran, Tuan Tu. "Comparaisons de séquences biologiques sur architecture massivement multi-coeurs." Thesis, Lille 1, 2012. http://www.theses.fr/2012LIL10138/document.
Full textSearching similarities between sequences is a fundamental operation in bioinformatics, providing insight in biological functions as well as tools for high-throughput data. There is a need to have algorithms able to process efficiently billions of sequences. To look for approximate similarities,a common heuristic is to consider short words that appear exactly in both sequences, the seeds, then to try to extend this similarity to the neighborhoods of the seeds. The thesis focuses on this second stage of seed-based heuristics : how can we retrieve and compare efficiently the neighborhoods of the seeds ? The thesis proposes several solutions tailored for manycore processors such as today’s GPUs. Such processors are making massively parallel computing more and more popular. The thesis proposes direct approaches (extension of bit-parallel Wu-Manber algorithm, published in PBC 2011, and binary search) and approaches with another index (with perfect hash functions). Each one of these solutions was conceived to obtain as much fine-grained parallelism as possible, requiring intensive but homogeneous computational operations. All proposed methods were implemented in OpenCL and benchmarked. Finally, the thesis presents MAROSE, a prototype parallel read mapper using these concepts. In some situations, MAROSE is more efficient than the existing read mappers with a comparable sensitivity
Mouchard, Laurent. "Superpositions dans les séquences." Rouen, 1998. http://www.theses.fr/1999ROUES038.
Full textÉtienne, Marie-Pierre. "Le score local : un outil pour l'analyse de séquences biologiques." Nancy 1, 2002. http://www.theses.fr/2002NAN10289.
Full textFor any organism, DNA, RNA and proteins information can be considered as long sequences of letters taken from a finite alphabet A. One way to analyze this information is to assign a weight at each letter (an elementary score). Then, we make the sum over each possible segment and search the segment which realizes the maximal score called the local score. Then the problem is to give a level of significance for this local score. We are led to study the distribution of the local score under the null hypothesis : elementary scores are i. I. D. Random variables. According to the sign of the mean, the behaviour of the local score is widely different. We determine the asymptotic distribution of the local score when random variables are centered. This work completes the asymptotic study of Karlin & al. And Mercier (the negative expectation). In the area of phase transition we study nurnerically the behaviour of the different approximations. Finally we give the rate of convergence of the cumulative distribution function for the local score over a sequence of length n, as n goes to infinity
Gorbovitskaia, Marie. "Identification et effets biologiques de séquences virales endogènes chez le poulet." Lyon 1, 1998. http://www.theses.fr/1998LYO10039.
Full textGroult, Richard. "Détection de répétitions en tandem avec évolution : application aux séquences biologiques." Rouen, 2004. http://www.theses.fr/2004ROUES014.
Full textAn evolutive tandem repeat consists in a series of almost contiguous copies, such that each copy is strongly similar to its predecessor and its successor. There is no model and the first and the last copy can be completely different. It differs from a ``classical'' tandem repeat that considers a model and a radius around this model. This ``new'' type of repeat has been pointed out during the assembly of the human genome. In this manuscript, I first define, in a formal way, the evolutive tandem repeats from observations achieved in biological sequences. After having carried out tests using well known existing softwares, I conclude that no one was able to detect it correctly and that they are not part of the sequences that have been submitted to databanks. I therefore designed algorithms that are able to locate this specific type of repeats: a first algorithm, quadratric in the length of the sequence, then a linear one. These algorithms have been implemented in order to provide biologists with tools that detect efficiently these repeats in real large sequences, such as human chromosomes. Several parallelisation methods are presented too. The results of the carried out tests on chromosomes show the existence of such repeats in genomes
Mercier, Sabine. "Statistiques des scores pour l'analyse et la comparaison de séquences biologiques." Rouen, 1999. http://www.theses.fr/1999ROUES089.
Full textNicodème, Pierre. "Alignement avec des familles de séquences protéiques." Paris 7, 1997. http://www.theses.fr/1997PA077333.
Full textBourguignon, Pierre Yves Vincent. "Parcimonie dans les modèles Markoviens et application à l'analyse des séquences biologiques." Thesis, Evry-Val d'Essonne, 2008. http://www.theses.fr/2008EVRY0042.
Full textMarkov chains, as a universal model accounting for finite memory, discrete valued processes, are omnipresent in applied statistics. Their applications range from text compression to the analysis of biological sequences. Their practical use with finite samples, however, systematically require to draw a compromise between the memory length of the model used, which conditions the complexity of the interactions the model may capture, and the amount of information carried by the data, whose limitation negatively impacts the quality of estimation. Context trees, as an extension of the model class of Markov chains, provide the modeller with a finer granularity in this model selection process, by allowing the memory length to vary across contexts. Several popular modelling methods are based on this class of models, in fields such as text indexation of text compression (Context Tree Maximization and Context Tree Weighting). We propose an extension of the models class of context trees, the Parcimonious context trees, which further allow the fusion of sibling nodes in the context tree. They provide the modeller with a yet finer granularity to perform the model selection task, at the cost of an increased computational cost for performing it. Thanks to a bayesian approach of this problem borrowed from compression techniques, we succeeded at desiging an algorithm that exactly optimizes the bayesian criterion, while it benefits from a dynamic programming scheme ensuring the minimisation of the computational complexity of the model selection task. This algorithm is able to perform in reasonable space and time on alphabets up to size 10, and has been applied on diverse datasets to establish the good performances achieved by this approach
Gautier, Christian. "Analyse statistique et évolution des séquences d'acides nucléiques." Lyon 1, 1987. http://www.theses.fr/1987LYO19034.
Full textAurengo, André. "Analyse factorielle des séquences d'images en médecine nucléaire." Paris 11, 1989. http://www.theses.fr/1989PA112413.
Full textMartin, Benjamin. "Analyse de structures répétitives dans les séquences musicales." Thesis, Bordeaux 1, 2012. http://www.theses.fr/2012BOR14711/document.
Full textThe work presented in this thesis deals with repetitive structure inference from audio signal using string matching techniques. It aims at proposing and evaluating inference algorithms from a formal study of notions of similarity and repetition in music.We first present a method for representing audio signals by symbolic strings. We introduce alignment tools enabling similarity estimation between such musical strings, and evaluate the application of these tools for automatic cover song identification. We further adapt a bioinformatics indexing technique to allow efficient assessments of music similarity in large-scale datasets. We then introduce several specific repetitive structures and use alignment tools to analyse these repetitions. A first structure, namely the repetition of a chosen segment, is retrieved and evaluated in the context of automatic assignment of missingaudio data. A second structure, namely the major repetition, is defined, retrieved and evaluated regarding expert annotations, and as an alternative indexing method for cover song identification.We finally present the problem of repetitive structure inference as addressed in literature, and propose our own problem statement. We further describe our model and propose an algorithm enabling the identification of a hierarchical music structure. We emphasize the relevance of our method through several examples and by comparing it to the state of the art
Blanquart, Samuel. "Reconstruction phylogénétique par analyse bayésienne des séquences moléculaires." Montpellier 2, 2007. http://www.theses.fr/2007MON20201.
Full textTwo non stationary and non parametric models for probabilistic phylogenetics have been developped. According to these models, a continuous stochastic process introduces variations of the stationary probabilities of the Markovian substitution process, inducing a free dimensionality. The second model combines the previous non stationary formalism with a mixture of substitution processes distributed among sites. This combination jointly accounts for evolutionary process variations both across sequences and along time, induced respectively by biochemical constraints applied to sites, and by compositional drifts. The two models have been implemented in a Markov Chain Monte Carlo (MCMC) framework and are available for the community. They appear robust against several reconstruction artefacts and their respective behaviors moreover suggest a new interpretation of long branch attraction (LBA) artefacts. This thesis concludes with some theoretical perspectives related to necessary improvements of the models, concerning the algorithmic complexity of the MCMC sampling, and the accuracy of the inference
Vergne, Nicolas. "Chaînes de Markov régulées et approximation de Poisson pour l'analyse de séquences biologiques." Phd thesis, Université d'Evry-Val d'Essonne, 2008. http://tel.archives-ouvertes.fr/tel-00322434.
Full textΠt/n = (1-t/n) Π0 + t/n Π1.
Cette modélisation correspond à une évolution douce entre deux états. Par exemple cela peut traduire la transition entre deux régimes d'un chaîne de Markov cachée, qui pourrait parfois sembler trop brutale. Ces modèles peuvent donc être vus comme une alternative mais aussi comme un outil complémentaire aux modèles de Markov cachés. Tout au long de ce travail, nous avons considéré des dérives polynomiales de tout degré ainsi que des dérives par splines polynomiales : le but de ces modèles étant de les rendre plus flexibles que ceux des polynômes. Nous avons estimé nos modèles de multiples manières puis évalué la qualité de ces estimateurs avant de les utiliser en vue d'applications telle la recherche de mots exceptionnels. Nous avons mis en oeuvre le software DRIMM (bientôt disponible à http://stat.genopole.cnrs.fr/sg/software/drimm/, dédié à l'estimation de nos modèles. Ce programme regroupe toutes les possibilités offertes par nos modèles, tels le calcul des matrices en chaque position, le calcul des lois stationnaires, des distributions de probabilité en chaque position... L'utilisation de ce programme pour la recherche des mots exceptionnels est proposée dans des programmes auxiliaires (disponibles sur demande).
Plusieurs perspectives à ce travail sont envisageables. Nous avons jusqu'alors décidé de faire varier la matrice seulement en fonction de la position, mais nous pourrions prendre en compte des covariables tels le degré d'hydrophobicité, le pourcentage en gc, un indicateur de la structure des protéines (hélice α, feuillets β...). Nous pourrions aussi envisager de mêler HMM et variation continue, où sur chaque région, au lieu d'ajuster un modèle de Markov, nous ajusterions un modèle de chaînes de Markov régulées.
Richard, Hugues. "Prédiction de la localisation cellulaire des protéines à l'aide de leurs séquences biologiques." Phd thesis, Université d'Evry-Val d'Essonne, 2005. http://tel.archives-ouvertes.fr/tel-00011707.
Full textAinsi la majorité de ce travail de thèse s'intéresse au problème de la prédiction du compartiment cellulaire d'une protéine à partir de sa séquence primaire.
Nous nous sommes attachés à proposer des alternatives descriptives aux méthodes existantes de prédiction de la localisation cellulaire en utilisant : (1) de nouveaux descripteurs issus de la séquence nucléique, (2) une approche par chaînes de Markov cachées (CMC) et arbres de décision. L'approche par CMC est justifiée biologiquement a posteriori car elle permet la modélisation de signaux d'adressage conjointement à la prise en compte de la composition globale. En outre, l'étape de classification hiérarchique par arbre améliore nettement les résultats de classification. Les résultats obtenues lors des comparaisons avec les méthodes existantes et utilisant des descripteurs fondés sur la composition globale possèdent des performances similaires.
Pudlo, Pierre. "Estimations précises de grandes déviations et applications à la statistique des séquences biologiques." Phd thesis, Université Claude Bernard - Lyon I, 2004. http://tel.archives-ouvertes.fr/tel-00008517.
Full textAguirre, Francisco. "Trajectographie sous-marine par analyse de séquences d'images vidéo." Brest, 1990. http://www.theses.fr/1990BRES2009.
Full textArribas, Gil Ana. "Estimation dans des modèles à variables cachées : alignement des séquences biologiques et modèles d'évolution." Paris 11, 2007. http://www.theses.fr/2007PA112054.
Full textThis thesis is devoted to parameter estimation in models for biological sequence alignment. These are models constructed considering an evolution process on the sequences. In the case of two sequences evolving under the classical evolution process, the alignment model is called a pair-Hidden Markov Model (pair-HMM). Observations in a pair-HMM are formed by the couple of sequences to be aligned and the hidden alignment is a Markov chain. From a theoretical point of view, we provide a rigorous formalism for these models and study consistency of maximum likelihood and bayesian estimators. From the point of view of applications, we are interested in detection of conserved motifs in the sequences. To do this we present an evolution process that allows heterogeneity along the sequence. The alignment under this process still fits the pair-HMM. We propose efficient estimation algorithms for alignments and evolution parameters. Finally we are interested in multiple alignment (more than two sequences). The classical evolution process for the sequences provides a complex hidden variable model for the alignment in which the phylogenetic relationships between the sequences must be taken into account. We provide a theoretical framework for this model and study, as for the pairwise alignment, the consistency of estimators
Einaudi, Florence. "Analyse de séquences magmatiques océaniques continues : approche pétrophysique et géochimique." Aix-Marseille 3, 2002. http://www.theses.fr/2002AIX30044.
Full textThis thesis presents the results of the investigations realized on two continuous magmatic sections. The first section is a basaltic sequence which has been sampled at high-resolution in the Oman Ophiolite. This section has been investigated in terms of physical properties, petrology and geochemistry to study the structure of the volcanics emplaced at the axis of a fast spreading ridge. The Wadi Shaffan section is composed of five magmatic cycles. The temporal variations enhanced by this study allowed the integration of magmatic processes from magma extraction to volcanic cycles. The second section was drilled into gabbroic basement in the Atlantis Bank (South West Indian Ridge). Downhole logging and minicore petrophysical measurements have been investigated. Electrical conduction via electronic processes in oxide-rich gabbros has been taken into account in our analysis of the electrical properties. High-resolution images have been interpreted in terms of deformation orientation
Casagranda, Stefano. "Modélisation, analyse et réduction des systèmes biologiques." Thesis, Université Côte d'Azur (ComUE), 2017. http://www.theses.fr/2017AZUR4049/document.
Full textThis thesis deals with modeling, analysis and reduction of various biological models, with a focus on gene regulatory networks in the bacterium E. coli. Different mathematical approaches are used. In the first part of the thesis, we model, analyze and reduce, using classical tools, a high-dimensional transcription-translation model of RNA polymerase in E. coli. In the second part, we introduce a novel method called Principal Process Analysis (PPA) that allows the analysis of high-dimensional models, by decomposing them into biologically meaningful processes, whose activity or inactivity is evaluated during the time evolution of the system. Exclusion of processes that are always inactive, and inactive in one or several time windows, allows to reduce the complex dynamics of the model to its core mechanisms. The method is applied to models of circadian clock, endocrine toxicology and signaling pathway; its robustness with respect to variations of the initial conditions and parameter values is also tested. In the third part, we present an ODE model of the gene expression machinery of E. coli cells, whose growth is controlled by an external inducer acting on the synthesis of RNA polymerase. We describe our contribution to the design of the model and analyze with PPA the core mechanisms of the regulatory network. In the last part, we specifically model the response of RNA polymerase to the addition of external inducer and estimate model parameters from single-cell data. We discuss the importance of considering cell-to-cell variability for modeling this process: we show that the mean of single-cell fits represents the observed average data better than an average-cell fit
Faisan, Sylvain. "Analyse et fusion markovienne de séquences en imagerie 3D+t : Application à l'analyse de séquences d'images IRM fonctionnelles cérébrales." Université Louis Pasteur (Strasbourg) (1971-2008), 2004. https://publication-theses.unistra.fr/public/theses_doctorat/2004/FAISAN_Sylvain_2004.pdf.
Full textHidden Markov Models (HMMs) which are widely used to process signals or images, are well-suited to the analysis of random processes that are segmental in nature. However, many processes, met in particular in the biomedical field, are event-based processes making the HMMs ill-suited. We present in this PHD two markovian approaches dedicated to the modeling and analysis of an event-based process or of multiple interacting event-based processes. Both approaches proceed in two steps. First, a preprocessing step detects and characterizes events of interest in the raw input data. Then, detected events are analyzed based on an adapted hidden Markov model. The two modeling approaches can be distinguished by the number of event sequences they can handle. The first approach, which is based on a hidden semi-Markov event sequence model(HSMESM), considers a single event sequence whereas the second approach,which is based on a hidden Markov multiple event sequence model (HMMESM),handles multiple observation channels at once, within a rich mathematical framework of fusion--association of asynchronous events across channels. From these models, two unsupervised functional MRI (fMRI) brain mapping methods have been developed. Both methods rely on the same, novel principle of temporal alignment between event sequences. By accounting for spatial information within a statistical framework of multiple event sequence detection- multiple event sequence fusion, the HMMESM-based mapping method shows high robustness to noise and variability of the active fMRI signal across space, time, experiments, and subjects. Besides, the HMMESM method clearly outperforms the HSMESM method as well as the widely used Statistical Parametric Mapping (SPM) approach
Beaudry, Cyrille. "Analyse et reconnaissance de séquences vidéos d'activités humaines dans l'espace sémantique." Thesis, La Rochelle, 2015. http://www.theses.fr/2015LAROS042/document.
Full textThis thesis focuses on the characterization and recognition of human activities in videos. This research domain is motivated by a large set of applications such as automatic video indexing, video monitoring or elderly assistance. In the first part of our work, we develop an approach based on the optical flow estimation in video to recognize human elementary actions. From the obtained vector field, we extract critical points and trajectories estimated at different spatio-temporal scales. The late fusion of local characteristics such as motion orientation and shape around critical points, combined with the frequency description of trajectories allow us to obtain one of the best recognition rate among state of art methods. In a second part, we develop a method for recognizing complex human activities by considering them as temporal sequences of elementary actions. In a first step, elementary action probabilities over time is calculated in a video sequence with our first approach. Vectors of action probabilities lie in a statistical manifold called semantic simplex. Activities are then represented as trajectories on this manifold. Finally, a new descriptor is introduced to discriminate between activities from the shape of their associated trajectories. This descriptor takes into account the induced geometry of the simplex manifold
Clarot, Pierre. "Analyse de séquences vidéo de surveillance basée sur la détection d'activités." Mémoire, Université de Sherbrooke, 2010. http://savoirs.usherbrooke.ca/handle/11143/4882.
Full textNjonkou, Fankam Marc-Aurèle. "Analyse du mouvement dans les séquences d'images et filtrage linéaire récursif." Rouen, 1997. http://www.theses.fr/1997ROUES046.
Full textJorda, Julien. "Analyse systématique des motifs répétés en tandem dans les séquences protéiques." Thesis, Montpellier 2, 2010. http://www.theses.fr/2010MON20090/document.
Full textOver the last decades, technical advances in molecular biology such as the genome sequencing projects led to a huge increase of data in the biological databanks. Among them, there are particular motifs which are adjacently repeated and similar between them, called tandem repeats. The purpose of this thesis is to understand the existence of these repeats in protein sequences through a large-scale analysis
Domelevo, Entfellner Jean-Baka. "Combinaison de modèles phylogénétiques et longitudinaux pour l'analyse des séquences biologiques : reconstruction de HMM profils ancestraux." Phd thesis, Université Montpellier II - Sciences et Techniques du Languedoc, 2011. http://tel.archives-ouvertes.fr/tel-00842847.
Full textBoukhetta, Salah Eddine. "Analyse de séquences avec GALACTIC – Approche générique combinant analyse formelle des concepts et fouille de motifs." Electronic Thesis or Diss., La Rochelle, 2022. http://www.theses.fr/2022LAROS035.
Full textA sequence is a sequence of ordered elements such as travel trajectories or sequences of product purchases in a supermarket. Sequence mining is a domain of data mining that aims an extracting frequent sequential patterns from a set of sequences, where these patterns are most often common subsequences. Support is a monotonic measure that defines the proportion of data sharing a sequential pattern. Several algorithms have been proposed for frequent sequential pattern extraction. With the evolution of computing capabilities, the task of frequent sequential pattern extraction has become faster. The difficulty then lies in the large number of extracted sequential patterns, which makes it difficult to read and therefore to interpret. We speak about "deluge of patterns". Formal Concept Analysis (FCA) is a field of data analysis for identifying relationships in a set of binary data. Pattern structures extend FCA to handle complex data such as sequences. The GALACTIC platform implements the Next Priority Concept algorithm which proposes a pattern extraction approach for heterogeneous and complex data. It allows a generic pattern computation through specific descriptions of objects by monadic predicates. It also proposes to refine a set of objects through specific exploration strategies, which allows to reduce the number of patterns. In this work, we are interested in the analysis of sequential data using GALACTIC. We propose several descriptions and strategies adapted to sequences. We also propose unsupervised quality measures to be able to compare between the obtained patterns. A qualitative and quantitative analysis is conducted on real and synthetic datasets to show the efficiency of our approach
Bourdon, Jérémie. "Sources Probabilistes: des séquences aux systèmes." Habilitation à diriger des recherches, Université de Nantes, 2012. http://tel.archives-ouvertes.fr/tel-00776681.
Full textAubert, Julie. "Analyse statistique de données biologiques à haut débit." Thesis, Université Paris-Saclay (ComUE), 2017. http://www.theses.fr/2017SACLS048/document.
Full textThe technological progress of the last twenty years allowed the emergence of an high-throuput biology basing on large-scale data obtained in a automatic way. The statisticians have an important role to be played in the modelling and the analysis of these numerous, noisy, sometimes heterogeneous and collected at various scales. This role can be from several nature. The statistician can propose new concepts, or new methods inspired by questions asked by this biology. He can propose a fine modelling of the phenomena observed by means of these technologies. And when methods exist and require only an adaptation, the role of the statistician can be the one of an expert, who knows the methods, their limits and the advantages.In a first part, I introduce different methods developed with my co-authors for the analysis of high-throughput biological data, based on latent variables models. These models make it possible to explain a observed phenomenon using hidden or latent variables. The simplest latent variable model is the mixture model. The first two presented methods constitutes two examples: the first in a context of multiple tests and the second in the framework of the definition of a hybridization threshold for data derived from microarrays. I also present a model of coupled hidden Markov chains for the detection of variations in the number of copies in genomics taking into account the dependence between individuals, due for example to a genetic proximity. For this model we propose an approximate inference based on a variational approximation, the exact inference not being able to be considered as the number of individuals increases. We also define a latent-block model modeling an underlying structure per block of rows and columns adapted to count data from microbial ecology. Metabarcoding and metagenomic data correspond to the abundance of each microorganism in a microbial community within the environment (plant rhizosphere, human digestive tract, ocean, for example). These data have the particularity of presenting a dispersion stronger than expected under the most conventional models (we speak of over-dispersion). Biclustering is a way to study the interactions between the structure of microbial communities and the biological samples from which they are derived. We proposed to model this phenomenon using a Poisson-Gamma distribution and developed another variational approximation for this particular latent block model as well as a model selection criterion. The model's flexibility and performance are illustrated on three real datasets.A second part is devoted to work dedicated to the analysis of transcriptomic data derived from DNA microarrays and RNA sequencing. The first section is devoted to the normalization of data (detection and correction of technical biases) and presents two new methods that I proposed with my co-authors and a comparison of methods to which I contributed. The second section devoted to experimental design presents a method for analyzing so-called dye-switch design.In the last part, I present two examples of collaboration, derived respectively from an analysis of genes differentially expressed from microrrays data, and an analysis of translatome in sea urchins from RNA-sequencing data, how statistical skills are mobilized, and the added value that statistics bring to genomics projects
Elloumi, Mourad. "Analyse de chaînes de caractères codant des macromolécules biologiques." Aix-Marseille 3, 1994. http://www.theses.fr/1994AIX30031.
Full textVirazel, Arnaud. "Test intégré des circuits digitaux : analyse et génération de séquences aléatoires adjacentes." Montpellier 2, 2001. http://www.theses.fr/2001MON20094.
Full textBruey-Sédano, Nathalie. "Analyse fonctionnelle des séquences régulatrices du gène cuticulaire ACP65A chez la drosophile." Dijon, 2001. http://www.theses.fr/2001DIJOS034.
Full textHadji, Mohamed. "Contributions à l'étude d'un processeur s'intégrant dans un réseau systolique linéaire dédié à la comparaison des séquences biologiques." Rennes 1, 1995. http://www.theses.fr/1995REN10044.
Full textCabrol-Bass, Daniel. "Étude conformationnelle quantique de séquences tri peptidiques typiques des structures collagéniques." Nice, 1985. http://www.theses.fr/1985NICE4002.
Full textTankam, Chloé. "Analyse économique du développement des nouveaux marchés biologiques : le cas des marchés biologiques domestiques au Kenya." Thesis, Clermont-Ferrand 1, 2015. http://www.theses.fr/2015CLF10480.
Full textOver the past two decades the organic agro-food system has been transformed, extending beyond Northern countries and expanding in many so-called developing countries.This dynamic was first a positioning on high added value products’ export markets. Since several years, domestic markets have emerged in a growing number of developing countries, including Kenya. This thesis attempts to shed light on the dynamics explaining these new markets’ emergence and development. The research is based on four questions. The first one concerns the way organic transactions have been organized in Nairobi. Based on neo institutional economics, we explain how, in the absence of credible certification systems, these products are able to be sold with premium. The second question deals with the conditions of producers’ access to these markets. It is common to highlight that organic domestic Kenyan markets are more profitable than conventional ones. However, some producers, who could actually sell in these markets, prefer conventionalones. Thanks to the concept of transaction costs, we show the role of negotiation and monitoring transaction costs. Chapter 3 analyses the effects of selling on organic market on crops’ diversification, as a proxy of economic and environmental sustainability .Based on microeconomics’ contributions our results confirm the positive effect of organic markets. Finally, our fourth and final chapter provides an analysis of the conditions of these markets sustainability. We propose an analysis grid combining economy of information and quality construction approach, based on Hirschman (1970) works.This grid helps us comparing different certification and verification schemes existing in Kenya. Analysis shows that the least effective scheme is the one that is bound to grow,raising the question of the sustainability of Kenyan domestic organic markets
Corpetti, Thomas. "Images & télédétection : analyse de séquences à basse et très haute résolution spatiale." Habilitation à diriger des recherches, Université Rennes 1, 2011. http://tel.archives-ouvertes.fr/tel-00616558.
Full textPalmeira, Leonor. "Analyse et modélisation des dépendances entre sites voisins dans l'évolution des séquences d'ADN." Phd thesis, Université Claude Bernard - Lyon I, 2007. http://tel.archives-ouvertes.fr/tel-00178453.
Full textBoulfani-Cuisinaud, Yasmine. "Compression au fil de l'eau de séquences de maillages 3D par analyse multirésolution." Nice, 2010. http://www.theses.fr/2010NICE4080.
Full textWe propose in this thesis two approaches for geometric compression of 3D dynamic meshes computing on the fly multiresolution analysis. As a first compression approach, the encoder TWC (Temporal wavelet Coding) is proposed, including a temporal wavelet transforms implemented by several lifting schemes. The subbands of the wavelet transforms are encoded thanks to a bit allocation giving the optimal quantization steps, by optimizing the rate-distorstion trade-off. A second compression approach is proposed, called MCTWC (Motion Compensated Temporal Wavelet Coding). It is based on the motion compensated wavelet transform. An automatic motion-based segmentation technique is proposed using z region growing approach : the vertices that follow the same affine motion belong to a same cluster. A motion estimation / compensation is then applied to each cluster of the partition to obtain a cluster-based motion compensation sequence. Then, a wavelet-based lifting scheme is applied. Experimentally, the encoder MCTWC including motion compensation improves significantly the compression performances compared to the encoder TWC which does not include this feature. Also, compared to some state of the art encoders, the proposed coders do not depend on the geometric characteristics of the mesh sequences, permit a low computational complexity multiresolution analysis, and offer features like temporal scalability and streaming
Palmeira, Leonor. "Analyse et modélisation des dépendances entre sites voisins dans l’évolution des séquences d’ADN." Lyon 1, 2007. http://tel.archives-ouvertes.fr/docs/00/17/84/53/PDF/structure.pdf.
Full textOn the one hand, this study examined dinucleotide over- and under-representations in different complete genomes, in order to determine possible links with DNA damage known mechanims. We focused on direct 5’ and 3’ neighbors, and analyzed the effect of UV light on the genomes of micro-organisms, and the effect of methylation on the genomes of metazoans. On the other hand, recent results by Bérard et al. On models of evolution incorporating neighboring site dependencies (pyrimidine followed by purine), allowed us to develop a probabilistic approach for the estimation of substitution rates due to the methylation-deamination process acting on CG dinucleotides