Dissertations / Theses on the topic 'Exploration des séquences'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 31 dissertations / theses for your research on the topic 'Exploration des séquences.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Faisan, Sylvain. "Analyse et fusion markovienne de séquences en imagerie 3D+t : Application à l'analyse de séquences d'images IRM fonctionnelles cérébrales." Université Louis Pasteur (Strasbourg) (1971-2008), 2004. https://publication-theses.unistra.fr/public/theses_doctorat/2004/FAISAN_Sylvain_2004.pdf.
Full textHidden Markov Models (HMMs) which are widely used to process signals or images, are well-suited to the analysis of random processes that are segmental in nature. However, many processes, met in particular in the biomedical field, are event-based processes making the HMMs ill-suited. We present in this PHD two markovian approaches dedicated to the modeling and analysis of an event-based process or of multiple interacting event-based processes. Both approaches proceed in two steps. First, a preprocessing step detects and characterizes events of interest in the raw input data. Then, detected events are analyzed based on an adapted hidden Markov model. The two modeling approaches can be distinguished by the number of event sequences they can handle. The first approach, which is based on a hidden semi-Markov event sequence model(HSMESM), considers a single event sequence whereas the second approach,which is based on a hidden Markov multiple event sequence model (HMMESM),handles multiple observation channels at once, within a rich mathematical framework of fusion--association of asynchronous events across channels. From these models, two unsupervised functional MRI (fMRI) brain mapping methods have been developed. Both methods rely on the same, novel principle of temporal alignment between event sequences. By accounting for spatial information within a statistical framework of multiple event sequence detection- multiple event sequence fusion, the HMMESM-based mapping method shows high robustness to noise and variability of the active fMRI signal across space, time, experiments, and subjects. Besides, the HMMESM method clearly outperforms the HSMESM method as well as the widely used Statistical Parametric Mapping (SPM) approach
Levivier, Emilie. "Exploration des similitudes de séquences protéiques à haut niveau de divergence évolutive : perspectives de l'approche Hydrophobic Cluster Analysis (HCA)." Paris 7, 2003. http://www.theses.fr/2003PA077069.
Full textLi, Dong Haoyuan. "Extraction de séquences inattendues : des motifs séquentiels aux règles d’implication." Montpellier 2, 2009. http://www.theses.fr/2009MON20253.
Full textThe sequential patterns can be viewed as an extension of the notion of association rules with integrating temporal constraints, which are effective for representing statistical frequency based behaviors between the elements contained in sequence data, that is, the discovered patterns are interesting because they are frequent. However, with considering prior domain knowledge of the data, another reason why the discovered patterns are interesting is because they are unexpected. In this thesis, we investigate the problems in the discovery of unexpected sequences in large databases with respect to prior domain expertise knowledge. We first methodically develop the framework Muse with integrating the approaches to discover the three forms of unexpected sequences. We then extend the framework Muse by adopting fuzzy set theory for describing sequence occurrence. We also propose a generalized framework SoftMuse with respect to the concept hierarchies on the taxonomy of data. We further propose the notions of unexpected sequential patterns and unexpected implication rules, in order to evaluate the discovered unexpected sequences by using a self-validation process. We finally propose the discovery and validation of unexpected sentences in free format text documents. The usefulness and effectiveness of our proposed approaches are shown with the experiments on synthetic data, real Web server access log data, and text document classification
Jaziri, Rakia. "Modèles de mélanges topologiques pour la classification de données structurées en séquences." Paris 13, 2013. http://scbd-sto.univ-paris13.fr/secure/edgalilee_th_2013_jaziri.pdf.
Full textRecent years have seen the development of data mining techniques in various application areas, with the purpose of analyzing sequential, large and complex data. In this work, the problem of clustering, visualization and structuring data is tackled by a three-stage proposal. The first proposal present a generative approach to learn a new probabilistic Self-Organizing Map (PrSOMS) for non independent and non identically distributed data sets. Our model defines a low dimensional manifold allowing friendly visualizations. To yield the topology preserving maps, our model exhibits the SOM like learning behavior with the advantages of probabilistic models. This new paradigm uses HMM (Hidden Markov Models) formalism and introduces relationships between the states. This allows us to take advantage of all the known classical views associated to topographic map. The second proposal concerns a hierarchical extension of the approach PrSOMS. This approach deals the complex aspect of the data in the classification process. We find that the resulting model ”H-PrSOMS” provides a good interpretability of classes built. The third proposal concerns an alternative approach statistical topological MGTM-TT, which is based on the same paradigm than HMM. It is a generative topographic modeling observation density mixtures, which is similar to a hierarchical extension of time GTM model. These proposals have then been applied to test data and real data from the INA (National Audiovisual Institute). This work is to provide a first step, a finer classification of audiovisual broadcast segments. In a second step, we sought to define a typology of the chaining of segments (multiple scattering of the same program, one of two inter-program) to provide statistically the characteristics of broadcast segments. The overall framework provides a tool for the classification and structuring of audiovisual programs
Nicolas, Renaud. "Développement de nouvelles séquences d'IRM de diffusion dédiées à la neuro-imagerie." Toulouse 3, 2012. http://www.theses.fr/2012TOU30283.
Full textThis PhD thesis is dedicated to a technique, diffusion MRI, which allow to obtain images of micro-structural properties (inferior to the MRI voxel size) of biological media, and to the application of this technique to study brain. Because of its ability to reveal early micro-structural changes (associated with complex energetic metabolism changes), diffusion MRI is become a reference method to detect focal diseases like ischemic stroke. The reader can find in this thesis a complete introduction to the physical phenomenon related to brownian motion in biological media and those related to diffusion NMR and MRI, and an original synthesis of the biological and biophysical determinisms of the changes of apparent diffusion coefficients observed in stroke animal models. To extend the field of the technique from stroke focal phenomenon (studied experimentally in man an rodents) to non focal pathologies, the study of the deviation of diffusion from Gaussian behaviour has been studied theoretically and experimentally. Pratical methodologies allowing the preparation of diffusion images for non-gaussian diffusion imaging, and artefacts corrections are described here. This work has lead to a study of non-gaussian diffusion MRI signal treated as an inverse problem and to applications for Alzheimer's disease detection, characterized by non-focal and microscopic lesions. Finally, we have developed three original approaches for technological developments of MRI sequences (with the associated image treatment necessary to use them). The first is the development of non-gaussian diffusion together with variation of diffusion time applied to imaging at 4. 7 and 7 T. The second concern the development of magnetization transfer and diffusion imaging that give additional information about water probed by MRI. The latter approach is the development of fonctionnal diffusion MRI at 3 T in DTI mode dedicated to apply the biological hypothesis resumed in the first part of this thesis, concerning the particular role of water in brain activation. With a progression for the experimental validations, hypothesis concerning micro-structures of biological media are tested and validated with different approaches (in vivo, ex vivo, in silico), to apply the recent discoveries concerning the physic of diffusion MRI in order to detect focal and non-focal pathologies and to interpret them
Hérisson, Joan. "Représentation spatiale et exploration virtuelle des génomes : une approche globale pour l'analyse des éléments architecturaux des séquences." Paris 11, 2004. http://www.theses.fr/2004PA112147.
Full textDNA sequences are often represented by a succession of four nucleotides: A, C, G and T. Even if this representation allows to study the linguistics and syntax of DNA sequences, it remains textual, local and monodimensional and does not provide any visual, local nor spatial information. However, DNA is a three-dimensional structure forming a double helix which can bend and create long distance interactions. The aim of this thesis is to propose a new approach of the genomic sequences in order to enrich classic analyses with three-dimensonal criteria. The modelling of these 3D DNA sequences is based on a biophysical model of spatial conformation of DNA. Such a representation raises problematics both in computer science - concerning Virtual Reality for scene management, interaction, data representation and associated algorithms - and in Bioinformatics of genomes. These different aspects, which form the pluridisciplinary nature of this thesis, have been treated through the software program tool ADN-Viewer that I have developed. Two directions have been taken during this work and should endure after this thesis. The first one is to come close as much as possible to the DNA biological reality. Our work represents a very first step in this sense and has to be enriched by new criteria of spatial conformation and by the integration of biological partners of DNA. The second direction is to exploit the three-dimensional structure of DNA as a representation among others to explore, treat and analyze the biological content of sequences
Guillame-Bert, Mathieu. "Apprentissage de règles associatives temporelles pour les séquences temporelles de symboles." Thesis, Grenoble, 2012. http://www.theses.fr/2012GRENM081/document.
Full textThe learning of temporal patterns is a major challenge of Data mining. We introduce a temporal pattern model called Temporal Interval Tree Association Rules (Tita rules or Titar). This pattern model can be used to express both uncertainty and temporal inaccuracy of temporal events. Among other things, Tita rules can express the usual time point operators, synchronicity, order, and chaining,disjunctive time constraints, as well as temporal negation. Tita rules are designed to allow predictions with optimum temporal precision. Using this representation, we present the Titar learner algorithm that can be used to extract Tita rules from large datasets expressed as Symbolic Time Sequences. This algorithm based on entropy minimization, apriori pruning and statistical dependence analysis. We evaluate our technique on simulated and real world datasets. The problem of temporal planning with Tita rules is studied. We use Tita rules as world description models for a Planning and Scheduling task. We present an efficient temporal planning algorithm able to deal with uncertainty, temporal inaccuracy, discontinuous (or disjunctive) time constraints and predictable but imprecisely time located exogenous events. We evaluate our technique by joining a learning algorithm and our planning algorithm into a simple reactive cognitive architecture that we apply to control a robot in a virtual world
Guillame-bert, Mathieu. "Apprentissage de règles associatives temporelles pour les séquences temporelles de symboles." Phd thesis, Université de Grenoble, 2012. http://tel.archives-ouvertes.fr/tel-00849087.
Full textWeber, Jonathan. "Segmentation morphologique interactive pour la fouille de séquences vidéo." Phd thesis, Université de Strasbourg, 2011. http://tel.archives-ouvertes.fr/tel-00643585.
Full textBastide, Nathalie. "Segmentation et analyse du mouvement du ventricule gauche à partir de séquences d'images cardiaques de scanographie ultra-rapide." Paris 12, 1993. http://www.theses.fr/1993PA120021.
Full textFiot, Céline. "Extraction de séquences fréquentes : des données numériques aux valeurs manquantes." Phd thesis, Université Montpellier II - Sciences et Techniques du Languedoc, 2007. http://tel.archives-ouvertes.fr/tel-00179506.
Full textMartinez, Coralie. "Classification précoce de séquences temporelles par de l'apprentissage par renforcement profond." Thesis, Université Grenoble Alpes (ComUE), 2019. http://www.theses.fr/2019GREAT123.
Full textEarly classification (EC) of time series is a recent research topic in the field of sequential data analysis. It consists in assigning a label to some data that is sequentially collected with new data points arriving over time, and the prediction of a label has to be made using as few data points as possible in the sequence. The EC problem is of paramount importance for supporting decision makers in many real-world applications, ranging from process control to fraud detection. It is particularly interesting for applications concerned with the costs induced by the acquisition of data points, or for applications which seek for rapid label prediction in order to take early actions. This is for example the case in the field of health, where it is necessary to provide a medical diagnosis as soon as possible from the sequence of medical observations collected over time. Another example is predictive maintenance with the objective to anticipate the breakdown of a machine from its sensor signals. In this doctoral work, we developed a new approach for this problem, based on the formulation of a sequential decision making problem, that is the EC model has to decide between classifying an incomplete sequence or delaying the prediction to collect additional data points. Specifically, we described this problem as a Partially Observable Markov Decision Process noted EC-POMDP. The approach consists in training an EC agent with Deep Reinforcement Learning (DRL) in an environment characterized by the EC-POMDP. The main motivation for this approach was to offer an end-to-end model for EC which is able to simultaneously learn optimal patterns in the sequences for classification and optimal strategic decisions for the time of prediction. Also, the method allows to set the importance of time against accuracy of the classification in the definition of rewards, according to the application and its willingness to make this compromise. In order to solve the EC-POMDP and model the policy of the EC agent, we applied an existing DRL algorithm, the Double Deep-Q-Network algorithm, whose general principle is to update the policy of the agent during training episodes, using a replay memory of past experiences. We showed that the application of the original algorithm to the EC problem lead to imbalanced memory issues which can weaken the training of the agent. Consequently, to cope with those issues and offer a more robust training of the agent, we adapted the algorithm to the EC-POMDP specificities and we introduced strategies of memory management and episode management. In experiments, we showed that these contributions improved the performance of the agent over the original algorithm, and that we were able to train an EC agent which compromised between speed and accuracy, on each sequence individually. We were also able to train EC agents on public datasets for which we have no expertise, showing that the method is applicable to various domains. Finally, we proposed some strategies to interpret the decisions of the agent, validate or reject them. In experiments, we showed how these solutions can help gain insight in the choice of action made by the agent
Raissi, Chedy. "Extraction de séquences fréquentes : des bases de données statiques aux flots de données." Montpellier 2, 2008. http://www.theses.fr/2008MON20063.
Full textZidouni, Azeddine. "Modèles graphiques discriminants pour l'étiquetage de séquences : application à la reconnaissance d'entités nommées radiophiniques." Thesis, Aix-Marseille 2, 2010. http://www.theses.fr/2010AIX22125/document.
Full textRecent researches in Information Extraction are designed to extract fixed types of information from data. Sequence annotation systems are developed to associate structured annotations to input data presented in sequential form. The named entity recognition (NER) task consists of identifying and classifying every word in a document into some predefined categories such as person name, locations, organizations, and dates. The complexity of the NER is largely related to the definition of the task and to the complexity of the relationships between words and the semantic associated. Our first contribution is devoted to solving the NER problem using discriminative graphical models. The proposed approach investigates the use of various contexts of the words to improve recognition. NER systems are fixed in accordance with a specific annotation protocol. Thus, new applications are developed for new protocols. The challenge is how we can adapt an annotation system which is performed for a specific application to other target application? We will propose in this work an adaptation approach of sequence labelling task based on annotation enrichment using conditional random fields (CRF). Experimental results show that the proposed approach outperform rules-based approach in NER task. Finally, we propose a multimodal approach of NER by integrating low level features as contextual information in radio broadcast news data. The objective of this study is to measure the correlation between the speaker voicing quality and the importance of his speech
Boukhetta, Salah Eddine. "Analyse de séquences avec GALACTIC – Approche générique combinant analyse formelle des concepts et fouille de motifs." Electronic Thesis or Diss., La Rochelle, 2022. http://www.theses.fr/2022LAROS035.
Full textA sequence is a sequence of ordered elements such as travel trajectories or sequences of product purchases in a supermarket. Sequence mining is a domain of data mining that aims an extracting frequent sequential patterns from a set of sequences, where these patterns are most often common subsequences. Support is a monotonic measure that defines the proportion of data sharing a sequential pattern. Several algorithms have been proposed for frequent sequential pattern extraction. With the evolution of computing capabilities, the task of frequent sequential pattern extraction has become faster. The difficulty then lies in the large number of extracted sequential patterns, which makes it difficult to read and therefore to interpret. We speak about "deluge of patterns". Formal Concept Analysis (FCA) is a field of data analysis for identifying relationships in a set of binary data. Pattern structures extend FCA to handle complex data such as sequences. The GALACTIC platform implements the Next Priority Concept algorithm which proposes a pattern extraction approach for heterogeneous and complex data. It allows a generic pattern computation through specific descriptions of objects by monadic predicates. It also proposes to refine a set of objects through specific exploration strategies, which allows to reduce the number of patterns. In this work, we are interested in the analysis of sequential data using GALACTIC. We propose several descriptions and strategies adapted to sequences. We also propose unsupervised quality measures to be able to compare between the obtained patterns. A qualitative and quantitative analysis is conducted on real and synthetic datasets to show the efficiency of our approach
Bercot, Béatrice. "Etude de l'expression de gènes d'aminoside 6'-N-acétyltransférase dans deux intégrons de classe 1 : exploration de séquences contrôlant la traduction ou la spécificité de substrat." Paris 5, 2002. http://www.theses.fr/2002PA05N015.
Full textSananes, Jean-Christophe. "Exploration IRM des voies biliaires : intérêt de la séquence Haste." Bordeaux 2, 1994. http://www.theses.fr/1994BOR23011.
Full textBen, Zakour Asma. "Extraction des utilisations typiques à partir de données hétérogènes en vue d'optimiser la maintenance d'une flotte de véhicules." Thesis, Bordeaux 1, 2012. http://www.theses.fr/2012BOR14539/document.
Full textThe present work is part of an industrial project driven by 2MoRO Solutions company.It aims to develop a high value service enabling aircraft operators to optimize their maintenance actions.Given the large amount of data available around aircraft exploitation, we aim to analyse the historical events recorded with each aircraft in order to extract maintenance forecasting. Theresults are used to integrate and consolidate maintenance tasks in order to minimize aircraft downtime and risk of failure. The proposed method involves three steps : (i) streamlining information in order to combinethem, (ii) organizing this data for easy analysis and (iii) an extraction step of useful knowledgein the form of interesting sequences. [...]
Mathonat, Romain. "Rule discovery in labeled sequential data : Application to game analytics." Thesis, Lyon, 2020. http://www.theses.fr/2020LYSEI080.
Full textIt is extremely useful to exploit labeled datasets not only to learn models and perform predictive analytics but also to improve our understanding of a domain and its available targeted classes. The subgroup discovery task has been considered for more than two decades. It concerns the discovery of rules covering sets of objects having interesting properties, e.g., they characterize a given target class. Though many subgroup discovery algorithms have been proposed for both transactional and numerical data, discovering rules within labeled sequential data has been much less studied. In that context, exhaustive exploration strategies can not be used for real-life applications and we have to look for heuristic approaches. In this thesis, we propose to apply bandit models and Monte Carlo Tree Search to explore the search space of possible rules using an exploration-exploitation trade-off, on different data types such as sequences of itemset or time series. For a given budget, they find a collection of top-k best rules in the search space w.r.t chosen quality measure. They require a light configuration and are independent from the quality measure used for pattern scoring. To the best of our knowledge, this is the first time that the Monte Carlo Tree Search framework has been exploited in a sequential data mining setting. We have conducted thorough and comprehensive evaluations of our algorithms on several datasets to illustrate their added-value, and we discuss their qualitative and quantitative results. To assess the added-value of one or our algorithms, we propose a use case of game analytics, more precisely Rocket League match analysis. Discovering interesting rules in sequences of actions performed by players and using them in a supervised classification model shows the efficiency and the relevance of our approach in the difficult and realistic context of high dimensional data. It supports the automatic discovery of skills and it can be used to create new game modes, to improve the ranking system, to help e-sport commentators, or to better analyse opponent teams, for example
Soriano, Mélanie. "Astérosismologie d'étoiles de séquence principale ou évoluées en relation avec l'expérience spatiale COROT et les instruments au sol HARPS et SOPHIE." Toulouse 3, 2009. http://thesesups.ups-tlse.fr/827/.
Full textThe work presented in this thesis focuses on asteroseismology of solar-type stars, and more specifically on central stars of planetary systems. The analysis of waves propagating in these stars can help constraining their internal structure. The two first chapters describe the theory and the stellar oscillations and the numerical tools used for this work. The third part deals with HD 52265, target of the CoRoT mission with a planet. We computed preliminary models for this star, taking into account the spectroscopic constraints, and we made some asteroseismic predictions. During this study, we found a seismic signature characteristic of the stellar core. This phenomenon is induced by a strong helium gradient in the core. We studied this effect in the general case of solar-type stars and we showed that it always happens, at the end of the main sequence or at the beginning of the subgiant branch. This characteristic signature can be used to constrain the stellar core. The fifth chapter is devoted to the star 51 Peg. We observed this star with the SOPHIE spectrograph at the Observatoire de Haute Provence in 2007 and we detected its oscillations. The analysis of the data led to the identification of 21 pulsation modes. Finally, we present a new seismic analysis of the exoplanet-host star µ Arae, observed and analysed by Bazot et al. In 2004. Asteroseismology coupled with spectroscopy allowed us to determine the helium abundance of the star, and its parameters: mass, age, radius, size of the convective core and its possible extension due to overshooting. These results show that asteroseismology is a powerful tool to bring constraints on the internal structure of stars
Pinet, Svetlana. "Exploration cognitive de l'écriture au clavier." Thesis, Aix-Marseille, 2016. http://www.theses.fr/2016AIXM3031/document.
Full textTyping has become a ubiquitous skill in our modern information societies. It constitutes an important language production modality and probably our preferred way to produce written language. Still its investigation is rather scarce. Understanding typing behavior pertains to several research domains such as language production, motor control and sequence programming. The aim of this thesis was to characterize linguistic and motor processing during typing. The methodology combined fine grained behavioral and electroencephalography (EEG) investigations.The first study aimed to assess the importance of linguistic processes during typing. It revealed a composite pattern of effects on response latencies, inter-keystroke intervals and accuracy rates. The second study assessed the reliability of an online platform to perform large-scale studies of typing skills. Then, three EEG studies aimed to characterize motor planning during typing and their putative interaction with linguistic processing. While linguistic processing was harder to trace with EEG, all three studies revealed a reliable pattern over motor cortices prior to the striking of the first keystroke of a word, interpreted as an index of motor preparation. The manipulation of effectors engaged in sequence production revealed versatile inhibitory processes dependent on the content of the sequence. The results are discussed in terms of linguistic and motor processes and their putative interactions during typed language production, contributing to the popular debate about information processing in cognitive science. This work provides novel data that pave the way to promising future investigations of typing
Aïzan, Josky. "Modélisation et reconnaissance d'activités quotidiennes au sein d'une maison intelligente : application à la surveillance des personnes âgées." Thesis, Littoral, 2020. http://www.theses.fr/2020DUNK0557.
Full textThe ADL systems for keeping seniors at home are expanding today. The new approaches involve setting up an automated activity monitoring system in a smart home equipped with wearable sensors such as Global Positioning System (GPS), electronics bracelets or RFID chips. These sensors unfortunately have the constraint to be worn constantly. The use of binary sensors is an increasingly common alternative. In this thesis we proposed modeling and recognition of daily activities within a smart home equipped with binary sensors. The first phase of the proposed architecture concerns activity modelling. Deterministic and uncertain sequential pattern mining algorithms were used. These algorothms contain a pre-processing phase that integrates the temporal constraint between events. The performance of these algorithms was evaluated on the MIT database, which contains a collection of human activities from two instruments of 77 and 84 sensors respectively. These experiments show that the number and quality of models from the modeling phase are strongly linked to the confidence rate of the sensors. The second phase of architecture involves the recognition of activities. During this phase, two approaches are proposed. The first approach is to pair the random forest method with the deterministic sequential pattern mining algorithm. This approach incorporates a temporal characterization of the activity models discovered. An experiment is carried out on the MIT database and the results in terms of activity recognition are 98% for the subject 1 and 95% for the subject 2. These results are compared with those in the literature to reflect the performance of the proposed approach. The second approach uses the sequence alignment recognition method based on the Levenshtein distance coupled with the uncertain sequential pattern mining. At this level, the uncertain sequential pattern mining algorithm integrates both the management of time constraints between events and the management of the uncertainty of data from the sensors. The performance of this method was evaluated on the MIT and CASAS databases. The CASAS database contains a collection of data from realistic scenarios to detect normal and intertwined daily activities. The results of the experiments on its two databases show that the recognition rate is an increasing function of the confidente rate of the sensors. These results are 100% and 94% respectively for the normal and interweave activities of the CASAS base and 93% and 90% respectively for the activities of subjects 1 and 2 of the MIT base. Compared with those in literature, these results highlight the effectiveness of our method
Richard, Jérémy. "De la capture de trajectoires de visiteurs vers l’analyse interactive de comportement après enrichissement sémantique." Electronic Thesis or Diss., La Rochelle, 2023. http://www.theses.fr/2023LAROS012.
Full textThis thesis focuses on the behavioral study of tourist activity using a generic and interactive analysis approach. The developed analytical process concerns the tourist trajectory in the city and museums as the study field. Experiments were conducted to collect movement data in the tourist city using GPS signals, thus enabling the acquisition of a movement trajectory. However, the study primarily focuses on reconstructing a visitor’s trajectory in museums using indoor positioning equipment, i.e., in a constrained environment. Then, a generic multi-aspect semantic enrichment model is developed to supplement an individual’s trajectory using multiple context data such as the names of neighborhoods the individual passed through in the city, museum rooms, weather outside, and indoor mobile application data. The enriched trajectories, called semantic trajectories, are then analyzed using formal concept analysis and the GALACTIC platform, which enables the analysis of complex and heterogeneous data structures as a hierarchy of subgroups of individuals sharing common behaviors. Finally, attention is paid to the "ReducedContextCompletion" algorithm that allows for interactive navigation in a lattice of concepts, allowing the data analyst to focus on the aspects of the data they wish to explore
Dumont, Victoria. "Explorations cérébrale et comportementale des capacités de traitement des séquences de stimuli tactiles non-sociaux par les nouveau-nés prématurés." Thesis, Normandie, 2017. http://www.theses.fr/2017NORMC017/document.
Full textThe neuroconstructivist theoretical framework of cognitive development, taking into account the variability of the constraints that act from the conception to shape development, is relevant to consider the early influence of sensory experiences on the neurobehavioral development of preterm neonates. They evolve in a particular environment and are vulnerable to neurodevelopmental disorders, to which atypical tactile and temporal processing are associated. The aim of the thesis is to study tactile and temporal abilities in preterm newborns and to evaluate the effect of the early environment on these perceptions. We included 61 preterm neonates (born between 32 and 34 weeks of gestational age (wGA)). At 35 weeks of corrected gestational age, we measured orienting responses (forearm, hand, and fingers movements) during vibrotactile stimulation of their hand and forearm, during a habituation and dishabituation paradigm, the dishabituation being either a location change or a pause in the stimulation sequence. Preterm newborns displayed a manual orienting response to vibrotactile stimuli which significantly decreased when the stimulus was repeated, regardless of the stimulated location on the limb. Habituation was delayed in subjects born at a younger gestational age, smaller birth weight, and having experienced more painful care procedures. Preterm neonates perceived changes in stimulus location and interstimulus time interval, suggesting a prenatal development of temporal processing capacities. These temporal processing abilities and their use to generate sensory prediction are being evaluated in a second study. 19 premature neonates (born between 31 and 32wGA) were presented with a tactile sequence (regular or irregular) at 33 and 35 weeks of corrected GA. Variations in cerebral blood flow were measured. At both corrected GA, tactile stimuli are associated with a hemodynamic response in the primary somatosensory cortex. At 33 weeks of corrected GA, omissions in the sequence are associated with an increase in cerebral blood flow, which indicates that premature neonates form sensory predictions, regardless of their experimental group. This thesis work allows to better characterize the tactile and temporal processing abilities in premature neonates, which lack recent and thorough investigation. In addition, it provides rational arguments that could help to propose sensory therapies to these patients, based on their perceptual abilities
Biteau, Nicolas. "Faisabilité du séquençage systématique d'un chromosome : stratégies et exploration du génome de Saccharomyces cerevisiae." Bordeaux 2, 1993. http://www.theses.fr/1993BOR28241.
Full textFahed, Lina. "Prédire et influencer l'apparition des événements dans une séquence complexe." Thesis, Université de Lorraine, 2016. http://www.theses.fr/2016LORR0125/document.
Full textFor several years now, a new phenomenon related to digital data is emerging : data which are increasingly voluminous, varied and rapid, appears and becomes available, they are often referred to as complex data. In this dissertation, we focus on a particular type of data : complex sequence of events, by asking the following question : “how to predict as soon as possible and to influence the appearance of future events within a complex sequence of events?”. First of all, we focus on the problem of predicting events as soon as possible in a sequence of events. We propose DEER : an algorithm for mining episode rules, which has the originality of controlling the horizon of the appearance of future events by imposing a temporal distance within the extracted rules. In a second phase, we address the problem of emergence detection in an events stream. We propose EER : an algorithm for detecting new emergent rules as soon as possible. In order to increase the reliability of new rules, EER relies on the similarity between theses rules and previously extracted rules. At last, we study the impact carried by events on other events within a sequence of events. We propose IE : an algorithm that introduces the concept of “influencer events” and studies the influence on the support, on the confidence and on the distance through three proposed measures. Our work is evaluated and validated through an experimental study carried on a real data set of blogs messages
Pham, Quang-Khai. "Time Sequence Summarization: Theory and Applications." Phd thesis, Université de Nantes, 2010. http://tel.archives-ouvertes.fr/tel-00538512.
Full textLuu, Vinh Trung. "Using event sequence alignment to automatically segment web users for prediction and recommendation." Thesis, Mulhouse, 2016. http://www.theses.fr/2016MULH0098/document.
Full textThis thesis explored the application of sequence alignment in web usage mining, including user clustering and web prediction and recommendation.This topic was chosen as the online business has rapidly developed and gathered a huge volume of information and the use of sequence alignment in the field is still limited. In this context, researchers are required to build up models that rely on sequence alignment methods and to empirically assess their relevance in user behavioral mining. This thesis presents a novel methodological point of view in the area and show applicable approaches in our quest to improve previous related work. Web usage behavior analysis has been central in a large number of investigations in order to maintain the relation between users and web services. Useful information extraction has been addressed by web content providers to understand users’ need, so that their content can be correspondingly adapted. One of the promising approaches to reach this target is pattern discovery using clustering, which groups users who show similar behavioral characteristics. Our research goal is to perform users clustering, in real time, based on their session similarity
Dalloux, Clément. "Fouille de texte et extraction d'informations dans les données cliniques." Thesis, Rennes 1, 2020. http://www.theses.fr/2020REN1S050.
Full textWith the introduction of clinical data warehouses, more and more health data are available for research purposes. While a significant part of these data exist in structured form, much of the information contained in electronic health records is available in free text form that can be used for many tasks. In this manuscript, two tasks are explored: the multi-label classification of clinical texts and the detection of negation and uncertainty. The first is studied in cooperation with the Rennes University Hospital, owner of the clinical texts that we use, while, for the second, we use publicly available biomedical texts that we annotate and release free of charge. In order to solve these tasks, we propose several approaches based mainly on deep learning algorithms, used in supervised and unsupervised learning situations
Belghiti, Moulay Tayeb. "Modélisation et techniques d'optimisation en bio-informatique et fouille de données." Thesis, Rouen, INSA, 2008. http://www.theses.fr/2008ISAM0002.
Full textThis Ph.D. thesis is particularly intended to treat two types of problems : clustering and the multiple alignment of sequence. Our objective is to solve efficiently these global problems and to test DC Programming approach and DCA on real datasets. The thesis is divided into three parts : the first part is devoted to the new approaches of nonconvex optimization-global optimization. We present it a study in depth of the algorithm which is used in this thesis, namely the programming DC and the algorithm DC ( DCA). In the second part, we will model the problem clustering in three nonconvex subproblems. The first two subproblems are distinguished compared to the choice from the norm used, (clustering via norm 1 and 2). The third subproblem uses the method of the kernel, (clustering via the method of the kernel). The third part will be devoted to bioinformatics, one goes this focused on the modeling and the resolution of two subproblems : the multiple alignment of sequence and the alignment of sequence of RNA. All the chapters except the first end in numerical tests
Lu, Peng. "Empirical study and multi-task learning exploration for neural sequence labeling models." Thèse, 2019. http://hdl.handle.net/1866/22530.
Full text