Tesis sobre el tema "Analyse de données fonctionnelle"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte los 50 mejores tesis para su investigación sobre el tema "Analyse de données fonctionnelle".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Explore tesis sobre una amplia variedad de disciplinas y organice su bibliografía correctamente.
Chevaillier, Béatrice. "Analyse de données d'IRM fonctionnelle rénale par quantification vectorielle". Phd thesis, Université de Metz, 2010. http://tel.archives-ouvertes.fr/tel-00557235.
Texto completoDécarie, Yann. "Analyse en composantes principales et analyse discriminante fonctionnelles appliquées à des données de prises alimentaires animales". Mémoire, Université de Sherbrooke, 2011. http://savoirs.usherbrooke.ca/handle/11143/4899.
Texto completoThirion, Bertrand. "Analyse de données d' IRM fonctionnelle : statistiques, information et dynamique". Phd thesis, Télécom ParisTech, 2003. http://tel.archives-ouvertes.fr/tel-00457460.
Texto completoOperto, Grégory. "Analyse structurelle surfacique de données fonctionnelles cétrébrales". Aix-Marseille 3, 2009. http://www.theses.fr/2009AIX30060.
Texto completoFunctional data acquired by magnetic resonance contain a measure of the activity in every location of the brain. If many methods exist, the automatic analysis of these data remains an open problem. In particular, the huge majority of these methods consider these data in a volume-based fashion, in the 3D acquisition space. However, most of the activity is generated within the cortex, which can be considered as a surface. Considering the data on the cortical surface has many advantages : on one hand, its geometry can be taken into account in every processing step, on the other hand considering the whole volume reduces the detection power of usually employed statistical tests. This thesis hence proposes an extension of the application field of volume-based methods to the surface-based domain by adressing problems such as projecting data onto the surface, performing surface-based multi-subjects analysis, and estimating results validity
Feuillard, Vincent. "Analyse d'une base de données pour la calibration d'un code de calcul". Phd thesis, Université Pierre et Marie Curie - Paris VI, 2007. http://tel.archives-ouvertes.fr/tel-00809048.
Texto completoKarkar, Slim Ismael. "Parcellisation et analyse multi-niveaux de données : Application à l’étude des réseaux de connectivité cérébrale". Strasbourg, 2011. https://publication-theses.unistra.fr/public/theses_doctorat/2011/KARKAR_Slim_Ismael_2011.pdf.
Texto completoOver the last decade, functional MRI has emerged as a widely used tool for mapping functions of the brain. More recently, it has been used for identifying networks of cerebral connectivity that represent the interactions between different brain areas. In this context, a recent strategy is based on a preliminary parcellation of the brain into functional regions, and then identifying functional networks from a measurement of interactions between each area. The first part of this thesis describes a novel approach for parcellation that produces regions that are homogeneous at several levels. These regions are shown to be consistent with the anatomical landmarks of the processed subjects. In the second part, we propose a new family of statistics to identify significant networks of functional connectivity. This approach enables the detection of small, strongly-connected networks as well as larger networks that involve weaker interactions. Finally, within a classification framework, we developed a group-level study, producing networks that synthesize characteristics of functional networks across the population under study
Prifti, Edi. "Une approche bioinformatique intégrative pour la recherche de cibles physiopathologiques dans les maladies complexes : une application aux données transcriptomiques". Paris 6, 2011. http://www.theses.fr/2011PA066175.
Texto completoGrollemund, Paul-Marie. "Régression linéaire bayésienne sur données fonctionnelles". Thesis, Montpellier, 2017. http://www.theses.fr/2017MONTS045.
Texto completoThe linear regression model is a common tool for a statistician. If a covariable is a curve, we tackle a high-dimensional issue. In this case, sparse models lead to successful inference, for instance by expanding the functional covariate on a smaller dimensional space.In this thesis, we propose a Bayesian approach, named Bliss, to fit the functional linear regression model. The Bliss model supposes, through the prior, that the coefficient function is a step function. From the posterior, we propose several estimators to be used depending on the context: an estimator of the support and two estimators of the coefficient function: a smooth one and a stewpise one. To illustrate this, we explain the black Périgord truffle yield with the rainfall during the truffle life cycle. The Bliss method succeeds in selecting two relevant periods for truffle development.As another feature of the Bayesian paradigm, the prior distribution enables the integration of preliminary judgments in the statistical inference. For instance, the biologists’ knowledge about the truffles growth is relevant to inform the Bliss model. To this end, we propose two modifications of the Bliss model to take into account preliminary judgments. First, we indirectly collect preliminary judgments using pseudo data provided by experts. The prior distribution proposed corresponds to the posterior distribution given the experts’ pseudo data. Futhermore, the effect of each expert and their correlations are controlled with weighting. Secondly, we collect experts’ judgments about the most influential periods effecting the truffle yield and if the effect is positive or negative. The prior distribution proposed relies on a penalization of coefficient functions which do not conform to these judgments.Lastly, the asymptotic behavior of the Bliss method is studied. We validate the proposed approach by showing the posterior consistency of the Bliss model. Using model-specific assumptions, efficient proof of the Wald theorem is given. The main difficulty is the misspecification of the model since the true coefficient function is surely not a step function. We show that the posterior distribution contracts on a step function which is the Kullback-Leibler projection of the true coefficient function on a set of step functions. This step function is derived from the true parameter and the design
Feydy, Jean. "Analyse de données géométriques, au delà des convolutions". Thesis, université Paris-Saclay, 2020. http://www.theses.fr/2020UPASN017.
Texto completoGeometric data analysis, beyond convolutionsTo model interactions between points, a simple option is to rely on weighted sums known as convolutions. Over the last decade, this operation has become a building block for deep learning architectures with an impact on many applied fields. We should not forget, however, that the convolution product is far from being the be-all and end-all of computational mathematics.To let researchers explore new directions, we present robust, efficient and principled implementations of three underrated operations: 1. Generic manipulations of distance-like matrices, including kernel matrix-vector products and nearest-neighbor searches.2. Optimal transport, which generalizes sorting to spaces of dimension D > 1.3. Hamiltonian geodesic shooting, which replaces linear interpolation when no relevant algebraic structure can be defined on a metric space of features.Our PyTorch/NumPy routines fully support automatic differentiation and scale up to millions of samples in seconds. They generally outperform baseline GPU implementations with x10 to x1,000 speed-ups and keep linear instead of quadratic memory footprints. These new tools are packaged in the KeOps (kernel methods) and GeomLoss (optimal transport) libraries, with applications that range from machine learning to medical imaging. Documentation is available at: www.kernel-operations.io/keops and /geomloss
Thirion, Bertrand. "Analyse de données d'IRM fonctionnelle : statistiques, information et dynamique /". Paris : École nationale supérieure des télécommunications, 2004. http://catalogue.bnf.fr/ark:/12148/cb39181884r.
Texto completoBibliogr. p. 239-252. Introd. en français. Résumé en français et en anglais.
Saumard, Mathieu. "Contribution à l'analyse statistique des données fontionnelles". Thesis, Rennes, INSA, 2013. http://www.theses.fr/2013ISAR0009/document.
Texto completoIn this thesis, we are interested in the functional data. The problem of estimation in a model of estimating equations is studying. We derive a central limit type theorem for the considered estimator. The optimal instruments are estimated, and we obtain a uniform convergence of the estimators. We are then interested in various testing with functional data. We study the problem of nonparametric testing for the effect of a random functional covariate on an error term which could be directly observed as a response or estimated from a functional model like for instance the functional linear model. We proved, in order to construct the tests, a result of dimension reduction which relies on projections of the functional covariate. We have constructed no-effect tests by using a kernel smoothing or a nearest neighbor smoothing. A goodness-of-fit test in the functional linear model is also proposed. All these tests are studied from a theoretical and practical perspective
Champely, Stéphane. "Analyse de données fonctionnelles : approximation par les splines de régression". Lyon 1, 1994. http://www.theses.fr/1994LYO10242.
Texto completoKarkar, Slim. "Parcellisation et analyse multi-niveaux de données IRM fonctionnelles. Application à l'étude des réseaux de connectivité cérébrale". Phd thesis, Université de Strasbourg, 2011. http://tel.archives-ouvertes.fr/tel-00652609.
Texto completoMartin, Hugo. "Étude de données et analyse de modèles intégro-différentiels en biologie cellulaire". Electronic Thesis or Diss., Sorbonne université, 2019. http://www.theses.fr/2019SORUS668.
Texto completoIn this dissertation, we are interested in the study of some dynamics in molecular biology, making us of mathematical analysis of established models, modelling and data analysis. The first two chapters focus on growth-fragmentation equations with linear growth rate. We are first interested in the recent so-called incremental model, describing a bacterial population. We prove the existence and uniqueness of the solution of the eigenproblem in a weighted Lebesgue space. Then we study the asymptotic behaviour of measures solutions of the growth-fragmentation equation in the equal mitosis case. A solution is then expressed as a semigroup acting on an initial condition. We extend to this framework a known phenomenon of long time oscillating dynamics, which results here in a weak convergence of the solution towards a periodic family of measures. The third chapter deals with the joint dynamics between mesenchymal, pre-adipocyte and adipocyte cells. We propose a non-linear model in which the growth rate depends on the average size of the latter and analyze it using both analytical and numerical approaches. In the last chapter, we carry out a statistical analysis of experimental data from individual yeast lines. In particular, we highlight the existence of distinct phenomena between early arrests and replicative senescence. Finally, we propose a refinement of an existing model, now able to describe the generation of onset of senescence for all the lineages
Conan-Guez, Brieuc. "Modélisation supervisée de données fonctionnelles par perceptron multi-couches". Phd thesis, Université Paris Dauphine - Paris IX, 2002. http://tel.archives-ouvertes.fr/tel-00178892.
Texto completoRaguideau, Sébastien. "Analyse de données de métagénomique fonctionnelle par NMF pour la modélisation de la dégradation des fibres par le microbiote intestinal humain". Thesis, Université Paris-Saclay (ComUE), 2016. http://www.theses.fr/2016SACLA027/document.
Texto completoThe purpose of this work of thesis is to model the capacity of degradation of non-digestible polysaccharides by the human intestinal microbiote. To this end we exploit metagenomic data. We use abundances of nucleotide sequences in 1408 samples whose metabolic function are assigned by annotation against a database. The sequences are annotated with functional markers. Upon manual selection of 86 functional markers relevant to the activity of metabolisation of polysaccharides, we their abundances variation among the metagenomic samples are studied.We propose an ecological approach in modeling the human intestinal microbiote. We consider the intense functional selection of this ecosystem and assume that identical cluster of metabolic functions can be found in different proportions in every human gut microbiota. We propose the term of functional assembly as to account for spacial and temporal co-occurence of functional cluster. In practice, theses assemblies are determined by their composition and can be interpreted as combinations of functional traits aggregated at the levels of the cluster of microorganisms composing each assembly. Functional assemblies are inferred by the means of Non-Negative Matrix Factorization (NMF). This method allows to determine the composition of functional assemblies and their abundance in each of the 1408 metagenomic sample.Furthermore, we exploit metabolic information from bibliographic resources and 190 microbial genomes in order to specify the composition of these functional assemblies. This information is translated in the form of a constraint.We find 4 assemblies by considering a consensus between various criteria. The use of metabolic information allow to interpret theses assemblies biologically. By exploiting the metadata of the 1408 samples, we observe a different behaviour for the samples coming from individuals suffering from Crohn disease. We validate this observation on external data.We proposed a reductionistic approach allowing to represent an important metabolic process at the level of the microbiota. We find a small number of 4 functional assemblies which are biologically likely and approach well the 1408 metagenomic samples
Chaouch, Mohamed. "Contribution à l'estimation non paramétrique des quantiles géométriques et à l'analyse des données fonctionnelles". Phd thesis, Université de Bourgogne, 2008. http://tel.archives-ouvertes.fr/tel-00364538.
Texto completoAndrieu, Cindie. "Modélisation fonctionnelle de profils de vitesse en lien avec l'infrastructure et méthodologie de construction d'un profil agrégé". Phd thesis, Toulouse 3, 2013. http://thesesups.ups-tlse.fr/2057/.
Texto completoThe knowledge of the actual vehicle speeds is an essential characteristic of drivers behavior and their road usage. This information become available with the generalization of connected vehicles, but also smartphones, which increase the number of "tracers" likely to refer their position and speed in real time. In this thesis, we propose to use these digital traces and to develop a methodology, based on a functional approach, to produce several reference speed profiles. In a first part, we propose a functional modeling of space-speed profiles (i. E. Speed vs position) and we study their properties (continuity, differentiability). In a second part, we propose a methodology to construct an estimator of a space speed profile from noisy measurements of position and speed, based on smoothing splines and the theory of reproducing kernel Hilbert spaces (RKHS). The third part is devoted to the construction of several aggregated profiles (average, median). In particular, we propose a landmark-based registration of profiles at stops, and we propose the construction of speed corridors reflecting the dispersion of actual speeds
Malkassian, Anthony. "Méthodes d’analyse fonctionnelle et multivariée appliquées à l’étude du fonctionnement écologique des assemblages phytoplanctoniques de l’étang de Berre". Thesis, Aix-Marseille, 2012. http://www.theses.fr/2012AIXM4108.
Texto completoThe study of the relationship between variations in phytoplankton abundance and environmental forces (natural or anthropogenic) in shallow brackish areas is essential to both understanding and managing this complex ecosystem. Over a 16 year (1994-2011) monthly monitoring program the relationships between physicochemical variables (temperature, salinity and nutrients) and phytoplankton assemblages of the Berre Lagoon were analyzed. Using data collected from this long-term study, we have addressed environmental management issues through the application of advanced statistical analyses and original data displays. These analyses and data displays can readily be applied to other data sets related to the environment, with the aim of informing both researcher and practitioner. Since 2004, a new policy for freshwater discharge has induced strong changes in the global salinity of the lagoon : a weakened stratification and a rarefaction of anoxia phenomena in its deepest part. A shift in the structure of the phytoplankton community has been observed in association with changes in environmental conditions. An increase of phytoplanktonic species richness, and more precisely, the emergence of species with marine affinity highlights the first step of a marinization of the lagoon. The results underline the significant impact of a new management policy in this specific coastal zone. We then focused on the response of phytoplankton to quick environmental variations. An original approach for automated high frequency analysis of phytoplankton was adopted with the use of an autonomous flow cytometer (CytoSense)
Bonizzi, Pietro. "Atrial activity extraction and analysis in atrial fibrillation episodes". Nice, 2010. http://www.theses.fr/2010NICE4027.
Texto completoLa fibrillation auriculaire (FA) est l’arythmie la plus fréquente dans le domaine clinique. Malgré son importance et fréquence (10% des gens plus âgés de 70), les mécanismes de génération de la FA sont encore plutôt inconnus. Différents stratégies pour traiter la FA sont sélectionnées par rapport à la durée des épisodes de FA, et leur efficacité dépend aussi du degré d’organisation de l’activité auriculaire (AA). Le degré d’organisation de l’AA dépend à son tour du niveau de chronicité de la FA, et du conséquent remodelage électro-structurel qui concerne le substrat du myocarde, et qui affecte le fonctionnement du noeud auriculo-ventriculaire en particulier. Par conséquent, des outils de traitement du signal appropriés s’avèrent nécessaires pour éclaircir les origines électrophysiologiques de la FA et son influence sur le système cardiaque. En particulier, l’intérêt du traitement du signal repose sur l’extraction de plus d’informations possibles des enregistrements non invasifs, en accord avec la tendance générale dans le domaine clinique, pour réduire les risques chez le patient et pour réduire le temps et le coût des analyses cliniques. Dans ce sens, une certaine connaissance du degré d’organisation de l’AA peut-être potentiellement important pour aider la décision clinique. Ceci pourrait guider la sélection du meilleur traitement de la FA pour chaque patient. Les méthodes classiques pour l’extraction d’un signal de AA des enregistrements d’électrocardiogramme (ECG) et pour l’estimation non invasive du degré de l’organisation des activations auriculaires pendant FA n’exploitent pas complètement la diversité spatiale offerte par des enregistrements ECG à plusieurs dérivations. En général, ils se concentrent sur l’analyse du contenu spectral de la FA dans une seule dérivation, avec le risque de sous-estimer la complexité réelle des activations auriculaires en interne. Dans ces travaux de thèse, nous exploitons la diversité spatiale offerte par des enregistrements ECG à plusieurs dérivations pour accomplir deux objectifs principaux. Premièrement, nous voulons améliorer la qualité de l’extraction du signal de AA des enregistrements ECG, nécessaire pour des ultérieures analyses détaillées de la FA. Pour ce faire, nous exploitons l’information spatiale de l’ECG pour généré des sous-espaces appropriés qui représentent chacune des activités cardiaques d’intérêt, la ventriculaire et l’auriculaire, respectivement, en déterminant les segments correspondants dans l’ECG. Ces sous-espaces sont exploités comme information a priori et insérés en forme de contraintes supplémentaires dans l’algorithme de extraction aveugle des sources. Différentes possibilités d’exploiter ces sous-espaces comme information a priori sont présentées, mettant en évidence leur polyvalence dans leur capacité de se concentrer de façon satisfaisante sur différentes caractéristiques des différentes activités cardiaques et de leur relations. Deuxièmement, nous voulons quantifier d’une manière non invasive le degré 1 de l’organisation spatio-temporel des activations auriculaires pendant FA à partir de l’étude des enregistrements ECG à plusieurs dérivations. Ceci est accompli en regardant la complexité spatiale de l’enregistrement de l’AA des enregistrements ECG correctement segmenté, et la stationnarité de l’AA mesurés exploitant une estimation de son sous-espace. Les résultats de notre étude confirment l’intérêt d’exploiter l’information spatiale dans l’ECG pour générer différentes sous-espaces qui décrivent de façon appropriée les composants ventriculaire et auriculaire de l’ECG. A leur tour, ces composants se révèlent utiles et pour définir des contraintes supplémentaires dans l’algorithme de extraction aveugle des sources pour l’extraction de l’AA et pour analyser directement l’organisation de la FA par des enregistrements de surface, soutenant la justesse des approches de traitement du signal qui exploitent la diversité spatiale dans l’analyse de la FA. Des premières applications de ces techniques pour l’étude des effets de l’ablation par cathéter sur la réorganisation de la FA à partir de l’analyse des enregistrements ECG standards à12 dérivations montrent leur importance clinique potentielle pour la sélection des sujets qui pourraient bénéficier de la thérapie d’ablation, et aussi indiquent la possibilité de les utiliser de manière plus généralisée dans des application cliniques à venir
Andrieu, Cindie. "Modélisation fonctionnelle de profils de vitesse en lien avec l'infrastructure et méthodologie de construction d'un profil agrégé". Phd thesis, Université Paul Sabatier - Toulouse III, 2013. http://tel.archives-ouvertes.fr/tel-00915420.
Texto completoDe, Vitis Alba Chiara. "Méthodes du noyau pour l’analyse des données de grande dimension". Thesis, Université Côte d'Azur (ComUE), 2019. http://www.theses.fr/2019AZUR4034.
Texto completoSince data are being collected using an increasing number of features, datasets are of increasingly high dimension. Computational problems, related to the apparent dimension, i.e. the dimension of the vectors used to collect data, and theoretical problems, which depends notably on the effective dimension of the dataset, the so called intrinsic dimension, have affected high dimensional data analysis. In order to provide a suitable approach to data analysis in high dimensions, we introduce a more comprehensive scenario in the framework of metric measure spaces. The aim of this thesis, is to show how to take advantage of high dimensionality phenomena in the pure high dimensional regime. In particular, we aim at introducing a new point of view in the use of distances and probability measures defined on the data set. More specifically, we want to show that kernel methods, already used in the intrinsic low dimensional scenario in order to reduce dimensionality, can be investigated under purely high dimensional hypotheses, and further applied to cases not covered by the literature
Kherif, Ferath. "Applications de modèles statistiques multivariés à la résolution de problèmes posés par les données d' imagerie fonctionnelle cérébrale". Paris 6, 2003. http://www.theses.fr/2003PA066598.
Texto completoZullo, Anthony. "Analyse de données fonctionnelles en télédétection hyperspectrale : application à l'étude des paysages agri-forestiers". Thesis, Toulouse 3, 2016. http://www.theses.fr/2016TOU30135/document.
Texto completoIn hyperspectral imaging, each pixel is associated with a spectrum derived from observed reflectance in d measurement points (i.e., wavelengths). We are often facing a situation where the sample size n is relatively low compared to the number d of variables. This phenomenon called "curse of dimensionality" is well known in multivariate statistics. The mored increases with respect to n, the more standard statistical methodologies performances are degraded. Reflectance spectra incorporate in their spectral dimension a continuum that gives them a functional nature. A hyperspectrum can be modelised by an univariate function of wavelength and his representation produces a curve. The use of functional methods allows to take into account functional aspects such as continuity, spectral bands order, and to overcome strong correlations coming from the discretization grid fineness. The main aim of this thesis is to assess the relevance of the functional approach in the field of hyperspectral remote sensing for statistical analysis. We focused on the nonparametric fonctional regression model, including supervised classification. Firstly, the functional approach has been compared with multivariate methods usually involved in remote sensing. The functional approach outperforms multivariate methods in critical situations where one has a small training sample size combined with relatively homogeneous classes (that is to say, hard to discriminate). Secondly, an alternative to the functional approach to overcome the curse of dimensionality has been proposed using parsimonious models. This latter allows, through the selection of few measurement points, to reduce problem dimensionality while increasing results interpretability. Finally, we were interested in the almost systematic situation where one has contaminated functional data. We proved that for a fixed sample size, the finer the discretization, the better the prediction. In other words, the larger dis compared to n, the more effective the functional statistical methodis
Gharbi, Zied. "Contribution à l’économétrie spatiale et l’analyse de données fonctionnelles". Thesis, Lille 1, 2019. http://www.theses.fr/2019LIL1A012/document.
Texto completoThis thesis covers two important fields of research in inferential statistics, namely spatial econometrics and functional data analysis. More precisely, we have focused on the analysis of real spatial or spatio-functional data by extending certain inferential methods to take into account a possible spatial dependence. We first considered the estimation of a spatial autoregressive model (SAR) with a functional dependent variable and a real response variable using observations on a given geographical unit. This is a regression model with the specificity that each observation of the independent variable collected in a geographical location depends on observations of the same variable in neighboring locations. This relationship between neighbors is generally measured by a square matrix called the spatial weighting matrix, which measures the interaction effect between neighboring spatial units. This matrix is assumed to be exogenous, i.e. the metric used to construct it does not depend on the explanatory variable. The contribution of this thesis to this model lies in the fact that the explanatory variable is of a functional nature, with values in a space of infinite dimension. Our estimation methodology is based on a dimension reduction of the functional explanatory variable through functional principal component analysis followed by maximization of the truncated likelihood of the model. Asymptotic properties of the estimators, illustrations of the performance of the estimators via a Monte Carlo study and an application to real environmental data were considered. In the second contribution, we use the functional SAR model studied in the first part by considering an endogenous structure of the spatial weighting matrix. Instead of using a geographical criterion to calculate the dependencies between neighboring locations, we calculate them via an endogenous process, i.e. one that depends on explanatory variables. We apply the same two-step estimation approach described above and study the performance of the proposed estimator for finite or infinite-tending samples. In the third part of this thesis we focus on heteroskedasticity in partially linear models for real exogenous variables and binary response variable. We propose a spatial Probit model containing a non-parametric part. Spatial dependence is introduced at the level of errors (perturbations) of the model considered. The estimation of the parametric and non-parametric parts of the model is recursive and consists of first setting the parametric parameters and estimating the non-parametric part using the weighted likelihood method and then using the latter estimate to construct a likelihood profile to estimate the parametric part. The performance of the proposed method is investigated via a Monte-Carlo study. An empirical study on the relationship between economic growth and environmental quality in Sweden using some spatial econometric tools finishes the document
Schmutz, Amandine. "Contributions à l'analyse de données fonctionnelles multivariées, application à l'étude de la locomotion du cheval de sport". Thesis, Lyon, 2019. http://www.theses.fr/2019LYSE1241.
Texto completoWith the growth of smart devices market to provide athletes and trainers a systematic, objective and reliable follow-up, more and more parameters are monitored for a same individual. An alternative to laboratory evaluation methods is the use of inertial sensors which allow following the performance without hindering it, without space limits and without tedious initialization procedures. Data collected by those sensors can be classified as multivariate functional data: some quantitative entities evolving along time and collected simultaneously for a same individual. The aim of this thesis is to find parameters for analysing the athlete horse locomotion thanks to a sensor put in the saddle. This connected device (inertial sensor, IMU) for equestrian sports allows the collection of acceleration and angular velocity along time in the three space directions and with a sampling frequency of 100 Hz. The database used for model development is made of 3221 canter strides from 58 ridden jumping horses of different age and level of competition. Two different protocols are used to collect data: one for straight path and one for curved path. We restricted our work to the prediction of three parameters: the speed per stride, the stride length and the jump quality. To meet the first to objectives, we developed a multivariate functional clustering method that allow the division of the database into smaller more homogeneous sub-groups from the collected signals point of view. This method allows the characterization of each group by it average profile, which ease the data understanding and interpretation. But surprisingly, this clustering model did not improve the results of speed prediction, Support Vector Machine (SVM) is the model with the lowest percentage of error above 0.6 m/s. The same applied for the stride length where an accuracy of 20 cm is reached thanks to SVM model. Those results can be explained by the fact that our database is build from 58 horses only, which is a quite low number of individuals for a clustering method. Then we extend this method to the co-clustering of multivariate functional data in order to ease the datamining of horses’ follow-up databases. This method might allow the detection and prevention of locomotor disturbances, main source of interruption of jumping horses. Lastly, we looked for correlation between jumping quality and signals collected by the IMU. First results show that signals collected by the saddle alone are not sufficient to differentiate finely the jumping quality. Additional information will be needed, for example using complementary sensors or by expanding the database to have a more diverse range of horses and jump profiles
Gautheron, Léo. "Construction de Représentation de Données Adaptées dans le Cadre de Peu d'Exemples Étiquetés". Thesis, Lyon, 2020. http://www.theses.fr/2020LYSES044.
Texto completoMachine learning consists in the study and design of algorithms that build models able to handle non trivial tasks as well as or better than humans and hopefully at a lesser cost.These models are typically trained from a dataset where each example describes an instance of the same task and is represented by a set of characteristics and an expected outcome or label which we usually want to predict.An element required for the success of any machine learning algorithm is related to the quality of the set of characteristics describing the data, also referred as data representation or features.In supervised learning, the more the features describing the examples are correlated with the label, the more effective the model will be.There exist three main families of features: the ``observable'', the ``handcrafted'' and the ``latent'' features that are usually automatically learned from the training data.The contributions of this thesis fall into the scope of this last category. More precisely, we are interested in the specific setting of learning a discriminative representation when the number of data of interest is limited.A lack of data of interest can be found in different scenarios.First, we tackle the problem of imbalanced learning with a class of interest composed of a few examples by learning a metric that induces a new representation space where the learned models do not favor the majority examples.Second, we propose to handle a scenario with few available examples by learning at the same time a relevant data representation and a model that generalizes well through boosting models using kernels as base learners approximated by random Fourier features.Finally, to address the domain adaptation scenario where the target set contains no label while the source examples are acquired in different conditions, we propose to reduce the discrepancy between the two domains by keeping only the most similar features optimizing the solution of an optimal transport problem between the two domains
Hedli-Griche, Sonia. "Estimation de l'opérateur de régression pour des données fonctionnelles et des erreurs corrélées". Université Pierre Mendès France (Grenoble), 2008. http://www.theses.fr/2008GRE21009.
Texto completoIn the research work that we present in this thesis, we study the problem of nonparametric modelization when the statistical data are represented by curves. More precisely, we are interested in the problems of prediction from an explanatory random variable that takes values in some, eventually, infinite dimensional space. Recently, some work has been realised in the functional operatoriel estimation under the independence assumptions of the functional data. In this thesis, we consider that the functional data are dependent and that the error process is stationary (with short or long memory). We have studied and estimated the regression operator under different set-ups: when the functional data (dependent) are deterministic or random, when the error process is a short or long memory, the asymptotic normality when the error process is negatively associated, the local/global choice of the bandwidth, the study of the relevancy of our theoretical results to simulated data and then to real data
Cardot, Hervé. "Contribution à l'estimation et à la prévision statistique de données fonctionnelles". Toulouse 3, 1997. http://www.theses.fr/1997TOU30162.
Texto completoBayle, Severine. "Modélisation statistique de données fonctionnelles environnementales : application à l'analyse de profils océanographiques". Thesis, Aix-Marseille, 2014. http://www.theses.fr/2014AIXM4016.
Texto completoTo study biogeochemical processes in the Southern Ocean, tags placed on elephant seals allowed to collect during 2009-2010 oceanographic variables profiles (Chlorophyll a (Chl a), temperature, salinity, light) in an area ranging from southern Kerguelen until the Antarctic continent. This thesis focuses on Chl a data as it is contained in photosynthetic organisms and these ones play an essential role in the oceanic carbon cycle. The infrequently collected vertical Chl a profiles don't provide a mapping of this variable in this area of the ocean. However, we have light profiles sampled more often. The aim of this thesis was then to develop a methodology for reconstructing indirectly Chl a profiles from light profiles, and that takes into account characteristics of this kind of data that naturally occur as functional data. For this, we adressed the profiles decomposition to rebuild or explanations on splines basis, as well as issues related adjustment. A functional linear model was used to predict Chl a profiles from light profiles derivatives. It was shown that the use of such a model provides a good quality of reconstruction to access high frequency variations of Chl a profiles at fine scale. Finally, a functional kriging interpolation predicted the Chl a concentration during night, as light measurements acquired at that time can't be exploited. In the future, the methodology aims to be applied to any type of functional data
Sauder, Cécile. "Méthodes d’analyse des données fonctionnelles appliquées aux dynamiques de croissance et de lactation chez les bovins laitiers". Rennes, Agrocampus Ouest, 2014. http://www.theses.fr/2014NSARB245.
Texto completoDos, Santos Raimundo N. Macedo. "Rationalisation de l'usage de la Classification Internationale des Brevets par l'analyse fonctionnelle pour répondre à la demande de l'information industrielle". Aix-Marseille 3, 1995. http://www.theses.fr/1995AIX30037.
Texto completoSaouessi, Melek. "Modélisation de la dynamique fonctionnelle de l'Acétylcholinestérase humainevue par diffusion quasi-élastique de neutrons". Thesis, Orléans, 2020. http://www.theses.fr/2020ORLE3065.
Texto completoIn the present work, quasi-elastic neutron scattering spectra (QENS) from human Acetylcholinesterase are analyzed to study changes in the internal dynamics of this enzyme upon the non-covalent binding of the ligand HuperZine A (HupA). The challenge is to see if the enzymatic activity is reflected in the short time relaxation dynamics extending over time scales of some ten picoseconds. Global motions of whole molecules can here be neglected since the experiments have been performed on hydrated powder samples. In order to account for the the self-similar character of protein dynamics, a multi-scale model has been used for the scattering functions, which fits simultaneously the elastic and quasi-elastic components of the the QENS spectrum. In contrast to a previous analysis of the experimental data, the present study reveals subtle but systematic changes of the internal dynamics of the enzyme in presence of the inhibitor. In a first analysis, which is performed in the time domain, the intermediate scattering functions are obtained by deconvolution of the experimental spectra from the instrumental resolution. The corresponding relaxation functions are here modeled by the ``stretched'' Mittag-Leffler function whose choice is justified, among others, by its asymptotic power law decay. In order to consolidate the results, a second analysis has been performed directly on the experimental spectra measured in the frequency domain, by using a semi-analytical approach for the convolution of the model spectrum with the instrumental resolution function. The results are consistent with those obtained by the preceding analysis in the time domain. They indicate in particular an increase of the motional amplitudes of the hydrogen atoms and a slowing-down of the internal dynamics of the enzyme. From a physical point of view, these findings are interpreted by employing the the concept of ``energy landscapes'' for the motions of the hydrogen atoms
Vrac, Mathieu. "Analyse et modélisation de données probabilistes par décomposition de mélange de copules et application à une base de données climatologiques". Phd thesis, Université Paris Dauphine - Paris IX, 2002. http://tel.archives-ouvertes.fr/tel-00002386.
Texto completoKandé, Yoba. "Spatial environmental analyses using functional approaches : application to multifrequential fisheries acoustics data". Electronic Thesis or Diss., Université de Lille (2022-....), 2023. http://www.theses.fr/2023ULILB047.
Texto completoThis thesis falls within the framework of functional statistics applied to the environment.Functional data analysis is a field of statistics that studies data in functional forms. It provides techniques for dimension reduction, supervised and unsupervised learning, while considering temporal and/or spatial dependencies in functional data. Such data types are increasingly available in various fields, particularly in environmental sciences, thanks to modern technologies. One example is the use of fisheries acoustics, which allows for obtaining spatial and temporal samples of marine organisms at various depths and spatial scales, without intrusiveness.In this thesis, we analyzed a set of multifrequency acoustic data collected by scientific echosounders to study the spatial structure of marine organism aggregations, commonly known as "Sound Scattering Layers." We examined the characteristics of these complex biological entities, such as thickness, relative density, and depth, in relation to their environment, represented at a fine scale using a towed multiparametric system. To do so, we initially applied standard multivariate statistical methods and then incorporated functional data analysis techniques, with or without the spatial dimension.In our initial exploratory analysis, Multivariate Functional Principal Component Analysis provided precise information about parameter variation along depths, unlike traditional Principal Component Analysis. In regression tasks, our analyses, whether incorporating spatial dimension or not, revealed interactions between "Sound Scattering Layers" descriptors and key environmental variables on a spatial scale. We noted significant differences between the "Sound Scattering Layers" in the northern and southern regions, as well as between those in coastal and offshore zones. It is worth noting that considering the spatial dimension improved modeling quality. These results highlight spatial-functional statistical analysis as a key method in ecological studies involving spatially complex objects.Beyond our specific case study, the application of functional data analysis offers promising prospects for a wide range of ecological studies involving massive spatial data
Henchiri, Yousri. "L'approche Support Vector Machines (SVM) pour le traitement des données fonctionnelles". Thesis, Montpellier 2, 2013. http://www.theses.fr/2013MON20187/document.
Texto completoFunctional Data Analysis is an important and dynamic area of statistics. It offers effective new tools and proposes new methodological and theoretical developments in the presence of functional type data (functions, curves, surfaces, ...). The work outlined in this dissertation provides a new contribution to the themes of statistical learning and quantile regression when data can be considered as functions. Special attention is devoted to use the Support Vector Machines (SVM) technique, which involves the notion of a Reproducing Kernel Hilbert Space. In this context, the main goal is to extend this nonparametric estimation technique to conditional models that take into account functional data. We investigated the theoretical aspects and practical attitude of the proposed and adapted technique to the following regression models.The first model is the conditional quantile functional model when the covariate takes its values in a bounded subspace of the functional space of infinite dimension, the response variable takes its values in a compact of the real line, and the observations are i.i.d.. The second model is the functional additive quantile regression model where the response variable depends on a vector of functional covariates. The last model is the conditional quantile functional model in the dependent functional data case. We obtained the weak consistency and a convergence rate of these estimators. Simulation studies are performed to evaluate the performance of the inference procedures. Applications to chemometrics, environmental and climatic data analysis are considered. The good behavior of the SVM estimator is thus highlighted
Ávila-Funes, José Alberto. "Relations entre le risque nutritionnel, les symptômes dépressifs et la capacité fonctionnelle chez la personne âgée de la communauté une analyse secondaire des données de l'étude NuAge". Mémoire, Université de Sherbrooke, 2007. http://savoirs.usherbrooke.ca/handle/11143/3912.
Texto completoMorvan, Marie. "Modèles de régression pour données fonctionnelles hétérogènes : application à la modélisation de données de spectrométrie dans le moyen infrarouge". Thesis, Rennes 1, 2019. http://www.theses.fr/2019REN1S097.
Texto completoIn many application fields, data corresponds to curves. This work focuses on the analysis of spectrometric curves, composed of hundreds of ordered variables that corresponds to the absorbance values measured for each wavenumber. In this context, an automatic statistical procedure is developped, that aims at building a prediction model taking into account the heterogeneity of the observed data. More precisely, a diagnosis tool is built in order to predict a metabolic disease from spectrometric curves measured on a population composed of patients with differents profile. The procedure allows to select portions of curves relevant for the prediction and to build a partition of the data and a sparse predictive model simultaneously, using a mixture of penalized regressions suitable for functional data. In order to study the complexity of the data and of the application case, a method to better understand and display the interactions between variables is built. This method is based on the study of the covariance matrix structure, and aims to highlight the dependencies between blocks of variables. A medical example is used to present the method and results, and allows the use of specific visualization tools
Sidibe, Ibrahima dit Bouran. "Analyse non-paramétrique des politiques de maintenance basée sur des données des durées de vie hétérogènes". Thesis, Université de Lorraine, 2014. http://www.theses.fr/2014LORR0081/document.
Texto completoIn the reliability literature, several researches works have been developed to deal with modeling, analysis and implementation of maintenance policies for equipments subject to random failures. The majority of these works are based on common assumptions among which the distribution function of the equipment lifetimes is assumed to be known. Furthermore, the equipment is assumed to experience only one operating environment. Such assumptions are indeed restrictive and may introduce a bias in the statistical analysis of the distribution function of the equipment lifetimes which in turn impacts optimization of maintenance policies. In the present research work, these two particular assumptions are relaxed. This relaxation allows to take into account of information related to conditions where the equipment is being operating and to focus on the statistical analysis of maintenance policies without using an intermediate parametric lifetimes distribution. The objective of this thesis consists then on the development of efficient statistical models and tools for managing the maintenance of equipments whose lifetimes distribution is unknown and defined through the heterogeneous lifetimes data. Indeed, this thesis proposes a framework for maintenance strategies determination, from lifetimes data acquisition toward the computation of optimal maintenance policies. The maintenance policies considered are assumed to be performed on used equipments. These later are conduct to experience their missions within different environments each of which is characterized by a degree of severity. In this context, a first mathematical model is proposed to evaluate costs induced by maintenance strategies. The analysis of these costs helps to establish the necessary and sufficient conditions to ensure the existence of an optimal age to perform the preventive maintenance. The maintenance costs are fully estimated by using the Kernel method. This estimation method is non-parametric and defined by two parameters, namely the kernel function and the smoothing parameter. The variability of maintenance costs estimator is deeply analyzed according to the smoothing parameter of Kernel method. From these analyses, it is shown that Kernel estimator method ensures a weak propagation of the errors due to the computation of smoothing parameter. In addition, several simulations are made to estimate the optimal replacement age. These simulations figure out that the numerical results from the Kernel method are close to the theoretical values with a weak coefficient of variation. Two probabilistic extensions of the first mathematical model are proposed and theoretically discussed. To deal with the problem of delayed preventive maintenance, an approach is proposed and discussed. The proposed approach allows evaluating the risk that could induce the delay taken to perform a preventive maintenance at the required optimal date. This approach is based on risk analysis conduct on the basis of a proposed risk function
Schwartz, Cédric. "Contribution à l'élaboration d'un espace commun de représentation pour l'analyse morpho-fonctionnelle du membre supérieur : application à l'articulation glénohumérale". Brest, 2009. http://www.theses.fr/2009BRES2016.
Texto completoRouch-Leroyer, Isabelle. "Aspects neuropsychologiques et fonctionnels de la phase précoce de démence : analyse des données de la cohorte PAQUID". Bordeaux 2, 2001. http://www.theses.fr/2001BOR28825.
Texto completoThe aim of this thesis relying on the PAQUID study was to better understand the natural history of neuropsychological impairments and their consequences on complex functional activities on preclinical stage of dementia, this in order to develop a better screening strategy of subjects at high risk of developing dementia. The first section of our work leaded to better identify the cognitive processes impaired in the phase preceding dementia. The results of the Articles 1 and 2 suggest that the controlled processes are deteriorated early in the preclinical phase of dementia. In the second part of this work, we studied the 4 Instrumental Activities of Daily Living and their relation with neuropsychological tests. The results of the articles 3 and 4 allowed to better understand the cognitive processes related to each of these IADL, and to better describe the evolution of each IADL in the 5 years preceding the clinical phase of dementia. In the last section, we determined for clinical practice a strategy of detection of subjects at high risk of developing dementia (article 5). Besides, we also defined a short-form of a depression scale which could be used in general medical practice ; depression is indeed a frequent differential diagnosis of dementia, but can also be a early sign of dementia (article 6)
Stransky, Jan. "Analyse sémantique de structures de données dynamiques avec application au cas particulier de langages LISPiens". Paris 11, 1988. http://www.theses.fr/1988PA112187.
Texto completoBenabderrahmane, Sidahmed. "Prise en compte des connaissances du domaine dans l'analyse transcriptomique : Similarité sémantique, classification fonctionnelle et profils flous : application au cancer colorectal". Phd thesis, Université Henri Poincaré - Nancy I, 2011. http://tel.archives-ouvertes.fr/tel-00653169.
Texto completoCarcenac, Manuel. "Structures de données arborescentes et évaluation paresseuse : une nouvelle approche pour la résolution des équations aux dérivées partielles". Toulouse, ENSAE, 1994. http://www.theses.fr/1994ESAE0008.
Texto completoChrysanthos, Nicolas. "Kernel methods for flight data monitoring". Thesis, Troyes, 2014. http://www.theses.fr/2014TROY0030/document.
Texto completoFlight Data Monitoring (FDM), is the process by which an airline routinely collects, processes, and analyses the data recorded in aircrafts with the goal of improving the overall safety or operational efficiency.The goal of this thesis is to investigate machine learning methods, and in particular kernel methods, for the detection of atypical flights that may present problems that cannot be found using traditional methods.Atypical flights may present safety of operational issues and thus need to be studied by an FDM expert.In the first part we propose a novel method for anomaly detection that is suited to the constraints of the field of FDM.We rely on a novel dimensionality reduction technique called kernel entropy component analysis to design a method which is both unsupervised and robust.In the second part we solve the most salient issue regarding the field of FDM, which is how the data is structured.Firstly, we extend the method to take into account parameters of diverse types such as continuous, discrete or angular.Secondly, we explore techniques to take into account the temporal aspect of flights and propose a new kernel in the family of dynamic time warping techniques, and demonstrate that it is faster to compute than competing techniques and is positive definite.We illustrate our approach with promising results on real world datasets from airlines TAP and Transavia comprising hundreds of flights
Gregorutti, Baptiste. "Forêts aléatoires et sélection de variables : analyse des données des enregistreurs de vol pour la sécurité aérienne". Thesis, Paris 6, 2015. http://www.theses.fr/2015PA066045/document.
Texto completoNew recommendations require airlines to establish a safety management strategy to keep reducing the number of accidents. The flight data recorders have to be systematically analysed in order to identify, measure and monitor the risk evolution. The aim of this thesis is to propose methodological tools to answer the issue of flight data analysis. Our work revolves around two statistical topics: variable selection in supervised learning and functional data analysis. The random forests are used as they implement importance measures which can be embedded in selection procedures. First, we study the permutation importance measure when the variables are correlated. This criterion is extended for groups of variables and a new selection algorithm for functional variables is introduced. These methods are applied to the risks of long landing and hard landing which are two important questions for airlines. Finally, we present the integration of the proposed methods in the software FlightScanner implemented by Safety Line. This new solution in the air transport helps safety managers to monitor the risks and identify the contributed factors
Jarry, Gabriel. "Analyse et détection des trajectoires d'approches atypiques des aéronefs à l'aide de l'analyse de données fonctionnelles et de l'apprentissage automatique". Thesis, Toulouse 3, 2020. http://www.theses.fr/2020TOU30284.
Texto completoImproving aviation safety generally involves identifying, detecting and managing undesirable events that can lead to final events with fatalities. Previous studies conducted by the French National Supervisory Authority have led to the identification of non-compliant approaches presenting deviation from standard procedures as undesirable events. This thesis aims to explore functional data analysis and machine learning techniques in order to provide algorithms for the detection and analysis of atypical trajectories in approach from ground side. Four research directions are being investigated. The first axis aims to develop a post-op analysis algorithm based on functional data analysis techniques and unsupervised learning for the detection of atypical behaviours in approach. The model is confronted with the analysis of airline flight safety offices, and is applied in the particular context of the COVID-19 crisis to illustrate its potential use while the global ATM system is facing a standstill. The second axis of research addresses the generation and extraction of information from radar data using new techniques such as Machine Learning. These methodologies allow to \mbox{improve} the understanding and the analysis of trajectories, for example in the case of the estimation of on-board parameters from radar parameters. The third axis proposes novel data manipulation and generation techniques using the functional data analysis framework. Finally, the fourth axis focuses on extending the post-operational algorithm into real time with the use of optimal control techniques, giving directions to new situation awareness alerting systems
Dwivedi, Ankit. "Functional analysis of genomic variations associated with emerging artemisinin resistant P. falciparum parasite populations and human infecting piroplasmida B. microti". Thesis, Montpellier, 2016. http://www.theses.fr/2016MONTT073/document.
Texto completoThe undergoing WHO Malaria elimination program is threatened by the emergenceand potential spread of the Plasmodium falciparum artemisinin resistant parasite.Recent reports have shown (a) SNPs in region of chromosome 13 to be understrong recent positive selection in Cambodia, (b) presence of P. falciparum parasiteresistant and sensitive subpopulations in Cambodia, (c) the evidence that mutationsin the Kelch propeller domain of the k13 gene are major determinants ofartemisinin resistance in Cambodian parasite population and (d) parasite subpopulations in Northern Cambodia near Thailand and Laos with mefloquine drugresistance and carrying R539T allele of the k13 gene.Identifying the genetic basis of resistance is important to monitor and control thetransmission of resistant parasites and to understand parasite metabolism for the development of new drugs. This thesis focuses on analysis of P. falciparum population structure in Cambodia and description of metabolic properties of these subpopulations and gene flow among them. This could help in identifying the genetic evidence associated to transmission and acquisition of artemisinin resistance over the country.First, a barcode approach was used to identify parasite subpopulations using smallnumber of loci. A mid-throughput PCR-LDR-FMA approach based on LUMINEXtechnology was used to screen for SNPs in 537 blood samples (2010 - 2011) from 16health centres in Cambodia. Based on successful typing of 282 samples, subpopulations were characterized along the borders of the country. Gene flow was described based on the gradient of alleles at the 11 loci in the barcode. The barcode successfully identifies recently emerging parasite subpopulations associated to artemisinin and mefloquine resistance.In the second approach, the parasite population structure was defined based on167 parasite NGS genomes (2008 - 2011) originating from four locations in Cambodia,recovered from the ENA database. Based on calling of 21257 SNPs, eight parasite subpopulations were described. Presence of admixture parasite subpopulation couldbe supporting artemisinin resistance transmission. Functional analysis based on significant genes validated similar background for resistant isolates and revealed PI3K pathway in resistant populations supporting acquisition of resistance by assisting the parasite in ring stage form.Our findings question the origin and the persistence of the P. falciparum subpopulations in Cambodia, provide evidence of gene flow among subpopulations anddescribe a model of artemisinin resistance acquisition.The variant calling approach was also implemented on the Babesia microti genome.This is a malaria like syndrome, and is endemic in the North-Eastern USA. Theobjective was to validate the taxonomic position of B. microti as out-group amongpiroplasmida and improve the functional genome annotation based on genetic variation, gene expression and protein antigenicity. We identified new proteins involved in parasite host interactions
Denecker, Thomas. "Bioinformatique et analyse de données multiomiques : principes et applications chez les levures pathogènes Candida glabrata et Candida albicans Functional networks of co-expressed genes to explore iron homeostasis processes in the pathogenic yeast Candida glabrata Efficient, quick and easy-to-use DNA replication timing analysis with START-R suite FAIR_Bioinfo: a turnkey training course and protocol for reproducible computational biology Label-free quantitative proteomics in Candida yeast species: technical and biological replicates to assess data reproducibility Rendre ses projets R plus accessibles grâce à Shiny Pixel: a content management platform for quantitative omics data Empowering the detection of ChIP-seq "basic peaks" (bPeaks) in small eukaryotic genomes with a web user-interactive interface A hypothesis-driven approach identifies CDK4 and CDK6 inhibitors as candidate drugs for treatments of adrenocortical carcinomas Characterization of the replication timing program of 6 human model cell lines". Thesis, université Paris-Saclay, 2020. http://www.theses.fr/2020UPASL010.
Texto completoBiological research is changing. First, studies are often based on quantitative experimental approaches. The analysis and the interpretation of the obtained results thus need computer science and statistics. Also, together with studies focused on isolated biological objects, high throughput experimental technologies allow to capture the functioning of biological systems (identification of components as well as the interactions between them). Very large amounts of data are also available in public databases, freely reusable to solve new open questions. Finally, the data in biological research are heterogeneous (digital data, texts, images, biological sequences, etc.) and stored on multiple supports (paper or digital). Thus, "data analysis" has gradually emerged as a key research issue, and in only ten years, the field of "Bioinformatics" has been significantly changed. Having a large amount of data to answer a biological question is often not the main challenge. The real challenge is the ability of researchers to convert the data into information and then into knowledge. In this context, several biological research projects were addressed in this thesis. The first concerns the study of iron homeostasis in the pathogenic yeast Candida glabrata. The second concerns the systematic investigation of post-translational modifications of proteins in the pathogenic yeast Candida albicans. In these two projects, omics data were used: transcriptomics and proteomics. Appropriate bioinformatics and analysis tools were developed, leading to the emergence of new research hypotheses. Particular and constant attention has also been paid to the question of data reproducibility and sharing of results with the scientific community
Brahimi, Lahcene. "Données de tests non fonctionnels de l'ombre à la lumière : une approche multidimensionnelle pour déployer une base de données". Thesis, Chasseneuil-du-Poitou, Ecole nationale supérieure de mécanique et d'aérotechnique, 2017. http://www.theses.fr/2017ESMA0009/document.
Texto completoChoosing appropriate database management systems (DBMS) and/or execution platforms for given database (DB) is complex and tends to be time- and effort-intensive since this choice has an important impact on the satisfaction of non-functional requirements (e.g., temporal performance or energy consumption). lndeed, a large number of tests have been performed for assessing the quality of developed DB. This assessment often involves metrics associated with non-functional requirement. That leads to a mine of tests covering all life-cycle phases of the DB's design. Tests and their environments are usually published in scientific articles or specific websites such as Transaction Processing Council (TPC). Therefore, this thesis bas taken a special interest to the capitalization and the reutilization of performed tests to reduce and mastery the complexity of the DBMS/platforms selection process. By analyzing the test accurately, we identify that tests concem: the data set, the execution platform, the addressed non-functional requirements, the used queries, etc. Thus, we propose an approach of conceptualization and persistence of all dimensions as well as the results of tests. Conseguently, this thesis leads to the following contributions. (1) The design model based on descriptive, prescriptive and ontological concepts to raise the different dimensions. (2) The development of a multidimensional repository to store the test environments and their results. (3) The development of a decision making methodology based on a recommender system for DBMS and platforms selection