Letteratura scientifica selezionata sul tema "Curation des données"
Cita una fonte nei formati APA, MLA, Chicago, Harvard e in molti altri stili
Consulta la lista di attuali articoli, libri, tesi, atti di convegni e altre fonti scientifiche attinenti al tema "Curation des données".
Accanto a ogni fonte nell'elenco di riferimenti c'è un pulsante "Aggiungi alla bibliografia". Premilo e genereremo automaticamente la citazione bibliografica dell'opera scelta nello stile citazionale di cui hai bisogno: APA, MLA, Harvard, Chicago, Vancouver ecc.
Puoi anche scaricare il testo completo della pubblicazione scientifica nel formato .pdf e leggere online l'abstract (il sommario) dell'opera se è presente nei metadati.
Articoli di riviste sul tema "Curation des données"
Le Provost, Aline. "La curation, un enjeu pour la gestion des données numériques". Y a-t-il un bibliothécaire dans la salle ?, n. 97 (1 aprile 2020): 20. http://dx.doi.org/10.35562/arabesques.1793.
Testo completoPlumejeaud-Perreau, Christine, Silvia Marzagalli, Pierre Niccolò Sofia e Robin de Mourat. "Curation en interdisciplinarité d’une base de données historique : de Navigocorpus à Portic, ou de la qualification de l’incertitude". Histoire & mesure XXXVIII, n. 2 (1 dicembre 2023): 39–72. http://dx.doi.org/10.4000/histoiremesure.19833.
Testo completoDesrosiers, Georges, Benoît Gaumer e Othmar Keel. "Contribution de l’École d’hygiène de l’Université de Montréal à un enseignement francophone de santé publique, 1946-1970". Revue d'histoire de l'Amérique française 47, n. 3 (26 agosto 2008): 323–47. http://dx.doi.org/10.7202/305244ar.
Testo completoSimon, M., J. Jouffroy, C. Lebihan, C. Gastaldi-Ménager, P. Tuppin e J. M. Sabaté. "Évaluation de la surveillance radiologique après traitement curatif du cancer colorectal non métastatique après chimiothérapie adjuvante à partir des données du Système national des données de santé". Revue d'Épidémiologie et de Santé Publique 68 (marzo 2020): S53. http://dx.doi.org/10.1016/j.respe.2020.01.121.
Testo completoWeerahandi, Ambereen, Shane Sinclair, Shelley Raffin-Bouchal, Linda Watson e Laurie Lemieux. "Myélome multiple et approche palliative des soins : étude théorique ancrée dans la pratique". Canadian Oncology Nursing Journal 34, n. 4 (novembre 2024): 550–61. http://dx.doi.org/10.5737/23688076344550.
Testo completoGONZÁLEZ VÁZQUEZ, B., J. M. CHOUBERT, E. PAUL e J. P. CANLER. "Comment éviter le colmatage irréversible des installations de biofiltration ?" Techniques Sciences Méthodes, n. 11 (20 novembre 2020): 71–86. http://dx.doi.org/10.36904/tsm/202011071.
Testo completoSimaga, Karamoko. "Facteurs déterminants de la faible utilisation des soins curatifs du centre de santé communautaire de Lassa en commune IV de Bamako en 2017". Mali Santé Publique 10, n. 1 (24 luglio 2020): 51–54. http://dx.doi.org/10.53318/msp.v10i1.1662.
Testo completoLaugier, C., G. Lang, V. Mary e É. Parent. "Modélisation d'une politique d'autocontrôle sur un réseau d'eau potable". Revue des sciences de l'eau 12, n. 1 (12 aprile 2005): 201–17. http://dx.doi.org/10.7202/705349ar.
Testo completoSow, O., NS Fetche, C. Vermare, B. Annabel, A. Anusca e L. Perrot. "C34: Résultat de la prise en charge des tumeurs stromales gastro-intestinales (GIST) : A propos de 6 cas au centre hospitalier de Vichy (France)". African Journal of Oncology 2, n. 1 Supplement (1 marzo 2022): S15. http://dx.doi.org/10.54266/ajo.2.1s.c34.myrhvjbqpy.
Testo completoBruneel, F., A. Raffetin, A. Roujansky, P. Corne, C. Tridon, J. F. Llitjos, B. Mourvillier, V. Laurent e S. Jauréguiberry. "Prise en charge du paludisme grave d’importation de l’adulte". Médecine Intensive Réanimation 27, n. 3 (maggio 2018): 228–38. http://dx.doi.org/10.3166/rea-2018-0051.
Testo completoTesi sul tema "Curation des données"
Cappuzzo, Riccardo. "Deep learning models for tabular data curation". Electronic Thesis or Diss., Sorbonne université, 2022. http://www.theses.fr/2022SORUS047.
Testo completoData retention is a pervasive and far-reaching topic, affecting everything from academia to industry. Current solutions rely on manual work by domain users, but they are not adequate. We are investigating how to apply deep learning to tabular data curation. We focus our work on developing unsupervised data curation systems and designing curation systems that intrinsically model categorical values in their raw form. We first implement EmbDI to generate embeddings for tabular data, and address the tasks of entity resolution and schema matching. We then turn to the data imputation problem using graphical neural networks in a multi-task learning framework called GRIMP
Scavo, Giuseppe. "Content curation and characterization in communities of a place". Thesis, Paris 6, 2016. http://www.theses.fr/2016PA066521/document.
Testo completoThe amount of information on the Internet today overwhelms most users. Discovering relevant information (e.g. news to read or videos to watch) is time-consuming and tedious and yet it is part of the daily job of at least 80% of the employees in North America. Several information filtering systems for the web can ease this task for users. Examples fall into families such as Social Networks, Social Rating Systems and Social Bookmarking Systems. All these systems require user engagement to work (e.g. submission or rating of content). They work well in an Internet-wide community but suffer in the case smaller communities. Indeed, in smaller communities, the users' input is more scarce. We focus on communities of a place that are communities that group people who live, work or study in the same area. Examples of communities of a place are: (i) the students of a campus, (ii) the people living in a neighborhood or (iii) researchers working in the same site. Anecdotally we know that only 0.3% of workers contribute daily to their corporate social network. This information shows that there is a lack of user engagement in communities of a place
Scavo, Giuseppe. "Content curation and characterization in communities of a place". Electronic Thesis or Diss., Paris 6, 2016. http://www.theses.fr/2016PA066521.
Testo completoThe amount of information on the Internet today overwhelms most users. Discovering relevant information (e.g. news to read or videos to watch) is time-consuming and tedious and yet it is part of the daily job of at least 80% of the employees in North America. Several information filtering systems for the web can ease this task for users. Examples fall into families such as Social Networks, Social Rating Systems and Social Bookmarking Systems. All these systems require user engagement to work (e.g. submission or rating of content). They work well in an Internet-wide community but suffer in the case smaller communities. Indeed, in smaller communities, the users' input is more scarce. We focus on communities of a place that are communities that group people who live, work or study in the same area. Examples of communities of a place are: (i) the students of a campus, (ii) the people living in a neighborhood or (iii) researchers working in the same site. Anecdotally we know that only 0.3% of workers contribute daily to their corporate social network. This information shows that there is a lack of user engagement in communities of a place
Kemp, Gavin. "CURARE : curating and managing big data collections on the cloud". Thesis, Lyon, 2018. http://www.theses.fr/2018LYSE1179/document.
Testo completoThe emergence of new platforms for decentralized data creation, such as sensor and mobile platforms and the increasing availability of open data on the Web, is adding to the increase in the number of data sources inside organizations and brings an unprecedented Big Data to be explored. The notion of data curation has emerged to refer to the maintenance of data collections and the preparation and integration of datasets, combining them to perform analytics. Curation tasks include extracting explicit and implicit meta-data; semantic metadata matching and enrichment to add quality to the data. Next generation data management engines should promote techniques with a new philosophy to cope with the deluge of data. They should aid the user in understanding the data collections’ content and provide guidance to explore data. A scientist can stepwise explore into data collections and stop when the content and quality reach a satisfaction point. Our work adopts this philosophy and the main contribution is a data collections’ curation approach and exploration environment named CURARE. CURARE is a service-based system for curating and exploring Big Data. CURARE implements a data collection model that we propose, used for representing their content in terms of structural and statistical meta-data organised under the concept of view. A view is a data structure that provides an aggregated perspective of the content of a data collection and its several associated releases. CURARE provides tools focused on computing and extracting views using data analytics methods and also functions for exploring (querying) meta-data. Exploiting Big Data requires a substantial number of decisions to be performed by data analysts to determine which is the best way to store, share and process data collections to get the maximum benefit and knowledge from them. Instead of manually exploring data collections, CURARE provides tools integrated in an environment for assisting data analysts determining which are the best collections that can be used for achieving an analytics objective. We implemented CURARE and explained how to deploy it on the cloud using data science services on top of which CURARE services are plugged. We have conducted experiments to measure the cost of computing views based on datasets of Grand Lyon and Twitter to provide insight about the interest of our data curation approach and environment
Oshurko, Ievgeniia. "Knowledge representation and curation in hierarchies of graphs". Thesis, Lyon, 2020. http://www.theses.fr/2020LYSEN024.
Testo completoThe task of automatically extracting insights or building computational models fromknowledge on complex systems greatly relies on the choice of appropriate representation.This work makes an effort towards building a framework suitable for representation offragmented knowledge on complex systems and its semi-automated curation---continuouscollation, integration, annotation and revision.We propose a knowledge representation system based on hierarchies of graphs relatedwith graph homomorphisms. Individual graphs situated in such hierarchies representdistinct fragments of knowledge and the homomorphisms allow relating these fragments.Their graphical structure can be used efficiently to express entities and their relations. Wefocus on the design of mathematical mechanisms, based on algebraic approaches to graphrewriting, for transformation of individual graphs in hierarchies that maintain consistentrelations between them. Such mechanisms provide a transparent audit trail, as well as aninfrastructure for maintaining multiple versions of knowledge.We describe how the developed theory can be used for building schema-aware graphdatabases that provide schema-data co-evolution capabilities. The proposed knowledgerepresentation framework is used to build the KAMI (Knowledge Aggregation and ModelInstantiation) framework for curation of cellular signalling knowledge. The frameworkallows for semi-automated aggregation of individual facts on protein-protein interactionsinto knowledge corpora, reuse of this knowledge for instantiation of signalling models indifferent cellular contexts and generation of executable rule-based models
Ahmadi, Naser. "A framework for the continuous curation of a knowledge base system". Electronic Thesis or Diss., Sorbonne université, 2021. http://www.theses.fr/2021SORUS320.
Testo completoEntity-centric knowledge graphs (KGs) are becoming increasingly popular for gathering information about entities. The schemas of KGs are semantically rich, with many different types and predicates to define the entities and their relationships. These KGs contain knowledge that requires understanding of the KG’s structure and patterns to be exploited. Their rich data structure can express entities with semantic types and relationships, oftentimes domain-specific, that must be made explicit and understood to get the most out of the data. Although different applications can benefit from such rich structure, this comes at a price. A significant challenge with KGs is the quality of their data. Without high-quality data, the applications cannot use the KG. However, as a result of the automatic creation and update of KGs, there are a lot of noisy and inconsistent data in them and, because of the large number of triples in a KG, manual validation is impossible. In this thesis, we present different tools that can be utilized in the process of continuous creation and curation of KGs. We first present an approach designed to create a KG in the accounting field by matching entities. We then introduce methods for the continuous curation of KGs. We present an algorithm for conditional rule mining and apply it on large graphs. Next, we describe RuleHub, an extensible corpus of rules for public KGs which provides functionalities for the archival and the retrieval of rules. We also report methods for using logical rules in two different applications: teaching soft rules to pre-trained language models (RuleBert) and explainable fact checking (ExpClaim)
Ogun, Sewade. "Generating diverse synthetic data for ASR training data augmentation". Electronic Thesis or Diss., Université de Lorraine, 2024. http://www.theses.fr/2024LORR0116.
Testo completoIn the last two decades, the error rate of automatic speech recognition (ASR) systems has drastically dropped, making them more useful in real-world applications. This improvement can be attributed to several factors including new architectures using deep learning techniques, new training algorithms, large and diverse training datasets, and data augmentation. In particular, the large-scale training datasets have been pivotal to learning robust speech representations for ASR. Their large size allows them to effectively cover the inherent diversity in speech, in terms of speaker voice, speaking rate, pitch, reverberation, and noise. However, the size and diversity of datasets typically found in high-resourced languages are not available in medium- and low-resourced languages and in domains with specialised vocabulary like the medical domain. Therefore, the popular method to increase dataset diversity is through data augmentation. With the recent increase in the naturalness and quality of synthetic data that can be generated by text-to-speech (TTS) and voice conversion (VC) systems, these systems have also become viable options for ASR data augmentation. However, several problems limit their application. First, TTS/VC systems require high-quality speech data for training. Hence, we develop a method of dataset curation from an ASR-designed corpus for training a TTS system. This method leverages the increasing accuracy of deep-learning-based, non-intrusive quality estimators to filter high-quality samples. We explore filtering the ASR dataset at different thresholds to balance the size of the dataset, number of speakers, and quality. With this method, we create a high-quality multi-speaker dataset which is comparable to LibriTTS in quality. Second, the data generation process needs to be controllable to generate diverse TTS/VC data with specific attributes. Previous TTS/VC systems either condition the system on the speaker embedding alone or use discriminative models to learn the speech variabilities. In our approach, we design an improved flow-based architecture that learns the distribution of different speech variables. We find that our modifications significantly increase the diversity and naturalness of the generated utterances over a GlowTTS baseline, while being controllable. Lastly, we evaluated the significance of generating diverse TTS and VC data for augmenting ASR training data. As opposed to naively generating the TTS/VC data, we independently examined different approaches such as sentence selection methods and increasing the diversity of speakers, phoneme duration, and pitch contours, in addition to systematically increasing the environmental conditions of the generated data. Our results show that TTS/VC augmentation holds promise in increasing ASR performance in low- and medium-data regimes. In conclusion, our experiments provide insight into the variabilities that are particularly important for ASR, and reveal a systematic approach to ASR data augmentation using synthetic data
Grégoire, Matthieu. "Optimisation de l'utilisation des céphalosporines en curatif et préventif d'infections bactériennes à partir de données PK/PD, de la pharmacocinétique de population, de simulations et d'une analyse du microbiote intestinal". Thesis, Nantes, 2018. http://www.theses.fr/2018NANT4077/document.
Testo completoCephalosporins, discovered in the middle of the 20th century, belong to the beta-lactam class and act by inhibiting the synthesis of bacterial peptidoglycan. Many factors can affect their effectiveness but also their adverse effects. This thesis work articulated in 3 parts was interested in the pharmacokinetics of these molecules but also their pharmacodynamics targets within the digestive microbiota. The first part dealt with the antibiotic prophylaxis of bariatric surgery with cefazolin. This population study has demonstrated the superiority of the French recommendations on American recommendations and to propose an innovative administration plan in continuous infusion combining practicality of use and high level of efficiency. The second part dealt with the use of high dose ceftriaxone in the treatment of meningeal infections. This population study concluded that it is useful to adapt the administration plan to the patient's renal function with once daily administration in case of renal insufficiency compared with twice in case of normorenal function. The last part of these work demonstrated in a mouse model that ceftriaxone selected more extensivespectrum beta-lactamase-producing enterobacteriaceae than cefotaxime and that the metagenomics profile selected by the two antibiotics explained this difference. All of this work fits into the current dynamics of personalization of antibiotic therapies for each patient and optimizes the use of cephalosporins
Libri sul tema "Curation des données"
Data Stewardship for Open Science: Implementing FAIR Principles. Taylor & Francis Group, 2018.
Cerca il testo completoMons, Barend. Data Stewardship for Open Science: Implementing FAIR Principles. Taylor & Francis Group, 2018.
Cerca il testo completoMons, Barend. Data Stewardship for Open Science: Implementing FAIR Principles. Taylor & Francis Group, 2018.
Cerca il testo completoMons, Barend. Data Stewardship for Open Science. Taylor & Francis Group, 2021.
Cerca il testo completoMons, Barend. Data Stewardship for Open Science: Implementing FAIR Principles. Taylor & Francis Group, 2018.
Cerca il testo completoMons, Barend. Data Stewardship for Open Science: Implementing FAIR Principles. Taylor & Francis Group, 2018.
Cerca il testo completoCapitoli di libri sul tema "Curation des données"
Sawchuk, Sandra, Louise Gillis e Lachlan MacLeod. "Soutenir la recherche reproductible avec la curation active de données". In La gestion des données de recherche dans le contexte canadien: un guide pour la pratique et l'apprentissage. Western University, Western Libraries, 2023. http://dx.doi.org/10.5206/blaz5966.
Testo completo