Dissertations / Theses on the topic 'Multitask learning'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 28 dissertations / theses for your research on the topic 'Multitask learning.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Patel, Vatsa Sanjay. "Masked Face Analysis via Multitask Deep Learning." University of Dayton / OhioLINK, 2021. http://rave.ohiolink.edu/etdc/view?acc_num=dayton1619637677725646.
Full textRomera, Paredes B. "Multitask and transfer learning for multi-aspect data." Thesis, University College London (University of London), 2014. http://discovery.ucl.ac.uk/1457869/.
Full textSettipalli, Venkata Sai Sukesh, and Naga Manendra Kumar Dasireddy. "Reducing Unintended bias in Text Classification using Multitask learning." Thesis, Blekinge Tekniska Högskola, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:bth-21174.
Full textYu, Qingtian. "Deep Learning-Enabled Multitask System for Exercise Recognition and Counting." Thesis, Université d'Ottawa / University of Ottawa, 2021. http://hdl.handle.net/10393/42686.
Full textNina, Oliver A. Nina. "A Multitask Learning Encoder-N-Decoder Framework for Movie and Video Description." The Ohio State University, 2018. http://rave.ohiolink.edu/etdc/view?acc_num=osu1531996548147165.
Full textLin, Yu-Kai, Hsinchun Chen, Randall A. Brown, Shu-Hsing Li, and Hung-Jen Yang. "HEALTHCARE PREDICTIVE ANALYTICS FOR RISK PROFILING IN CHRONIC CARE: A BAYESIAN MULTITASK LEARNING APPROACH." SOC INFORM MANAGE-MIS RES CENT, 2017. http://hdl.handle.net/10150/625248.
Full textVALSECCHI, CECILE. "Advancing the prediction of Nuclear Receptor modulators through machine learning methods." Doctoral thesis, Università degli Studi di Milano-Bicocca, 2022. http://hdl.handle.net/10281/356289.
Full textNuclear receptors are transcription factors involved in processes critical to human health and are a relevant target for toxicological risk assessment and the drug discovery process. Computational models can be a useful tool (i) to prioritize chemicals that can mimic natural hormones and thus be endocrine disruptors and (ii) to identify new possible lead for drug discovery. Therefore, the main goal of this project is to study potential interactions between chemicals and nuclear receptors, with the dual purpose of developing in silico tools to search for new modulators and to identify possible endocrine disrupting chemicals. After creating an exhaustive collection of nuclear receptor modulators, we applied machine learning methods to fill the data gap and prioritize modulators by building predictive models. In particular, modeling strategies included multi-tasking machine learning algorithms to investigate the complex relationships between chemicals and multiple nuclear receptors.
Zylich, Brian Matthew. "Training Noise-Robust Spoken Phrase Detectors with Scarce and Private Data: An Application to Classroom Observation Videos." Digital WPI, 2019. https://digitalcommons.wpi.edu/etd-theses/1289.
Full textBao, Guoqing. "End-to-End Machine Learning Models for Multimodal Medical Data Analysis." Thesis, The University of Sydney, 2022. https://hdl.handle.net/2123/28153.
Full textWidmer, Christian Verfasser], Klaus-Robert [Akademischer Betreuer] [Müller, Gunnar [Akademischer Betreuer] Rätsch, and Klaus [Akademischer Betreuer] Obermayer. "Regularization-based multitask learning with applications in computational biology / Christian Widmer. Gutachter: Klaus-Robert Müller ; Gunnar Rätsch ; Klaus Obermayer. Betreuer: Klaus-Robert Müller ; Gunnar Rätsch." Berlin : Technische Universität Berlin, 2014. http://d-nb.info/1068856017/34.
Full textJohnson, Travis Steele. "Integrative approaches to single cell RNA sequencing analysis." The Ohio State University, 2020. http://rave.ohiolink.edu/etdc/view?acc_num=osu1586960661272666.
Full textCarbonera, Luvizon Diogo. "Apprentissage automatique pour la reconnaissance d'action humaine et l'estimation de pose à partir de l'information 3D." Thesis, Cergy-Pontoise, 2019. http://www.theses.fr/2019CERG1015.
Full text3D human action recognition is a challenging task due to the complexity ofhuman movements and to the variety on poses and actions performed by distinctsubjects. Recent technologies based on depth sensors can provide 3D humanskeletons with low computational cost, which is an useful information foraction recognition. However, such low cost sensors are restricted tocontrolled environment and frequently output noisy data. Meanwhile,convolutional neural networks (CNN) have shown significant improvements onboth action recognition and 3D human pose estimation from RGB images. Despitebeing closely related problems, the two tasks are frequently handled separatedin the literature. In this work, we analyze the problem of 3D human actionrecognition in two scenarios: first, we explore spatial and temporalfeatures from human skeletons, which are aggregated by a shallow metriclearning approach. In the second scenario, we not only show that precise 3Dposes are beneficial to action recognition, but also that both tasks can beefficiently performed by a single deep neural network and stillachieves state-of-the-art results. Additionally, wedemonstrate that optimization from end-to-end using poses as an intermediateconstraint leads to significant higher accuracy on the action task thanseparated learning. Finally, we propose a new scalable architecture forreal-time 3D pose estimation and action recognition simultaneously, whichoffers a range of performance vs speed trade-off with a single multimodal andmultitask training procedure
Bellón, Molina Víctor. "Prédiction personalisée des effets secondaires indésirables de médicaments." Thesis, Paris Sciences et Lettres (ComUE), 2017. http://www.theses.fr/2017PSLEM023/document.
Full textAdverse drug reaction (ADR) is a serious concern that has important health and economical repercussions. Between 1.9%-2.3% of the hospitalized patients suffer from ADR, and the annual cost of ADR have been estimated to be of 400 million euros in Germany alone. Furthermore, ADRs can cause the withdrawal of a drug from the market, which can cause up to millions of dollars of losses to the pharmaceutical industry.Multiple studies suggest that genetic factors may play a role in the response of the patients to their treatment. This covers not only the response in terms of the intended main effect, but also % according toin terms of potential side effects. The complexity of predicting drug response suggests that machine learning could bring new tools and techniques for understanding ADR.In this doctoral thesis, we study different problems related to drug response prediction, based on the genetic characteristics of patients.We frame them through multitask machine learning frameworks, which combine all data available for related problems in order to solve them at the same time.We propose a novel model for multitask linear prediction that uses task descriptors to select relevant features and make predictions with better performance as state-of-the-art algorithms. Finally, we study strategies for increasing the stability of the selected features, in order to improve interpretability for biological applications
Coavoux, Maximin. "Discontinuous constituency parsing of morphologically rich languages." Thesis, Sorbonne Paris Cité, 2017. http://www.theses.fr/2017USPCC032.
Full textSyntactic parsing consists in assigning syntactic trees to sentences in natural language. Syntactic parsing of non-configurational languages, or languages with a rich inflectional morphology, raises specific problems. These languages suffer more from lexical data sparsity and exhibit word order variation phenomena more frequently. For these languages, exploiting information about the internal structure of word forms is crucial for accurate parsing. This dissertation investigates transition-based methods for robust discontinuous constituency parsing. First of all, we propose a multitask learning neural architecture that performs joint parsing and morphological analysis. Then, we introduce a new transition system that is able to predict discontinuous constituency trees, i.e.\ syntactic structures that can be seen as derivations of mildly context-sensitive grammars, such as LCFRS. Finally, we investigate the question of lexicalization in syntactic parsing. Some syntactic parsers are based on the hypothesis that constituent are organized around a lexical head and that modelling bilexical dependencies is essential to solve ambiguities. We introduce an unlexicalized transition system for discontinuous constituency parsing and a scoring model based on constituent boundaries. The resulting parser is simpler than lexicalized parser and achieves better results in both discontinuous and projective constituency parsing
Caye, Daudt Rodrigo. "Convolutional neural networks for change analysis in earth observation images with noisy labels and domain shifts." Electronic Thesis or Diss., Institut polytechnique de Paris, 2020. http://www.theses.fr/2020IPPAT033.
Full textThe analysis of satellite and aerial Earth observation images allows us to obtain precise information over large areas. A multitemporal analysis of such images is necessary to understand the evolution of such areas. In this thesis, convolutional neural networks are used to detect and understand changes using remote sensing images from various sources in supervised and weakly supervised settings. Siamese architectures are used to compare coregistered image pairs and to identify changed pixels. The proposed method is then extended into a multitask network architecture that is used to detect changes and perform land cover mapping simultaneously, which permits a semantic understanding of the detected changes. Then, classification filtering and a novel guided anisotropic diffusion algorithm are used to reduce the effect of biased label noise, which is a concern for automatically generated large-scale datasets. Weakly supervised learning is also achieved to perform pixel-level change detection using only image-level supervision through the usage of class activation maps and a novel spatial attention layer. Finally, a domain adaptation method based on adversarial training is proposed, which succeeds in projecting images from different domains into a common latent space where a given task can be performed. This method is tested not only for domain adaptation for change detection, but also for image classification and semantic segmentation, which proves its versatility
Cherifa-Luron, Ményssa. "Prédiction des épisodes d'hypotension à partir de données longitudinales à haute fréquence recueillies auprès de patients en soins intensifs." Electronic Thesis or Diss., Université Paris Cité, 2021. https://wo.app.u-paris.fr/cgi-bin/WebObjects/TheseWeb.woa/wa/show?t=8076&f=67992.
Full textThe digital revolution in healthcare, reflected in both the centralization of and access to extensive medical databases and the considerable advances in artificial intelligence (AI), has created new opportunities for data science applied to medicine. Putting the patient at the heart of the health care system, developing these new technologies guarantees a more personalized medicine by identifying more predictive factors and individual prognosis. This thesis work is entirely in line with the concept of personalized medicine. More precisely, it is an example of medical AI's development and concrete application to predict hypotension and, more broadly, of states of shock, frequent pathologies affecting more than one-third of patients hospitalized in intensive care. Indeed, shock, defined as a failure of the circulatory system leading to an inadequacy between the supply and the peripheral tissue needs in oxygen, is considered a diagnostic and therapeutic emergency. Therefore, anticipating hypotension, one of its main symptoms, can be extremely useful to make better therapeutic decisions and, in some cases, prevent the onset of organ failure from the beginning by appropriately adjusting the therapy. In addition, the ability to predict future deterioration can be beneficial to assist in the proactive assignment of care teams within hospital departments. The first part of this thesis work focused on using and applying a machine learning-based ensemble algorithm, the Super Learner (SL), to predict the occurrence of a hypotensive episode 10 minutes or more in advance in patients hospitalized in the ICU. This work demonstrated that physiological signals could be integrated into predictive models when dealing with massive data without requiring complex pre-processing methods to be exploited. Also, the SL was far superior to each of the algorithms included in its library, as evidenced by its lower errors and good values of sensitivity and specificity values during its internal and external evaluation. Then, to mimic the way that clinicians are trained to jointly analyze the evolution of mean arterial pressure (MAP) and heart rate (HR) given their close physiological interdependence, we developed a deep learning model, the Physiological Deep Learner (PDL), to predict MAP and HR simultaneously. We highlighted that the use of a multitasking algorithm outperformed the prediction performance of single-tasking algorithms. Indeed, compared to a more traditional approach, our PDL achieved better performance, exhibiting a better calibration profile and fewer errors. In addition, the PDL was able to predict with high accuracy the occurrence or non-occurrence of a hypotensive episode up to 60 minutes in advance
Tafforeau, Jérémie. "Modèle joint pour le traitement automatique de la langue : perspectives au travers des réseaux de neurones." Thesis, Aix-Marseille, 2017. http://www.theses.fr/2017AIXM0430/document.
Full textNLP researchers has identified different levels of linguistic analysis. This lead to a hierarchical division of the various tasks performed in order to analyze a text statement. The traditional approach considers task-specific models which are subsequently arranged in cascade within processing chains (pipelines). This approach has a number of limitations: the empirical selection of models features, the errors accumulation in the pipeline and the lack of robusteness to domain changes. These limitations lead to particularly high performance losses in the case of non-canonical language with limited data available such as transcriptions of conversations over phone. Disfluencies and speech-specific syntactic schemes, as well as transcription errors in automatic speech recognition systems, lead to a significant drop of performances. It is therefore necessary to develop robust and flexible systems. We intend to perform a syntactic and semantic analysis using a deep neural network multitask model while taking into account the variations of domain and/or language registers within the data
Donini, Michele. "Exploiting the structure of feature spaces in kernel learning." Doctoral thesis, Università degli studi di Padova, 2016. http://hdl.handle.net/11577/3424320.
Full textIl problema dell'apprendimento della reppresentazione ottima per un task specifico è divenuto un importante argomento nella comunità dell'apprendimento automatico. In questo campo, le architetture di tipo deep sono attualmente le più avanzate tra i possibili algoritmi di apprendimento automatico. Esse generano modelli che utilizzando alti gradi di astrazione e sono in grado di scoprire strutture complicate in dataset anche molto ampi. I kernel e le Deep Neural Network (DNN) sono i principali metodi per apprendere una rappresentazione di un problema in modo ricco (cioè deep). Le DNN sfruttano il famoso algoritmo di back-propagation migliorando le prestazioni degli algoritmi allo stato dell'arte in diverse applicazioni reali, come per esempio il riconoscimento vocale, il riconoscimento di oggetti o l'elaborazione di segnali. Tuttavia, gli algoritmi DNN hanno anche delle problematiche, ereditate dalle classiche reti neurali e derivanti dal fatto che esse non sono completamente comprese teoricamente. I problemi principali sono: la complessità della struttura della soluzione, la non chiara separazione tra la fase di apprendimento della rappresentazione ottimale e del modello, i lunghi tempi di training e la convergenza a soluzioni ottime solo localmente (a causa dei minimi locali e del vanishing gradient). Per questi motivi, in questa tesi, proponiamo nuove idee per ottenere rapprensetazioni ottimali sfruttando la teoria dei kernel. I metodi kernel hanno un elegante framework che separa l'algoritmo di apprendimento dalla rappresentazione delle informazioni. D'altro canto, anche i kernel hanno alcune debolezze, per esempio essi non scalano e, per come sono solitamente utilizzati, portano con loro una rappresentazione poco ricca (shallow). In questa tesi, proponiamo nuovi risultati teorici e nuovi algoritmi per cercare di risolvere questi problemi e rendere l'apprendimento dei kernel in grado di generare rappresentazioni più ricche (deeper) ed essere più scalabili. Verrà quindi presentato un nuovo algoritmo in grado di combinare migliaia di kernel deboli con un basso costo computazionale e di memoria. Questa procedura, chiamata EasyMKL, supera i metodi attualmente allo stato dell'arte combinando frammenti di informazione e creando in questo modo il kernel ottimale per uno specifico task. Perseguendo l'idea di creare una famiglia di kernel deboli ottimale, abbiamo creato una nuova misura di valutazione dell'espressività dei kernel, chiamata Spectral Complexity. Sfruttando questa misura siamo in grado di generare famiglia di kernel deboli con una struttura gerarchica nelle feature definendo una nuova proprietà riguardante la monotonicità della Spectral Complexity. Mostriamo la qualità dei nostri kernel deboli sviluppando una nuova metologia per il Multiple Kernel Learning (MKL). In primo luogo, siamo in grado di creare una famiglia ottimale di kernel deboli sfruttando la proprietà di monotinicità della Spectral Complexity; combiniamo quindi la famiglia di kernel deboli ottimale sfruttando EasyMKL e ottenendo un nuovo kernel, specifico per il singolo task; infine, siamo in grado di generare un modello sfruttando il nuovo kernel e kernel machine (per esempio una SVM). Inoltre, in questa tesi sottolineiamo le connessioni tra Distance Metric Learning, Feature Larning e Kernel Learning proponendo un metodo per apprendere la famiglia ottimale di kernel deboli per un algoritmo MKL in un contesto differente, in cui la regola di combinazione è il prodotto componente per componente delle matrici kernel. Questo algoritmo è in grado di generare i parametri ottimali per un kernel RBF anisotropico. Di conseguenza, si crea un naturale collegamento tra il Feature Weighting, le combinazioni dei kernel e l'apprendimento della metrica ottimale per il task. Infine, l'importanza della rappresentazione è anche presa in considerazione in tre task reali, dove affrontiamo differenti problematiche, tra cui: il rumore nei dati, le applicazioni in tempo reale e le grandi moli di dati (Big Data)
Kovac, Krunoslav. "Multitask learning for Bayesian neural networks." 2005. http://link.library.utoronto.ca/eir/EIRdetail.cfm?Resources__ID=370149&T=F.
Full textTSAI, YAO-CHANG, and 蔡耀樟. "Applications of Multitask Learning to Human Activity Recognition." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/5h9828.
Full text國立臺北科技大學
電機工程系
107
In previous studies, most of them treat human activity recognition as a single classification problem. They train a common classifier for all subjects without considering personal activity pattern. Since different person has his or her own activity pattern, a common classifier is not applicable for all subjects. Due to this problem, some researchers do the research on personalized human activity recognition, which train personalized model for each subject. However, with doing personalized human activity recognition comes a new problem which is data sparseness. Sometimes the amount of data is inadequate for training a good classifier. Although other researchers proposed a hybrid model which can train with current subject and other subject, there still has few methods that can enable model to train efficiently with two subjects. To solve this problem, this thesis proposed a method applying multitask learning to personalized human activity recognition. This method uses multitask neural network to train with two subject efficiently. It divides two subjects into main subject and auxiliary subject. The main purpose of this method is to improve the accuracy of main subject through training with auxiliary subject. With training with suitable auxiliary subject, Main subject can have at most 11.38 % of accuracy improvement.
Koyejo, Oluwasanmi Oluseye. "Constrained relative entropy minimization with applications to multitask learning." 2013. http://hdl.handle.net/2152/20793.
Full texttext
Martí, i. Rabadán Miquel. "Multitask Deep Learning models for real-time deployment in embedded systems." Thesis, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673.
Full textMartí, Rabadán Miquel. "Multitask Deep Learning models for real-time deployment in embedded systems." Thesis, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-216673.
Full textHsieh, Yeu-Chen, and 謝雨辰. "Forecasting Solar Power Production by Heterogeneous Data Streams and Multitask Learning." Thesis, 2017. http://ndltd.ncl.edu.tw/handle/373zyg.
Full text國立臺灣大學
資訊工程學研究所
105
In recent years, solar energy has become a significant field of research across the globe because of the growing demand for renewable energy and its promising potential in sustainability aspects. Therefore, with the increasing integration of photovoltaic (PV) systems to the electrical grid, reliable prediction of the expected production output of PV systems is gaining importance as a basis for management and operation strategies. However, power production of PV systems is highly variable due to its dependence on solar radiance, meteorological conditions and other external factors. Currently, most studies are unable to predict the solar power production at multiple future time points and never analyze the influence of the possible factors on the solar power production but simply feed all features without considering their properties. Therefore, this paper provides a holistic comparison among all factors (e.g., solar radiance, meteorology) affecting the solar power production and also, a multimodal and end-to-end neural networks model is proposed to simultaneously predict the solar power production at multiple future time points by multitask learning. Finally, with multitask learning on heterogeneous (and multimodal) data, the proposed method achieves the lowest error rates (11.83\% for 5-minute prediction) compared to the state of the art.
"Novel Deep Learning Models for Medical Imaging Analysis." Doctoral diss., 2019. http://hdl.handle.net/2286/R.I.55510.
Full textDissertation/Thesis
Doctoral Dissertation Industrial Engineering 2019
Reid, Mark Darren Computer Science & Engineering Faculty of Engineering UNSW. "DEFT guessing: using inductive transfer to improve rule evaluation from limited data." 2007. http://handle.unsw.edu.au/1959.4/40513.
Full textHwang, Sung Ju. "Discriminative object categorization with external semantic knowledge." 2013. http://hdl.handle.net/2152/21320.
Full texttext
Yaghoubi, Ehsan. "Soft Biometric Analysis: MultiPerson and RealTime Pedestrian Attribute Recognition in Crowded Urban Environments." Doctoral thesis, 2021. http://hdl.handle.net/10400.6/12081.
Full textThis thesis was prepared at the University of Beria Interior, IT Instituto de Telecomunicações, Soft Computing and Image Analysis Laboratory (SOCIA Lab), Covilhã Delegation, and was submitted to the University of Beira Interior for defense in a public examination session.