Littérature scientifique sur le sujet « End-to-end multimodal modelling »
Créez une référence correcte selon les styles APA, MLA, Chicago, Harvard et plusieurs autres
Sommaire
Consultez les listes thématiques d’articles de revues, de livres, de thèses, de rapports de conférences et d’autres sources académiques sur le sujet « End-to-end multimodal modelling ».
À côté de chaque source dans la liste de références il y a un bouton « Ajouter à la bibliographie ». Cliquez sur ce bouton, et nous générerons automatiquement la référence bibliographique pour la source choisie selon votre style de citation préféré : APA, MLA, Harvard, Vancouver, Chicago, etc.
Vous pouvez aussi télécharger le texte intégral de la publication scolaire au format pdf et consulter son résumé en ligne lorsque ces informations sont inclues dans les métadonnées.
Articles de revues sur le sujet "End-to-end multimodal modelling"
Bazaras, Darius, et Ramūnas Palšaitis. « MULTIMODAL APPROACH TO THE INTERNATIONAL TRANSIT TRANSPORT ». TRANSPORT 18, no 6 (31 décembre 2003) : 248–54. http://dx.doi.org/10.3846/16483840.2003.10414106.
Texte intégralCrespi, Pietro, Alberto Franchi et Nicola Giordano. « Multimodal Pushover Analysis for R.C. Bridges ». Applied Mechanics and Materials 725-726 (janvier 2015) : 888–95. http://dx.doi.org/10.4028/www.scientific.net/amm.725-726.888.
Texte intégralDietze, E., F. Maussion, M. Ahlborn, B. Diekmann, K. Hartmann, K. Henkel, T. Kasper, G. Lockot, S. Opitz et T. Haberzettl. « Sediment transport processes across the Tibetan Plateau inferred from robust grain-size end members in lake sediments ». Climate of the Past 10, no 1 (16 janvier 2014) : 91–106. http://dx.doi.org/10.5194/cp-10-91-2014.
Texte intégralDietze, E., F. Maussion, M. Ahlborn, B. Diekmann, K. Hartmann, K. Henkel, T. Kasper, G. Lockot, S. Opitz et T. Haberzettl. « Sediment transport processes across the Tibetan Plateau inferred from robust grain size end-members in lake sediments ». Climate of the Past Discussions 9, no 4 (21 août 2013) : 4855–92. http://dx.doi.org/10.5194/cpd-9-4855-2013.
Texte intégralRiva, Marco, Patrick Hiepe, Mona Frommert, Ignazio Divenuto, Lorenzo G. Gay, Tommaso Sciortino, Marco Conti Nibali, Marco Rossi, Federico Pessina et Lorenzo Bello. « Intraoperative Computed Tomography and Finite Element Modelling for Multimodal Image Fusion in Brain Surgery ». Operative Neurosurgery 18, no 5 (25 juillet 2019) : 531–41. http://dx.doi.org/10.1093/ons/opz196.
Texte intégralLi, Zhigang, Aimei Dong et Jing Zhou. « Research of Low-Rank Representation and Discriminant Correlation Analysis for Alzheimer’s Disease Diagnosis ». Computational and Mathematical Methods in Medicine 2020 (19 mars 2020) : 1–8. http://dx.doi.org/10.1155/2020/5294840.
Texte intégralAbourraja, Mohamed Nezar, Mustapha Oudani, Mohamed Yassine Samiri, Jaouad Boukachour, Abdelaziz Elfazziki, Abdelhadi Bouain et Mehdi Najib. « An improving agent-based engineering strategy for minimizing unproductive situations of cranes in a rail–rail transshipment yard ». SIMULATION 94, no 8 (6 octobre 2017) : 681–705. http://dx.doi.org/10.1177/0037549717733050.
Texte intégralProkofieva, E. S., et V. V. Panin. « UNIFORM PRINCIPLES OF ORGANIZATION OF RAIL FREIGHT TRANSPORTATION OPERATIONS ». World of Transport and Transportation 17, no 5 (7 juin 2020) : 186–98. http://dx.doi.org/10.30932/1992-3252-2019-17-5-186-198.
Texte intégralHoroshkova, Lidiia, Olena Vasyl’yeva, Oksana Maslova et Alexander Sumets. « River logistics amid war and post-war recovery in Ukraine : current situation and prospects . » University Economic Bulletin, no 56 (31 mars 2023) : 113–25. http://dx.doi.org/10.31470/2306-546x-2023-56-113-125.
Texte intégralYurttas, Can, Oliver M. Fisher, Delia Cortés-Guiral, Sebastian P. Haen, Ingmar Königsrainer, Alfred Königsrainer, Stefan Beckert, Winston Liauw et Markus W. Löffler. « Cytoreductive surgery and HIPEC in colorectal cancer—did we get hold of the wrong end of the stick ? » memo - Magazine of European Medical Oncology 13, no 4 (20 octobre 2020) : 434–39. http://dx.doi.org/10.1007/s12254-020-00653-6.
Texte intégralThèses sur le sujet "End-to-end multimodal modelling"
Labbé, Etienne. « Description automatique des événements sonores par des méthodes d'apprentissage profond ». Electronic Thesis or Diss., Université de Toulouse (2023-....), 2024. http://www.theses.fr/2024TLSES054.
Texte intégralIn the audio research field, the majority of machine learning systems focus on recognizing a limited number of sound events. However, when a machine interacts with real data, it must be able to handle much more varied and complex situations. To tackle this problem, annotators use natural language, which allows any sound information to be summarized. Automated Audio Captioning (AAC) was introduced recently to develop systems capable of automatically producing a description of any type of sound in text form. This task concerns all kinds of sound events such as environmental, urban, domestic sounds, sound effects, music or speech. This type of system could be used by people who are deaf or hard of hearing, and could improve the indexing of large audio databases. In the first part of this thesis, we present the state of the art of the AAC task through a global description of public datasets, learning methods, architectures and evaluation metrics. Using this knowledge, we then present the architecture of our first AAC system, which obtains encouraging scores on the main AAC metric named SPIDEr: 24.7% on the Clotho corpus and 40.1% on the AudioCaps corpus. Then, subsequently, we explore many aspects of AAC systems in the second part. We first focus on evaluation methods through the study of SPIDEr. For this, we propose a variant called SPIDEr-max, which considers several candidates for each audio file, and which shows that the SPIDEr metric is very sensitive to the predicted words. Then, we improve our reference system by exploring different architectures and numerous hyper-parameters to exceed the state of the art on AudioCaps (SPIDEr of 49.5%). Next, we explore a multi-task learning method aimed at improving the semantics of sentences generated by our system. Finally, we build a general and unbiased AAC system called CONETTE, which can generate different types of descriptions that approximate those of the target datasets. In the third and last part, we propose to study the capabilities of a AAC system to automatically search for audio content in a database. Our approach obtains competitive scores to systems dedicated to this task, while using fewer parameters. We also introduce semi-supervised methods to improve our system using new unlabeled audio data, and we show how pseudo-label generation can impact a AAC model. Finally, we studied the AAC systems in languages other than English: French, Spanish and German. In addition, we propose a system capable of producing all four languages at the same time, and we compare it with systems specialized in each language
Chapitres de livres sur le sujet "End-to-end multimodal modelling"
Huseyinov, Ilham N. « Fuzzy Linguistic Modelling in Multi Modal Human Computer Interaction ». Dans Speech, Image, and Language Processing for Human Computer Interaction, 64–79. IGI Global, 2012. http://dx.doi.org/10.4018/978-1-4666-0954-9.ch004.
Texte intégral