Literatura académica sobre el tema "End-to-end multimodal modelling"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte las listas temáticas de artículos, libros, tesis, actas de conferencias y otras fuentes académicas sobre el tema "End-to-end multimodal modelling".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Artículos de revistas sobre el tema "End-to-end multimodal modelling"
Bazaras, Darius y Ramūnas Palšaitis. "MULTIMODAL APPROACH TO THE INTERNATIONAL TRANSIT TRANSPORT". TRANSPORT 18, n.º 6 (31 de diciembre de 2003): 248–54. http://dx.doi.org/10.3846/16483840.2003.10414106.
Texto completoCrespi, Pietro, Alberto Franchi y Nicola Giordano. "Multimodal Pushover Analysis for R.C. Bridges". Applied Mechanics and Materials 725-726 (enero de 2015): 888–95. http://dx.doi.org/10.4028/www.scientific.net/amm.725-726.888.
Texto completoDietze, E., F. Maussion, M. Ahlborn, B. Diekmann, K. Hartmann, K. Henkel, T. Kasper, G. Lockot, S. Opitz y T. Haberzettl. "Sediment transport processes across the Tibetan Plateau inferred from robust grain-size end members in lake sediments". Climate of the Past 10, n.º 1 (16 de enero de 2014): 91–106. http://dx.doi.org/10.5194/cp-10-91-2014.
Texto completoDietze, E., F. Maussion, M. Ahlborn, B. Diekmann, K. Hartmann, K. Henkel, T. Kasper, G. Lockot, S. Opitz y T. Haberzettl. "Sediment transport processes across the Tibetan Plateau inferred from robust grain size end-members in lake sediments". Climate of the Past Discussions 9, n.º 4 (21 de agosto de 2013): 4855–92. http://dx.doi.org/10.5194/cpd-9-4855-2013.
Texto completoRiva, Marco, Patrick Hiepe, Mona Frommert, Ignazio Divenuto, Lorenzo G. Gay, Tommaso Sciortino, Marco Conti Nibali, Marco Rossi, Federico Pessina y Lorenzo Bello. "Intraoperative Computed Tomography and Finite Element Modelling for Multimodal Image Fusion in Brain Surgery". Operative Neurosurgery 18, n.º 5 (25 de julio de 2019): 531–41. http://dx.doi.org/10.1093/ons/opz196.
Texto completoLi, Zhigang, Aimei Dong y Jing Zhou. "Research of Low-Rank Representation and Discriminant Correlation Analysis for Alzheimer’s Disease Diagnosis". Computational and Mathematical Methods in Medicine 2020 (19 de marzo de 2020): 1–8. http://dx.doi.org/10.1155/2020/5294840.
Texto completoAbourraja, Mohamed Nezar, Mustapha Oudani, Mohamed Yassine Samiri, Jaouad Boukachour, Abdelaziz Elfazziki, Abdelhadi Bouain y Mehdi Najib. "An improving agent-based engineering strategy for minimizing unproductive situations of cranes in a rail–rail transshipment yard". SIMULATION 94, n.º 8 (6 de octubre de 2017): 681–705. http://dx.doi.org/10.1177/0037549717733050.
Texto completoProkofieva, E. S. y V. V. Panin. "UNIFORM PRINCIPLES OF ORGANIZATION OF RAIL FREIGHT TRANSPORTATION OPERATIONS". World of Transport and Transportation 17, n.º 5 (7 de junio de 2020): 186–98. http://dx.doi.org/10.30932/1992-3252-2019-17-5-186-198.
Texto completoHoroshkova, Lidiia, Olena Vasyl’yeva, Oksana Maslova y Alexander Sumets. "River logistics amid war and post-war recovery in Ukraine: current situation and prospects ." University Economic Bulletin, n.º 56 (31 de marzo de 2023): 113–25. http://dx.doi.org/10.31470/2306-546x-2023-56-113-125.
Texto completoYurttas, Can, Oliver M. Fisher, Delia Cortés-Guiral, Sebastian P. Haen, Ingmar Königsrainer, Alfred Königsrainer, Stefan Beckert, Winston Liauw y Markus W. Löffler. "Cytoreductive surgery and HIPEC in colorectal cancer—did we get hold of the wrong end of the stick?" memo - Magazine of European Medical Oncology 13, n.º 4 (20 de octubre de 2020): 434–39. http://dx.doi.org/10.1007/s12254-020-00653-6.
Texto completoTesis sobre el tema "End-to-end multimodal modelling"
Labbé, Etienne. "Description automatique des événements sonores par des méthodes d'apprentissage profond". Electronic Thesis or Diss., Université de Toulouse (2023-....), 2024. http://www.theses.fr/2024TLSES054.
Texto completoIn the audio research field, the majority of machine learning systems focus on recognizing a limited number of sound events. However, when a machine interacts with real data, it must be able to handle much more varied and complex situations. To tackle this problem, annotators use natural language, which allows any sound information to be summarized. Automated Audio Captioning (AAC) was introduced recently to develop systems capable of automatically producing a description of any type of sound in text form. This task concerns all kinds of sound events such as environmental, urban, domestic sounds, sound effects, music or speech. This type of system could be used by people who are deaf or hard of hearing, and could improve the indexing of large audio databases. In the first part of this thesis, we present the state of the art of the AAC task through a global description of public datasets, learning methods, architectures and evaluation metrics. Using this knowledge, we then present the architecture of our first AAC system, which obtains encouraging scores on the main AAC metric named SPIDEr: 24.7% on the Clotho corpus and 40.1% on the AudioCaps corpus. Then, subsequently, we explore many aspects of AAC systems in the second part. We first focus on evaluation methods through the study of SPIDEr. For this, we propose a variant called SPIDEr-max, which considers several candidates for each audio file, and which shows that the SPIDEr metric is very sensitive to the predicted words. Then, we improve our reference system by exploring different architectures and numerous hyper-parameters to exceed the state of the art on AudioCaps (SPIDEr of 49.5%). Next, we explore a multi-task learning method aimed at improving the semantics of sentences generated by our system. Finally, we build a general and unbiased AAC system called CONETTE, which can generate different types of descriptions that approximate those of the target datasets. In the third and last part, we propose to study the capabilities of a AAC system to automatically search for audio content in a database. Our approach obtains competitive scores to systems dedicated to this task, while using fewer parameters. We also introduce semi-supervised methods to improve our system using new unlabeled audio data, and we show how pseudo-label generation can impact a AAC model. Finally, we studied the AAC systems in languages other than English: French, Spanish and German. In addition, we propose a system capable of producing all four languages at the same time, and we compare it with systems specialized in each language
Capítulos de libros sobre el tema "End-to-end multimodal modelling"
Huseyinov, Ilham N. "Fuzzy Linguistic Modelling in Multi Modal Human Computer Interaction". En Speech, Image, and Language Processing for Human Computer Interaction, 64–79. IGI Global, 2012. http://dx.doi.org/10.4018/978-1-4666-0954-9.ch004.
Texto completo