Academic literature on the topic 'End-to-end multimodal modelling'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'End-to-end multimodal modelling.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "End-to-end multimodal modelling"
Bazaras, Darius, and Ramūnas Palšaitis. "MULTIMODAL APPROACH TO THE INTERNATIONAL TRANSIT TRANSPORT." TRANSPORT 18, no. 6 (December 31, 2003): 248–54. http://dx.doi.org/10.3846/16483840.2003.10414106.
Full textCrespi, Pietro, Alberto Franchi, and Nicola Giordano. "Multimodal Pushover Analysis for R.C. Bridges." Applied Mechanics and Materials 725-726 (January 2015): 888–95. http://dx.doi.org/10.4028/www.scientific.net/amm.725-726.888.
Full textDietze, E., F. Maussion, M. Ahlborn, B. Diekmann, K. Hartmann, K. Henkel, T. Kasper, G. Lockot, S. Opitz, and T. Haberzettl. "Sediment transport processes across the Tibetan Plateau inferred from robust grain-size end members in lake sediments." Climate of the Past 10, no. 1 (January 16, 2014): 91–106. http://dx.doi.org/10.5194/cp-10-91-2014.
Full textDietze, E., F. Maussion, M. Ahlborn, B. Diekmann, K. Hartmann, K. Henkel, T. Kasper, G. Lockot, S. Opitz, and T. Haberzettl. "Sediment transport processes across the Tibetan Plateau inferred from robust grain size end-members in lake sediments." Climate of the Past Discussions 9, no. 4 (August 21, 2013): 4855–92. http://dx.doi.org/10.5194/cpd-9-4855-2013.
Full textRiva, Marco, Patrick Hiepe, Mona Frommert, Ignazio Divenuto, Lorenzo G. Gay, Tommaso Sciortino, Marco Conti Nibali, Marco Rossi, Federico Pessina, and Lorenzo Bello. "Intraoperative Computed Tomography and Finite Element Modelling for Multimodal Image Fusion in Brain Surgery." Operative Neurosurgery 18, no. 5 (July 25, 2019): 531–41. http://dx.doi.org/10.1093/ons/opz196.
Full textLi, Zhigang, Aimei Dong, and Jing Zhou. "Research of Low-Rank Representation and Discriminant Correlation Analysis for Alzheimer’s Disease Diagnosis." Computational and Mathematical Methods in Medicine 2020 (March 19, 2020): 1–8. http://dx.doi.org/10.1155/2020/5294840.
Full textAbourraja, Mohamed Nezar, Mustapha Oudani, Mohamed Yassine Samiri, Jaouad Boukachour, Abdelaziz Elfazziki, Abdelhadi Bouain, and Mehdi Najib. "An improving agent-based engineering strategy for minimizing unproductive situations of cranes in a rail–rail transshipment yard." SIMULATION 94, no. 8 (October 6, 2017): 681–705. http://dx.doi.org/10.1177/0037549717733050.
Full textProkofieva, E. S., and V. V. Panin. "UNIFORM PRINCIPLES OF ORGANIZATION OF RAIL FREIGHT TRANSPORTATION OPERATIONS." World of Transport and Transportation 17, no. 5 (June 7, 2020): 186–98. http://dx.doi.org/10.30932/1992-3252-2019-17-5-186-198.
Full textHoroshkova, Lidiia, Olena Vasyl’yeva, Oksana Maslova, and Alexander Sumets. "River logistics amid war and post-war recovery in Ukraine: current situation and prospects ." University Economic Bulletin, no. 56 (March 31, 2023): 113–25. http://dx.doi.org/10.31470/2306-546x-2023-56-113-125.
Full textYurttas, Can, Oliver M. Fisher, Delia Cortés-Guiral, Sebastian P. Haen, Ingmar Königsrainer, Alfred Königsrainer, Stefan Beckert, Winston Liauw, and Markus W. Löffler. "Cytoreductive surgery and HIPEC in colorectal cancer—did we get hold of the wrong end of the stick?" memo - Magazine of European Medical Oncology 13, no. 4 (October 20, 2020): 434–39. http://dx.doi.org/10.1007/s12254-020-00653-6.
Full textDissertations / Theses on the topic "End-to-end multimodal modelling"
Labbé, Etienne. "Description automatique des événements sonores par des méthodes d'apprentissage profond." Electronic Thesis or Diss., Université de Toulouse (2023-....), 2024. http://www.theses.fr/2024TLSES054.
Full textIn the audio research field, the majority of machine learning systems focus on recognizing a limited number of sound events. However, when a machine interacts with real data, it must be able to handle much more varied and complex situations. To tackle this problem, annotators use natural language, which allows any sound information to be summarized. Automated Audio Captioning (AAC) was introduced recently to develop systems capable of automatically producing a description of any type of sound in text form. This task concerns all kinds of sound events such as environmental, urban, domestic sounds, sound effects, music or speech. This type of system could be used by people who are deaf or hard of hearing, and could improve the indexing of large audio databases. In the first part of this thesis, we present the state of the art of the AAC task through a global description of public datasets, learning methods, architectures and evaluation metrics. Using this knowledge, we then present the architecture of our first AAC system, which obtains encouraging scores on the main AAC metric named SPIDEr: 24.7% on the Clotho corpus and 40.1% on the AudioCaps corpus. Then, subsequently, we explore many aspects of AAC systems in the second part. We first focus on evaluation methods through the study of SPIDEr. For this, we propose a variant called SPIDEr-max, which considers several candidates for each audio file, and which shows that the SPIDEr metric is very sensitive to the predicted words. Then, we improve our reference system by exploring different architectures and numerous hyper-parameters to exceed the state of the art on AudioCaps (SPIDEr of 49.5%). Next, we explore a multi-task learning method aimed at improving the semantics of sentences generated by our system. Finally, we build a general and unbiased AAC system called CONETTE, which can generate different types of descriptions that approximate those of the target datasets. In the third and last part, we propose to study the capabilities of a AAC system to automatically search for audio content in a database. Our approach obtains competitive scores to systems dedicated to this task, while using fewer parameters. We also introduce semi-supervised methods to improve our system using new unlabeled audio data, and we show how pseudo-label generation can impact a AAC model. Finally, we studied the AAC systems in languages other than English: French, Spanish and German. In addition, we propose a system capable of producing all four languages at the same time, and we compare it with systems specialized in each language
Book chapters on the topic "End-to-end multimodal modelling"
Huseyinov, Ilham N. "Fuzzy Linguistic Modelling in Multi Modal Human Computer Interaction." In Speech, Image, and Language Processing for Human Computer Interaction, 64–79. IGI Global, 2012. http://dx.doi.org/10.4018/978-1-4666-0954-9.ch004.
Full text