Literatura académica sobre el tema "Multimodal processing"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte las listas temáticas de artículos, libros, tesis, actas de conferencias y otras fuentes académicas sobre el tema "Multimodal processing".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Artículos de revistas sobre el tema "Multimodal processing"
Ng, Vincent y Shengjie Li. "Multimodal Propaganda Processing". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 13 (26 de junio de 2023): 15368–75. http://dx.doi.org/10.1609/aaai.v37i13.26792.
Texto completoSinke, Christopher, Janina Neufeld, Daniel Wiswede, Hinderk M. Emrich, Stefan Bleich y Gregor R. Szycik. "Multisensory processing in synesthesia — differences in the EEG signal during uni- and multimodal processing". Seeing and Perceiving 25 (2012): 53. http://dx.doi.org/10.1163/187847612x646749.
Texto completoD'Ulizia, Arianna, Fernando Ferri y Patrizia Grifoni. "Generating Multimodal Grammars for Multimodal Dialogue Processing". IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans 40, n.º 6 (noviembre de 2010): 1130–45. http://dx.doi.org/10.1109/tsmca.2010.2041227.
Texto completoBarricelli, Barbara Rita, Piero Mussio, Marco Padula y Paolo Luigi Scala. "TMS for multimodal information processing". Multimedia Tools and Applications 54, n.º 1 (27 de abril de 2010): 97–120. http://dx.doi.org/10.1007/s11042-010-0527-x.
Texto completoParsons, Aaron D., Stephen W. T. Price, Nicola Wadeson, Mark Basham, Andrew M. Beale, Alun W. Ashton, J. Frederick W. Mosselmans y Paul D. Quinn. "Automatic processing of multimodal tomography datasets". Journal of Synchrotron Radiation 24, n.º 1 (1 de enero de 2017): 248–56. http://dx.doi.org/10.1107/s1600577516017756.
Texto completoHoller, Judith y Stephen C. Levinson. "Multimodal Language Processing in Human Communication". Trends in Cognitive Sciences 23, n.º 8 (agosto de 2019): 639–52. http://dx.doi.org/10.1016/j.tics.2019.05.006.
Texto completoFarzin, Faraz, Eric P. Charles y Susan M. Rivera. "Development of Multimodal Processing in Infancy". Infancy 14, n.º 5 (1 de septiembre de 2009): 563–78. http://dx.doi.org/10.1080/15250000903144207.
Texto completoZhang, Ge, Tianxiang Luo, Witold Pedrycz, Mohammed A. El-Meligy, Mohamed Abdel Fattah Sharaf y Zhiwu Li. "Outlier Processing in Multimodal Emotion Recognition". IEEE Access 8 (2020): 55688–701. http://dx.doi.org/10.1109/access.2020.2981760.
Texto completoMetaxakis, Athanasios, Dionysia Petratou y Nektarios Tavernarakis. "Multimodal sensory processing in Caenorhabditis elegans". Open Biology 8, n.º 6 (junio de 2018): 180049. http://dx.doi.org/10.1098/rsob.180049.
Texto completoNock, Harriet J., Giridharan Iyengar y Chalapathy Neti. "Multimodal processing by finding common cause". Communications of the ACM 47, n.º 1 (1 de enero de 2004): 51. http://dx.doi.org/10.1145/962081.962105.
Texto completoTesis sobre el tema "Multimodal processing"
Cadène, Rémi. "Deep Multimodal Learning for Vision and Language Processing". Electronic Thesis or Diss., Sorbonne université, 2020. http://www.theses.fr/2020SORUS277.
Texto completoDigital technologies have become instrumental in transforming our society. Recent statistical methods have been successfully deployed to automate the processing of the growing amount of images, videos, and texts we produce daily. In particular, deep neural networks have been adopted by the computer vision and natural language processing communities for their ability to perform accurate image recognition and text understanding once trained on big sets of data. Advances in both communities built the groundwork for new research problems at the intersection of vision and language. Integrating language into visual recognition could have an important impact on human life through the creation of real-world applications such as next-generation search engines or AI assistants.In the first part of this thesis, we focus on systems for cross-modal text-image retrieval. We propose a learning strategy to efficiently align both modalities while structuring the retrieval space with semantic information. In the second part, we focus on systems able to answer questions about an image. We propose a multimodal architecture that iteratively fuses the visual and textual modalities using a factorized bilinear model while modeling pairwise relationships between each region of the image. In the last part, we address the issues related to biases in the modeling. We propose a learning strategy to reduce the language biases which are commonly present in visual question answering systems
Hu, Yongtao y 胡永涛. "Multimodal speaker localization and identification for video processing". Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2014. http://hdl.handle.net/10722/212633.
Texto completoChen, Xun. "Multimodal biomedical signal processing for corticomuscular coupling analysis". Thesis, University of British Columbia, 2014. http://hdl.handle.net/2429/45811.
Texto completoSadr, Lahijany Nadi. "Multimodal Signal Processing for Diagnosis of Cardiorespiratory Disorders". Thesis, The University of Sydney, 2017. http://hdl.handle.net/2123/17636.
Texto completoElshaw, Mark. "Multimodal neural grounding of language processing for robot actions". Thesis, University of Sunderland, 2005. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.420517.
Texto completoFriedel, Paul. "Sensory information processing : detection, feature extraction, & multimodal integration". kostenfrei, 2008. http://mediatum2.ub.tum.de/doc/651333/651333.pdf.
Texto completoSadeghi, Ghandehari Soroush. "Multimodal signal processing in the peripheral and central vestibular pathways". Thesis, McGill University, 2009. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=95559.
Texto completoLes organes sensoriels vestibulaires de l’oreille interne détectent les mouvements de la tète dans r espace. Ces informations sont envoyées aux neurones vestibulaires centraux localises au niveau du tronc cérébral. A ce niveau convergent également d'autres signaux en provenance du cortex, du cervelet. de la moelle ainsi que de divers noyaux du tronc cérébral. Les études présentées ici ont pour but de comprendre le mode de codage et la nature des signaux générés par les neurones vestibulaires périphériques, ainsi que les capacités de traitement des neurones vestibulaires centraux. véritable centres d'intégration sensori-motrice. Ces travaux ont été conduits en condition physiologique et physiopathologique sur le modèle de la compensation vestibulaire. A r aide de mesures issues de la théorie de l'information, nous nous sommes tout d'abord intéresse aux codages effectues par Ies afférences vestibulaires régulières et irrégulières. Ces deux types neuronaux différent notamment par la variabilité de leur fréquence de décharge spontanée (bruit) et leurs sensibilités (signal). Nous avons montre que Ies fibres afférentes régulières utilisent un codage temporel alors que les fibres irrégulières fonctionnent essentiellement sur un codage en modulation de la fréquence, et ce d' autant mieux que les fréquences sont élevées, constituant ainsi de véritables détecteurs d'évènements. Nous avons ensuite étudie les réponses des afférences suite a une stimulation vestibulaire directe ou a une activation du « système efférent ». En conditions physiologiques, nous avons tout d'abord pu démontrer que le système efférent est bien fonctionneI chez le singe éveille. fr
Fateri, Sina. "Advanced signal processing techniques for multimodal ultrasonic guided wave response". Thesis, Brunel University, 2015. http://bura.brunel.ac.uk/handle/2438/11657.
Texto completoCaglayan, Ozan. "Multimodal Machine Translation". Thesis, Le Mans, 2019. http://www.theses.fr/2019LEMA1016/document.
Texto completoMachine translation aims at automatically translating documents from one language to another without human intervention. With the advent of deep neural networks (DNN), neural approaches to machine translation started to dominate the field, reaching state-ofthe-art performance in many languages. Neural machine translation (NMT) also revived the interest in interlingual machine translation due to how it naturally fits the task into an encoder-decoder framework which produces a translation by decoding a latent source representation. Combined with the architectural flexibility of DNNs, this framework paved the way for further research in multimodality with the objective of augmenting the latent representations with other modalities such as vision or speech, for example. This thesis focuses on a multimodal machine translation (MMT) framework that integrates a secondary visual modality to achieve better and visually grounded language understanding. I specifically worked with a dataset containing images and their translated descriptions, where visual context can be useful forword sense disambiguation, missing word imputation, or gender marking when translating from a language with gender-neutral nouns to one with grammatical gender system as is the case with English to French. I propose two main approaches to integrate the visual modality: (i) a multimodal attention mechanism that learns to take into account both sentence and convolutional visual representations, (ii) a method that uses global visual feature vectors to prime the sentence encoders and the decoders. Through automatic and human evaluation conducted on multiple language pairs, the proposed approaches were demonstrated to be beneficial. Finally, I further show that by systematically removing certain linguistic information from the input sentences, the true strength of both methods emerges as they successfully impute missing nouns, colors and can even translate when parts of the source sentences are completely removed
Fridman, Linnea y Victoria Nordberg. "Two Multimodal Image Registration Approaches for Positioning Purposes". Thesis, Linköpings universitet, Datorseende, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-157424.
Texto completoLibros sobre el tema "Multimodal processing"
Renals, Steve, Herve Bourlard, Jean Carletta y Andrei Popescu-Belis, eds. Multimodal Signal Processing. Cambridge: Cambridge University Press, 2009. http://dx.doi.org/10.1017/cbo9781139136310.
Texto completoMaragos, Petros, Alexandros Potamianos y Patrick Gros, eds. Multimodal Processing and Interaction. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76316-3.
Texto completoRenals, Steve. Multimodal signal processing: Human interactions in meetings. Cambridge: Cambridge University Press, 2012.
Buscar texto completoFerran, Marques, Knovel (Firm) y ScienceDirect (Online service), eds. Multimodal signal processing: Theory and applications for human-computer interaction. Amsterdam: Academic, 2010.
Buscar texto completoGavrilova, Marina L. Multimodal biometrics and intelligent image processing for security systems. Hershey, PA: Information Science Reference, 2013.
Buscar texto completo1959-, Grifoni Patrizia, ed. Multimodal human computer interaction and pervasive services. Hershey PA: Information Science Reference, 2009.
Buscar texto completoSappa, Angel D. Multimodal Interaction in Image and Video Applications. Berlin, Heidelberg: Springer Berlin Heidelberg, 2013.
Buscar texto completo1953-, Zimmer H. D., ed. Human memory: A multimodal approach. Seattle: Hogrefe & Huber Publishers, 1994.
Buscar texto completoAdams, Teresa M. Guidelines for the implementation of multimodal transportation location referencing systems. Washington, D.C: National Academy Press, 2001.
Buscar texto completoKühnel, Christine. Quantifying Quality Aspects of Multimodal Interactive Systems. Berlin, Heidelberg: Springer Berlin Heidelberg, 2012.
Buscar texto completoCapítulos de libros sobre el tema "Multimodal processing"
Huang, Lihe. "Collecting and processing multimodal data". En Toward Multimodal Pragmatics, 99–108. London: Routledge, 2021. http://dx.doi.org/10.4324/9781003251774-5.
Texto completoGullberg, Marianne. "Studying Multimodal Language Processing". En The Routledge Handbook of Second Language Acquisition and Psycholinguistics, 137–49. New York: Routledge, 2022. http://dx.doi.org/10.4324/9781003018872-14.
Texto completoWaibel, Alex, Minh Tue Vo, Paul Duchnowski y Stefan Manke. "Multimodal Interfaces". En Integration of Natural Language and Vision Processing, 299–319. Dordrecht: Springer Netherlands, 1996. http://dx.doi.org/10.1007/978-94-009-1716-3_9.
Texto completoRivet, Bertrand y Jonathon Chambers. "Multimodal Speech Separation". En Advances in Nonlinear Speech Processing, 1–11. Berlin, Heidelberg: Springer Berlin Heidelberg, 2010. http://dx.doi.org/10.1007/978-3-642-11509-7_1.
Texto completoNéel, Françoise D. y Wolfgang M. Minker. "Multimodal Speech Systems". En Computational Models of Speech Pattern Processing, 404–30. Berlin, Heidelberg: Springer Berlin Heidelberg, 1999. http://dx.doi.org/10.1007/978-3-642-60087-6_34.
Texto completoMaragos, Petros, Patrick Gros, Athanassios Katsamanis y George Papandreou. "Cross-Modal Integration for Performance Improving in Multimedia: A Review". En Multimodal Processing and Interaction, 1–46. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76316-3_1.
Texto completoFerecatu, Marin, Nozha Boujemaa y Michel Crucianu. "Interactive Image Retrieval Using a Hybrid Visual and Conceptual Content Representation". En Multimodal Processing and Interaction, 1–20. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76316-3_10.
Texto completoNeumayer, Robert y Andreas Rauber. "Multimodal Analysis of Text and Audio Features for Music Information Retrieval". En Multimodal Processing and Interaction, 1–17. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76316-3_11.
Texto completoPetrakis, Euripides G. M. "Intelligent Search for Image Information on the Web through Text and Link Structure Analysis". En Multimodal Processing and Interaction, 1–17. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76316-3_12.
Texto completoPotamianos, Alexandros y Manolis Perakakis. "IDesign Principles for Multimodal Spoken Dialogue Systems". En Multimodal Processing and Interaction, 1–18. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76316-3_13.
Texto completoActas de conferencias sobre el tema "Multimodal processing"
Potamianos, Alexandros. "Cognitive Multimodal Processing". En the 2014 Workshop. New York, New York, USA: ACM Press, 2014. http://dx.doi.org/10.1145/2666253.2666264.
Texto completoJohnston, Michael. "Multimodal language processing". En 5th International Conference on Spoken Language Processing (ICSLP 1998). ISCA: ISCA, 1998. http://dx.doi.org/10.21437/icslp.1998-278.
Texto completoYang, Lixin, Genshe Chen, Ronghua Xu, Sherry Chen y Yu Chen. "Decentralized autonomous imaging data processing using blockchain". En Multimodal Biomedical Imaging XIV, editado por Fred S. Azar, Xavier Intes y Qianqian Fang. SPIE, 2019. http://dx.doi.org/10.1117/12.2513243.
Texto completoBarricelli, Barbara Rita, Marco Padula y Paolo Luigi Scala. "TMS for Multimodal Information Processing". En 2009 20th International Workshop on Database and Expert Systems Application. IEEE, 2009. http://dx.doi.org/10.1109/dexa.2009.34.
Texto completoZhu, Junnan, Haoran Li, Tianshang Liu, Yu Zhou, Jiajun Zhang y Chengqing Zong. "MSMO: Multimodal Summarization with Multimodal Output". En Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA, USA: Association for Computational Linguistics, 2018. http://dx.doi.org/10.18653/v1/d18-1448.
Texto completoDamian, Ionut, Michael Dietz, Frank Gaibler y Elisabeth André. "Social signal processing for dummies". En ICMI '16: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION. New York, NY, USA: ACM, 2016. http://dx.doi.org/10.1145/2993148.2998527.
Texto completoKokkinidis, K., A. Stergiaki y A. Tsagaris. "Machine learning via multimodal signal processing". En 2017 6th International Conference on Modern Circuits and Systems Technologies (MOCAST). IEEE, 2017. http://dx.doi.org/10.1109/mocast.2017.7937653.
Texto completoOviatt, Sharon. "Multimodal system processing in mobile environments". En the 13th annual ACM symposium. New York, New York, USA: ACM Press, 2000. http://dx.doi.org/10.1145/354401.354408.
Texto completoKharinov, Mikhail V. y Aleksandr N. Bykov. "Data Structure for Multimodal Signal Processing". En 2019 International Russian Automation Conference. IEEE, 2019. http://dx.doi.org/10.1109/rusautocon.2019.8867769.
Texto completoBangalore, Srinivas y Michael Johnston. "Robust gesture processing for multimodal interaction". En the 10th international conference. New York, New York, USA: ACM Press, 2008. http://dx.doi.org/10.1145/1452392.1452439.
Texto completoInformes sobre el tema "Multimodal processing"
Gazzaniga, Michael S. Multimodal Interactions in Sensory-Motor Processing. Fort Belvoir, VA: Defense Technical Information Center, junio de 1992. http://dx.doi.org/10.21236/ada255780.
Texto completoHughes, H. C., P. A. Reuter-Lorenz, R. Fendrich, G. Nozawa y M. S. Gazzaniga. Multimodal Interactions in Sensory-Motor Processing. Fort Belvoir, VA: Defense Technical Information Center, septiembre de 1990. http://dx.doi.org/10.21236/ada229111.
Texto completoLewandowski, Lawrence J., Susan B. Hursh y David A. Kobus. Multimodal versus Unimodal Information Processing of Words. Fort Belvoir, VA: Defense Technical Information Center, julio de 1985. http://dx.doi.org/10.21236/ada160517.
Texto completoVarshney, Pramod K. Multimodal Signal Processing for Personnel Detection and Activity Classification for Indoor Surveillance. Fort Belvoir, VA: Defense Technical Information Center, noviembre de 2013. http://dx.doi.org/10.21236/ada606602.
Texto completoHamlin, Alexandra, Erik Kobylarz, James Lever, Susan Taylor y Laura Ray. Assessing the feasibility of detecting epileptic seizures using non-cerebral sensor. Engineer Research and Development Center (U.S.), diciembre de 2021. http://dx.doi.org/10.21079/11681/42562.
Texto completoLee, W. S., Victor Alchanatis y Asher Levi. Innovative yield mapping system using hyperspectral and thermal imaging for precision tree crop management. United States Department of Agriculture, enero de 2014. http://dx.doi.org/10.32747/2014.7598158.bard.
Texto completoFederal Information Processing Standards Publication: detail specification for 62.5?m core diameter125-?m cladding diameter class IA multimode, graded-index optical waveguide fibers. Gaithersburg, MD: National Institute of Standards and Technology, 1989. http://dx.doi.org/10.6028/nist.fips.159.
Texto completoFederal Information Processing Standards Publication: detail specification for 62.5?m core diameter125-?m cladding diameter class IA multimode, graded-index optical waveguide fibers. Gaithersburg, MD: National Institute of Standards and Technology, 1990. http://dx.doi.org/10.6028/nist.fips.159-1990.
Texto completo