Journal articles on the topic 'Multimodal embedding space'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Multimodal embedding space.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Tyshchuk, Kirill, Polina Karpikova, Andrew Spiridonov, Anastasiia Prutianova, Anton Razzhigaev, and Alexander Panchenko. "On Isotropy of Multimodal Embeddings." Information 14, no. 7 (July 10, 2023): 392. http://dx.doi.org/10.3390/info14070392.
Full textMai, Sijie, Haifeng Hu, and Songlong Xing. "Modality to Modality Translation: An Adversarial Representation Learning and Graph Fusion Network for Multimodal Fusion." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 01 (April 3, 2020): 164–72. http://dx.doi.org/10.1609/aaai.v34i01.5347.
Full textZhang, Linhai, Deyu Zhou, Yulan He, and Zeng Yang. "MERL: Multimodal Event Representation Learning in Heterogeneous Embedding Spaces." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 16 (May 18, 2021): 14420–27. http://dx.doi.org/10.1609/aaai.v35i16.17695.
Full textGuo, Zhiqiang, Jianjun Li, Guohui Li, Chaoyang Wang, Si Shi, and Bin Ruan. "LGMRec: Local and Global Graph Learning for Multimodal Recommendation." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 8 (March 24, 2024): 8454–62. http://dx.doi.org/10.1609/aaai.v38i8.28688.
Full textMoon, Jucheol, Nhat Anh Le, Nelson Hebert Minaya, and Sang-Il Choi. "Multimodal Few-Shot Learning for Gait Recognition." Applied Sciences 10, no. 21 (October 29, 2020): 7619. http://dx.doi.org/10.3390/app10217619.
Full textZhang, Rongchao, Yiwei Lou, Dexuan Xu, Yongzhi Cao, Hanpin Wang, and Yu Huang. "A Learnable Discrete-Prior Fusion Autoencoder with Contrastive Learning for Tabular Data Synthesis." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 15 (March 24, 2024): 16803–11. http://dx.doi.org/10.1609/aaai.v38i15.29621.
Full textMerkx, Danny, and Stefan L. Frank. "Learning semantic sentence representations from visually grounded language without lexical knowledge." Natural Language Engineering 25, no. 4 (July 2019): 451–66. http://dx.doi.org/10.1017/s1351324919000196.
Full textFan, Yunpeng, Wenyou Du, Yingwei Zhang, and Xiaogang Wang. "Fault Detection for Multimodal Process Using Quality-Relevant Kernel Neighborhood Preserving Embedding." Mathematical Problems in Engineering 2015 (2015): 1–15. http://dx.doi.org/10.1155/2015/210125.
Full textOta, Kosuke, Keiichiro Shirai, Hidetoshi Miyao, and Minoru Maruyama. "Multimodal Analogy-Based Image Retrieval by Improving Semantic Embeddings." Journal of Advanced Computational Intelligence and Intelligent Informatics 26, no. 6 (November 20, 2022): 995–1003. http://dx.doi.org/10.20965/jaciii.2022.p0995.
Full textKim, Jongseok, Youngjae Yu, Hoeseong Kim, and Gunhee Kim. "Dual Compositional Learning in Interactive Image Retrieval." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 2 (May 18, 2021): 1771–79. http://dx.doi.org/10.1609/aaai.v35i2.16271.
Full textAbiyev, Rahib H., Mohamad Ziad Altabel, Manal Darwish, and Abdulkader Helwan. "A Multimodal Transformer Model for Recognition of Images from Complex Laparoscopic Surgical Videos." Diagnostics 14, no. 7 (March 23, 2024): 681. http://dx.doi.org/10.3390/diagnostics14070681.
Full textSkantze, Gabriel, and Bram Willemsen. "CoLLIE: Continual Learning of Language Grounding from Language-Image Embeddings." Journal of Artificial Intelligence Research 74 (July 9, 2022): 1201–23. http://dx.doi.org/10.1613/jair.1.13689.
Full textZhang, Linhao, Li Jin, Xian Sun, Guangluan Xu, Zequn Zhang, Xiaoyu Li, Nayu Liu, Qing Liu, and Shiyao Yan. "TOT:Topology-Aware Optimal Transport for Multimodal Hate Detection." Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 4 (June 26, 2023): 4884–92. http://dx.doi.org/10.1609/aaai.v37i4.25614.
Full textLiang, Meiyu, Junping Du, Zhengyang Liang, Yongwang Xing, Wei Huang, and Zhe Xue. "Self-Supervised Multi-Modal Knowledge Graph Contrastive Hashing for Cross-Modal Search." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 12 (March 24, 2024): 13744–53. http://dx.doi.org/10.1609/aaai.v38i12.29280.
Full textZhang, Yachao, Runze Hu, Ronghui Li, Yanyun Qu, Yuan Xie, and Xiu Li. "Cross-Modal Match for Language Conditioned 3D Object Grounding." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 7 (March 24, 2024): 7359–67. http://dx.doi.org/10.1609/aaai.v38i7.28566.
Full textAkalya, Devi C., Renuka D. Karthika, T. Harisudhan, V. K. Jeevanantham, J. Jhanani, and Varshini S. Kavi. "Text emotion recognition using fast text word embedding in bi-directional gated recurrent unit." i-manager's Journal on Information Technology 11, no. 4 (2022): 1. http://dx.doi.org/10.26634/jit.11.4.19119.
Full textHnini, Ghizlane, Jamal Riffi, Mohamed Adnane Mahraz, Ali Yahyaouy, and Hamid Tairi. "MMPC-RF: A Deep Multimodal Feature-Level Fusion Architecture for Hybrid Spam E-mail Detection." Applied Sciences 11, no. 24 (December 16, 2021): 11968. http://dx.doi.org/10.3390/app112411968.
Full textWang, Kaijie, Tiejun Wang, Xiaoran Guo, Kui Xu, and Jiao Wu. "Thangka Image—Text Matching Based on Adaptive Pooling Layer and Improved Transformer." Applied Sciences 14, no. 2 (January 17, 2024): 807. http://dx.doi.org/10.3390/app14020807.
Full textMeo, Giuseppe, Pilar M. Ferraro, Marta Cillerai, Chiara Gemelli, Corrado Cabona, Federico Zaottini, Luca Roccatagliata, Flavio Villani, Angelo Schenone, and Claudia Caponnetto. "MND Phenotypes Differentiation: The Role of Multimodal Characterization at the Time of Diagnosis." Life 12, no. 10 (September 27, 2022): 1506. http://dx.doi.org/10.3390/life12101506.
Full textBiswas, Rajarshi, Michael Barz, and Daniel Sonntag. "Towards Explanatory Interactive Image Captioning Using Top-Down and Bottom-Up Features, Beam Search and Re-ranking." KI - Künstliche Intelligenz 34, no. 4 (July 8, 2020): 571–84. http://dx.doi.org/10.1007/s13218-020-00679-2.
Full textBalabin, Helena, Charles Tapley Hoyt, Colin Birkenbihl, Benjamin M. Gyori, John Bachman, Alpha Tom Kodamullil, Paul G. Plöger, Martin Hofmann-Apitius, and Daniel Domingo-Fernández. "STonKGs: a sophisticated transformer trained on biomedical text and knowledge graphs." Bioinformatics 38, no. 6 (January 5, 2022): 1648–56. http://dx.doi.org/10.1093/bioinformatics/btac001.
Full textYuan, Xinpan, Xinxin Mao, Wei Xia, Zhiqi Zhang, Shaojun Xie, and Chengyuan Zhang. "PTF-SimCM: A Simple Contrastive Model with Polysemous Text Fusion for Visual Similarity Metric." Complexity 2022 (September 16, 2022): 1–14. http://dx.doi.org/10.1155/2022/2343707.
Full textTang, Zhenchao, Jiehui Huang, Guanxing Chen, and Calvin Yu-Chian Chen. "Comprehensive View Embedding Learning for Single-Cell Multimodal Integration." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 14 (March 24, 2024): 15292–300. http://dx.doi.org/10.1609/aaai.v38i14.29453.
Full textChen, Ziwei, Shaokun An, Xiangqi Bai, Fuzhou Gong, Liang Ma, and Lin Wan. "DensityPath: an algorithm to visualize and reconstruct cell state-transition path on density landscape for single-cell RNA sequencing data." Bioinformatics 35, no. 15 (December 7, 2018): 2593–601. http://dx.doi.org/10.1093/bioinformatics/bty1009.
Full textYin, Ziyi, Muchao Ye, Tianrong Zhang, Jiaqi Wang, Han Liu, Jinghui Chen, Ting Wang, and Fenglong Ma. "VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 7 (March 24, 2024): 6755–63. http://dx.doi.org/10.1609/aaai.v38i7.28499.
Full textLin, Kaiyi, Xing Xu, Lianli Gao, Zheng Wang, and Heng Tao Shen. "Learning Cross-Aligned Latent Embeddings for Zero-Shot Cross-Modal Retrieval." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (April 3, 2020): 11515–22. http://dx.doi.org/10.1609/aaai.v34i07.6817.
Full textXu, Xing, Jialin Tian, Kaiyi Lin, Huimin Lu, Jie Shao, and Heng Tao Shen. "Zero-shot Cross-modal Retrieval by Assembling AutoEncoder and Generative Adversarial Network." ACM Transactions on Multimedia Computing, Communications, and Applications 17, no. 1s (March 31, 2021): 1–17. http://dx.doi.org/10.1145/3424341.
Full textVijaya Kamble. "Design of an Iterative Method for Enhanced Multimodal Time Series Analysis Using Graph Attention Networks, Variational Graph Autoencoders, and Transfer Learning." Journal of Electrical Systems 20, no. 5s (April 13, 2024): 2579–98. http://dx.doi.org/10.52783/jes.2699.
Full textHan, Kezhen, Shaohang Lu, Zhengce Liu, and Zipeng Wang. "Active Fault Isolation for Multimode Fault Systems Based on a Set Separation Indicator." Entropy 25, no. 6 (May 30, 2023): 876. http://dx.doi.org/10.3390/e25060876.
Full textWeiner, Pascal, Caterina Neef, Yoshihisa Shibata, Yoshihiko Nakamura, and Tamim Asfour. "An Embedded, Multi-Modal Sensor System for Scalable Robotic and Prosthetic Hand Fingers." Sensors 20, no. 1 (December 23, 2019): 101. http://dx.doi.org/10.3390/s20010101.
Full textMyles, David, David Milne, and Jonathan D. Shephard. "Scanned Mask Imaging Ablative DPSS UV Laser Process for 2μm L/S RDL." Additional Conferences (Device Packaging, HiTEC, HiTEN, and CICMT) 2015, DPC (January 1, 2015): 000554–89. http://dx.doi.org/10.4071/2015dpc-tp21.
Full textSuguitan, Michael, Nick DePalma, Guy Hoffman, and Jessica Hodgins. "Face2Gesture: Translating Facial Expressions Into Robot Movements Through Shared Latent Space Neural Networks." ACM Transactions on Human-Robot Interaction, October 4, 2023. http://dx.doi.org/10.1145/3623386.
Full textWen, Jun, Xiang Zhang, Everett Rush, Vidul A. Panickan, Xingyu Li, Tianrun Cai, Doudou Zhou, et al. "Multimodal representation learning for predicting molecule–disease relations." Bioinformatics 39, no. 2 (February 1, 2023). http://dx.doi.org/10.1093/bioinformatics/btad085.
Full textChang, Jun Qing, Deepu Rajan, and Nicholas Vun. "Multimodal few-shot classification without attribute embedding." EURASIP Journal on Image and Video Processing 2024, no. 1 (January 10, 2024). http://dx.doi.org/10.1186/s13640-024-00620-9.
Full textElhoseiny, Mohamed, Jingen Liu, Hui Cheng, Harpreet Sawhney, and Ahmed Elgammal. "Zero-Shot Event Detection by Multimodal Distributional Semantic Embedding of Videos." Proceedings of the AAAI Conference on Artificial Intelligence 30, no. 1 (March 5, 2016). http://dx.doi.org/10.1609/aaai.v30i1.10458.
Full textFeng, Duoduo, Xiangteng He, and Yuxin Peng. "MKVSE: Multimodal Knowledge Enhanced Visual-Semantic Embedding for Image-Text Retrieval." ACM Transactions on Multimedia Computing, Communications, and Applications, January 19, 2023. http://dx.doi.org/10.1145/3580501.
Full textRivas, Ryan, Sudipta Paul, Vagelis Hristidis, Evangelos E. Papalexakis, and Amit K. Roy-Chowdhury. "Task-agnostic representation learning of multimodal twitter data for downstream applications." Journal of Big Data 9, no. 1 (February 10, 2022). http://dx.doi.org/10.1186/s40537-022-00570-x.
Full textDong, Shanshan, Tianzi Niu, Xin Luo, Wu Liu, and Xin-Shun Xu. "Semantic Embedding Guided Attention with Explicit Visual Feature Fusion for Video Captioning." ACM Transactions on Multimedia Computing, Communications, and Applications, July 22, 2022. http://dx.doi.org/10.1145/3550276.
Full textChang, Jinho, and Jong Chul Ye. "Bidirectional generation of structure and properties through a single molecular foundation model." Nature Communications 15, no. 1 (March 14, 2024). http://dx.doi.org/10.1038/s41467-024-46440-3.
Full textGhodsizad, Talayeh, Hamid Behnam, Emad Fatemizadeh, Taraneh Faghihi Langroudi, and Fariba Bayat. "Temporal Registration of Cardiac Multimodal Images Using Locally Linear Embedding Algorithm." Frontiers in Biomedical Technologies, November 15, 2021. http://dx.doi.org/10.18502/fbt.v8i4.7757.
Full textIkegawa, Yuya, Ryohei Fukuma, Hidenori Sugano, Satoru Oshino, Naoki Tani, Kentaro Tamura, Yasushi Iimura, et al. "Text and image generation from intracranial electroencephalography using an embedding space for text and images." Journal of Neural Engineering, April 22, 2024. http://dx.doi.org/10.1088/1741-2552/ad417a.
Full textHu, Yue, Ghalia Rehawi, Lambert Moyon, Nathalie Gerstner, Christoph Ogris, Janine Knauer-Arloth, Florian Bittner, Annalisa Marsico, and Nikola S. Mueller. "Network Embedding Across Multiple Tissues and Data Modalities Elucidates the Context of Host Factors Important for COVID-19 Infection." Frontiers in Genetics 13 (July 8, 2022). http://dx.doi.org/10.3389/fgene.2022.909714.
Full textAxås, Joar, and George Haller. "Model reduction for nonlinearizable dynamics via delay-embedded spectral submanifolds." Nonlinear Dynamics, July 16, 2023. http://dx.doi.org/10.1007/s11071-023-08705-2.
Full textDeng, Li. "Deep learning: from speech recognition to language and multimodal processing." APSIPA Transactions on Signal and Information Processing 5 (2016). http://dx.doi.org/10.1017/atsip.2015.22.
Full textQayyum, Abdul, Imran Razzak, M. Tanveer, and Moona Mazher. "Spontaneous Facial Behavior Analysis using Deep Transformer Based Framework for Child–Computer Interaction." ACM Transactions on Multimedia Computing, Communications, and Applications, May 26, 2022. http://dx.doi.org/10.1145/3539577.
Full textZhang, Qing, Jing Zhang, Xiangdong Su, Feilong Bao, and Guanglai Gao. "Contour detection network for zero-shot sketch-based image retrieval." Complex & Intelligent Systems, June 2, 2023. http://dx.doi.org/10.1007/s40747-023-01096-2.
Full textShickel, Benjamin, Brandon Silva, Tezcan Ozrazgat-Baslanti, Yuanfang Ren, Kia Khezeli, Ziyuan Guan, Patrick J. Tighe, Azra Bihorac, and Parisa Rashidi. "Multi-dimensional patient acuity estimation with longitudinal EHR tokenization and flexible transformer networks." Frontiers in Digital Health 4 (November 9, 2022). http://dx.doi.org/10.3389/fdgth.2022.1029191.
Full textGkikas, Stefanos, Nikolaos S. Tachos, Stelios Andreadis, Vasileios C. Pezoulas, Dimitrios Zaridis, George Gkois, Anastasia Matonaki, Thanos G. Stavropoulos, and Dimitrios I. Fotiadis. "Multimodal automatic assessment of acute pain through facial videos and heart rate signals utilizing transformer-based architectures." Frontiers in Pain Research 5 (March 27, 2024). http://dx.doi.org/10.3389/fpain.2024.1372814.
Full textDu, Jin-Hong, Zhanrui Cai, and Kathryn Roeder. "Robust probabilistic modeling for single-cell multimodal mosaic integration and imputation via scVAEIT." Proceedings of the National Academy of Sciences 119, no. 49 (December 2, 2022). http://dx.doi.org/10.1073/pnas.2214414119.
Full textLu, Shanghui, Yong Liang, Le Li, Shuilin Liao, Yongfu Zou, Chengjun Yang, and Dong Ouyang. "Inferring circRNA-drug sensitivity associations via dual hierarchical attention networks and multiple kernel fusion." BMC Genomics 24, no. 1 (December 21, 2023). http://dx.doi.org/10.1186/s12864-023-09899-w.
Full text