Artigos de revistas sobre o tema "Multimodal embedding space"
Crie uma referência precisa em APA, MLA, Chicago, Harvard, e outros estilos
Veja os 50 melhores artigos de revistas para estudos sobre o assunto "Multimodal embedding space".
Ao lado de cada fonte na lista de referências, há um botão "Adicionar à bibliografia". Clique e geraremos automaticamente a citação bibliográfica do trabalho escolhido no estilo de citação de que você precisa: APA, MLA, Harvard, Chicago, Vancouver, etc.
Você também pode baixar o texto completo da publicação científica em formato .pdf e ler o resumo do trabalho online se estiver presente nos metadados.
Veja os artigos de revistas das mais diversas áreas científicas e compile uma bibliografia correta.
Tyshchuk, Kirill, Polina Karpikova, Andrew Spiridonov, Anastasiia Prutianova, Anton Razzhigaev e Alexander Panchenko. "On Isotropy of Multimodal Embeddings". Information 14, n.º 7 (10 de julho de 2023): 392. http://dx.doi.org/10.3390/info14070392.
Texto completo da fonteMai, Sijie, Haifeng Hu e Songlong Xing. "Modality to Modality Translation: An Adversarial Representation Learning and Graph Fusion Network for Multimodal Fusion". Proceedings of the AAAI Conference on Artificial Intelligence 34, n.º 01 (3 de abril de 2020): 164–72. http://dx.doi.org/10.1609/aaai.v34i01.5347.
Texto completo da fonteZhang, Linhai, Deyu Zhou, Yulan He e Zeng Yang. "MERL: Multimodal Event Representation Learning in Heterogeneous Embedding Spaces". Proceedings of the AAAI Conference on Artificial Intelligence 35, n.º 16 (18 de maio de 2021): 14420–27. http://dx.doi.org/10.1609/aaai.v35i16.17695.
Texto completo da fonteGuo, Zhiqiang, Jianjun Li, Guohui Li, Chaoyang Wang, Si Shi e Bin Ruan. "LGMRec: Local and Global Graph Learning for Multimodal Recommendation". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 8 (24 de março de 2024): 8454–62. http://dx.doi.org/10.1609/aaai.v38i8.28688.
Texto completo da fonteMoon, Jucheol, Nhat Anh Le, Nelson Hebert Minaya e Sang-Il Choi. "Multimodal Few-Shot Learning for Gait Recognition". Applied Sciences 10, n.º 21 (29 de outubro de 2020): 7619. http://dx.doi.org/10.3390/app10217619.
Texto completo da fonteZhang, Rongchao, Yiwei Lou, Dexuan Xu, Yongzhi Cao, Hanpin Wang e Yu Huang. "A Learnable Discrete-Prior Fusion Autoencoder with Contrastive Learning for Tabular Data Synthesis". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 15 (24 de março de 2024): 16803–11. http://dx.doi.org/10.1609/aaai.v38i15.29621.
Texto completo da fonteMerkx, Danny, e Stefan L. Frank. "Learning semantic sentence representations from visually grounded language without lexical knowledge". Natural Language Engineering 25, n.º 4 (julho de 2019): 451–66. http://dx.doi.org/10.1017/s1351324919000196.
Texto completo da fonteFan, Yunpeng, Wenyou Du, Yingwei Zhang e Xiaogang Wang. "Fault Detection for Multimodal Process Using Quality-Relevant Kernel Neighborhood Preserving Embedding". Mathematical Problems in Engineering 2015 (2015): 1–15. http://dx.doi.org/10.1155/2015/210125.
Texto completo da fonteOta, Kosuke, Keiichiro Shirai, Hidetoshi Miyao e Minoru Maruyama. "Multimodal Analogy-Based Image Retrieval by Improving Semantic Embeddings". Journal of Advanced Computational Intelligence and Intelligent Informatics 26, n.º 6 (20 de novembro de 2022): 995–1003. http://dx.doi.org/10.20965/jaciii.2022.p0995.
Texto completo da fonteKim, Jongseok, Youngjae Yu, Hoeseong Kim e Gunhee Kim. "Dual Compositional Learning in Interactive Image Retrieval". Proceedings of the AAAI Conference on Artificial Intelligence 35, n.º 2 (18 de maio de 2021): 1771–79. http://dx.doi.org/10.1609/aaai.v35i2.16271.
Texto completo da fonteAbiyev, Rahib H., Mohamad Ziad Altabel, Manal Darwish e Abdulkader Helwan. "A Multimodal Transformer Model for Recognition of Images from Complex Laparoscopic Surgical Videos". Diagnostics 14, n.º 7 (23 de março de 2024): 681. http://dx.doi.org/10.3390/diagnostics14070681.
Texto completo da fonteSkantze, Gabriel, e Bram Willemsen. "CoLLIE: Continual Learning of Language Grounding from Language-Image Embeddings". Journal of Artificial Intelligence Research 74 (9 de julho de 2022): 1201–23. http://dx.doi.org/10.1613/jair.1.13689.
Texto completo da fonteZhang, Linhao, Li Jin, Xian Sun, Guangluan Xu, Zequn Zhang, Xiaoyu Li, Nayu Liu, Qing Liu e Shiyao Yan. "TOT:Topology-Aware Optimal Transport for Multimodal Hate Detection". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 4 (26 de junho de 2023): 4884–92. http://dx.doi.org/10.1609/aaai.v37i4.25614.
Texto completo da fonteLiang, Meiyu, Junping Du, Zhengyang Liang, Yongwang Xing, Wei Huang e Zhe Xue. "Self-Supervised Multi-Modal Knowledge Graph Contrastive Hashing for Cross-Modal Search". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 12 (24 de março de 2024): 13744–53. http://dx.doi.org/10.1609/aaai.v38i12.29280.
Texto completo da fonteZhang, Yachao, Runze Hu, Ronghui Li, Yanyun Qu, Yuan Xie e Xiu Li. "Cross-Modal Match for Language Conditioned 3D Object Grounding". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 7 (24 de março de 2024): 7359–67. http://dx.doi.org/10.1609/aaai.v38i7.28566.
Texto completo da fonteAkalya, Devi C., Renuka D. Karthika, T. Harisudhan, V. K. Jeevanantham, J. Jhanani e Varshini S. Kavi. "Text emotion recognition using fast text word embedding in bi-directional gated recurrent unit". i-manager's Journal on Information Technology 11, n.º 4 (2022): 1. http://dx.doi.org/10.26634/jit.11.4.19119.
Texto completo da fonteHnini, Ghizlane, Jamal Riffi, Mohamed Adnane Mahraz, Ali Yahyaouy e Hamid Tairi. "MMPC-RF: A Deep Multimodal Feature-Level Fusion Architecture for Hybrid Spam E-mail Detection". Applied Sciences 11, n.º 24 (16 de dezembro de 2021): 11968. http://dx.doi.org/10.3390/app112411968.
Texto completo da fonteWang, Kaijie, Tiejun Wang, Xiaoran Guo, Kui Xu e Jiao Wu. "Thangka Image—Text Matching Based on Adaptive Pooling Layer and Improved Transformer". Applied Sciences 14, n.º 2 (17 de janeiro de 2024): 807. http://dx.doi.org/10.3390/app14020807.
Texto completo da fonteMeo, Giuseppe, Pilar M. Ferraro, Marta Cillerai, Chiara Gemelli, Corrado Cabona, Federico Zaottini, Luca Roccatagliata, Flavio Villani, Angelo Schenone e Claudia Caponnetto. "MND Phenotypes Differentiation: The Role of Multimodal Characterization at the Time of Diagnosis". Life 12, n.º 10 (27 de setembro de 2022): 1506. http://dx.doi.org/10.3390/life12101506.
Texto completo da fonteBiswas, Rajarshi, Michael Barz e Daniel Sonntag. "Towards Explanatory Interactive Image Captioning Using Top-Down and Bottom-Up Features, Beam Search and Re-ranking". KI - Künstliche Intelligenz 34, n.º 4 (8 de julho de 2020): 571–84. http://dx.doi.org/10.1007/s13218-020-00679-2.
Texto completo da fonteBalabin, Helena, Charles Tapley Hoyt, Colin Birkenbihl, Benjamin M. Gyori, John Bachman, Alpha Tom Kodamullil, Paul G. Plöger, Martin Hofmann-Apitius e Daniel Domingo-Fernández. "STonKGs: a sophisticated transformer trained on biomedical text and knowledge graphs". Bioinformatics 38, n.º 6 (5 de janeiro de 2022): 1648–56. http://dx.doi.org/10.1093/bioinformatics/btac001.
Texto completo da fonteYuan, Xinpan, Xinxin Mao, Wei Xia, Zhiqi Zhang, Shaojun Xie e Chengyuan Zhang. "PTF-SimCM: A Simple Contrastive Model with Polysemous Text Fusion for Visual Similarity Metric". Complexity 2022 (16 de setembro de 2022): 1–14. http://dx.doi.org/10.1155/2022/2343707.
Texto completo da fonteTang, Zhenchao, Jiehui Huang, Guanxing Chen e Calvin Yu-Chian Chen. "Comprehensive View Embedding Learning for Single-Cell Multimodal Integration". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 14 (24 de março de 2024): 15292–300. http://dx.doi.org/10.1609/aaai.v38i14.29453.
Texto completo da fonteChen, Ziwei, Shaokun An, Xiangqi Bai, Fuzhou Gong, Liang Ma e Lin Wan. "DensityPath: an algorithm to visualize and reconstruct cell state-transition path on density landscape for single-cell RNA sequencing data". Bioinformatics 35, n.º 15 (7 de dezembro de 2018): 2593–601. http://dx.doi.org/10.1093/bioinformatics/bty1009.
Texto completo da fonteYin, Ziyi, Muchao Ye, Tianrong Zhang, Jiaqi Wang, Han Liu, Jinghui Chen, Ting Wang e Fenglong Ma. "VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 7 (24 de março de 2024): 6755–63. http://dx.doi.org/10.1609/aaai.v38i7.28499.
Texto completo da fonteLin, Kaiyi, Xing Xu, Lianli Gao, Zheng Wang e Heng Tao Shen. "Learning Cross-Aligned Latent Embeddings for Zero-Shot Cross-Modal Retrieval". Proceedings of the AAAI Conference on Artificial Intelligence 34, n.º 07 (3 de abril de 2020): 11515–22. http://dx.doi.org/10.1609/aaai.v34i07.6817.
Texto completo da fonteXu, Xing, Jialin Tian, Kaiyi Lin, Huimin Lu, Jie Shao e Heng Tao Shen. "Zero-shot Cross-modal Retrieval by Assembling AutoEncoder and Generative Adversarial Network". ACM Transactions on Multimedia Computing, Communications, and Applications 17, n.º 1s (31 de março de 2021): 1–17. http://dx.doi.org/10.1145/3424341.
Texto completo da fonteVijaya Kamble. "Design of an Iterative Method for Enhanced Multimodal Time Series Analysis Using Graph Attention Networks, Variational Graph Autoencoders, and Transfer Learning". Journal of Electrical Systems 20, n.º 5s (13 de abril de 2024): 2579–98. http://dx.doi.org/10.52783/jes.2699.
Texto completo da fonteHan, Kezhen, Shaohang Lu, Zhengce Liu e Zipeng Wang. "Active Fault Isolation for Multimode Fault Systems Based on a Set Separation Indicator". Entropy 25, n.º 6 (30 de maio de 2023): 876. http://dx.doi.org/10.3390/e25060876.
Texto completo da fonteWeiner, Pascal, Caterina Neef, Yoshihisa Shibata, Yoshihiko Nakamura e Tamim Asfour. "An Embedded, Multi-Modal Sensor System for Scalable Robotic and Prosthetic Hand Fingers". Sensors 20, n.º 1 (23 de dezembro de 2019): 101. http://dx.doi.org/10.3390/s20010101.
Texto completo da fonteMyles, David, David Milne e Jonathan D. Shephard. "Scanned Mask Imaging Ablative DPSS UV Laser Process for 2μm L/S RDL". Additional Conferences (Device Packaging, HiTEC, HiTEN, and CICMT) 2015, DPC (1 de janeiro de 2015): 000554–89. http://dx.doi.org/10.4071/2015dpc-tp21.
Texto completo da fonteSuguitan, Michael, Nick DePalma, Guy Hoffman e Jessica Hodgins. "Face2Gesture: Translating Facial Expressions Into Robot Movements Through Shared Latent Space Neural Networks". ACM Transactions on Human-Robot Interaction, 4 de outubro de 2023. http://dx.doi.org/10.1145/3623386.
Texto completo da fonteWen, Jun, Xiang Zhang, Everett Rush, Vidul A. Panickan, Xingyu Li, Tianrun Cai, Doudou Zhou et al. "Multimodal representation learning for predicting molecule–disease relations". Bioinformatics 39, n.º 2 (1 de fevereiro de 2023). http://dx.doi.org/10.1093/bioinformatics/btad085.
Texto completo da fonteChang, Jun Qing, Deepu Rajan e Nicholas Vun. "Multimodal few-shot classification without attribute embedding". EURASIP Journal on Image and Video Processing 2024, n.º 1 (10 de janeiro de 2024). http://dx.doi.org/10.1186/s13640-024-00620-9.
Texto completo da fonteElhoseiny, Mohamed, Jingen Liu, Hui Cheng, Harpreet Sawhney e Ahmed Elgammal. "Zero-Shot Event Detection by Multimodal Distributional Semantic Embedding of Videos". Proceedings of the AAAI Conference on Artificial Intelligence 30, n.º 1 (5 de março de 2016). http://dx.doi.org/10.1609/aaai.v30i1.10458.
Texto completo da fonteFeng, Duoduo, Xiangteng He e Yuxin Peng. "MKVSE: Multimodal Knowledge Enhanced Visual-Semantic Embedding for Image-Text Retrieval". ACM Transactions on Multimedia Computing, Communications, and Applications, 19 de janeiro de 2023. http://dx.doi.org/10.1145/3580501.
Texto completo da fonteRivas, Ryan, Sudipta Paul, Vagelis Hristidis, Evangelos E. Papalexakis e Amit K. Roy-Chowdhury. "Task-agnostic representation learning of multimodal twitter data for downstream applications". Journal of Big Data 9, n.º 1 (10 de fevereiro de 2022). http://dx.doi.org/10.1186/s40537-022-00570-x.
Texto completo da fonteDong, Shanshan, Tianzi Niu, Xin Luo, Wu Liu e Xin-Shun Xu. "Semantic Embedding Guided Attention with Explicit Visual Feature Fusion for Video Captioning". ACM Transactions on Multimedia Computing, Communications, and Applications, 22 de julho de 2022. http://dx.doi.org/10.1145/3550276.
Texto completo da fonteChang, Jinho, e Jong Chul Ye. "Bidirectional generation of structure and properties through a single molecular foundation model". Nature Communications 15, n.º 1 (14 de março de 2024). http://dx.doi.org/10.1038/s41467-024-46440-3.
Texto completo da fonteGhodsizad, Talayeh, Hamid Behnam, Emad Fatemizadeh, Taraneh Faghihi Langroudi e Fariba Bayat. "Temporal Registration of Cardiac Multimodal Images Using Locally Linear Embedding Algorithm". Frontiers in Biomedical Technologies, 15 de novembro de 2021. http://dx.doi.org/10.18502/fbt.v8i4.7757.
Texto completo da fonteIkegawa, Yuya, Ryohei Fukuma, Hidenori Sugano, Satoru Oshino, Naoki Tani, Kentaro Tamura, Yasushi Iimura et al. "Text and image generation from intracranial electroencephalography using an embedding space for text and images". Journal of Neural Engineering, 22 de abril de 2024. http://dx.doi.org/10.1088/1741-2552/ad417a.
Texto completo da fonteHu, Yue, Ghalia Rehawi, Lambert Moyon, Nathalie Gerstner, Christoph Ogris, Janine Knauer-Arloth, Florian Bittner, Annalisa Marsico e Nikola S. Mueller. "Network Embedding Across Multiple Tissues and Data Modalities Elucidates the Context of Host Factors Important for COVID-19 Infection". Frontiers in Genetics 13 (8 de julho de 2022). http://dx.doi.org/10.3389/fgene.2022.909714.
Texto completo da fonteAxås, Joar, e George Haller. "Model reduction for nonlinearizable dynamics via delay-embedded spectral submanifolds". Nonlinear Dynamics, 16 de julho de 2023. http://dx.doi.org/10.1007/s11071-023-08705-2.
Texto completo da fonteDeng, Li. "Deep learning: from speech recognition to language and multimodal processing". APSIPA Transactions on Signal and Information Processing 5 (2016). http://dx.doi.org/10.1017/atsip.2015.22.
Texto completo da fonteQayyum, Abdul, Imran Razzak, M. Tanveer e Moona Mazher. "Spontaneous Facial Behavior Analysis using Deep Transformer Based Framework for Child–Computer Interaction". ACM Transactions on Multimedia Computing, Communications, and Applications, 26 de maio de 2022. http://dx.doi.org/10.1145/3539577.
Texto completo da fonteZhang, Qing, Jing Zhang, Xiangdong Su, Feilong Bao e Guanglai Gao. "Contour detection network for zero-shot sketch-based image retrieval". Complex & Intelligent Systems, 2 de junho de 2023. http://dx.doi.org/10.1007/s40747-023-01096-2.
Texto completo da fonteShickel, Benjamin, Brandon Silva, Tezcan Ozrazgat-Baslanti, Yuanfang Ren, Kia Khezeli, Ziyuan Guan, Patrick J. Tighe, Azra Bihorac e Parisa Rashidi. "Multi-dimensional patient acuity estimation with longitudinal EHR tokenization and flexible transformer networks". Frontiers in Digital Health 4 (9 de novembro de 2022). http://dx.doi.org/10.3389/fdgth.2022.1029191.
Texto completo da fonteGkikas, Stefanos, Nikolaos S. Tachos, Stelios Andreadis, Vasileios C. Pezoulas, Dimitrios Zaridis, George Gkois, Anastasia Matonaki, Thanos G. Stavropoulos e Dimitrios I. Fotiadis. "Multimodal automatic assessment of acute pain through facial videos and heart rate signals utilizing transformer-based architectures". Frontiers in Pain Research 5 (27 de março de 2024). http://dx.doi.org/10.3389/fpain.2024.1372814.
Texto completo da fonteDu, Jin-Hong, Zhanrui Cai e Kathryn Roeder. "Robust probabilistic modeling for single-cell multimodal mosaic integration and imputation via scVAEIT". Proceedings of the National Academy of Sciences 119, n.º 49 (2 de dezembro de 2022). http://dx.doi.org/10.1073/pnas.2214414119.
Texto completo da fonteLu, Shanghui, Yong Liang, Le Li, Shuilin Liao, Yongfu Zou, Chengjun Yang e Dong Ouyang. "Inferring circRNA-drug sensitivity associations via dual hierarchical attention networks and multiple kernel fusion". BMC Genomics 24, n.º 1 (21 de dezembro de 2023). http://dx.doi.org/10.1186/s12864-023-09899-w.
Texto completo da fonte