Literatura académica sobre el tema "Image Captioning (IC)"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte las listas temáticas de artículos, libros, tesis, actas de conferencias y otras fuentes académicas sobre el tema "Image Captioning (IC)".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Artículos de revistas sobre el tema "Image Captioning (IC)"
Li, Jingyu, Zhendong Mao, Hao Li, Weidong Chen y Yongdong Zhang. "Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning". ACM Transactions on Multimedia Computing, Communications, and Applications, 25 de diciembre de 2023. http://dx.doi.org/10.1145/3638558.
Texto completoYu, Mengying y Aixin Sun. "Dataset versus reality: Understanding model performance from the perspective of information need". Journal of the Association for Information Science and Technology, 18 de agosto de 2023. http://dx.doi.org/10.1002/asi.24825.
Texto completoTesis sobre el tema "Image Captioning (IC)"
Elguendouze, Sofiane. "Explainable Artificial Intelligence approaches for Image Captioning". Electronic Thesis or Diss., Orléans, 2024. http://www.theses.fr/2024ORLE1003.
Texto completoThe rapid advancement of image captioning models, driven by the integration of deep learning techniques that combine image and text modalities, has resulted in increasingly complex systems. However, these models often operate as black boxes, lacking the ability to provide transparent explanations for their decisions. This thesis addresses the explainability of image captioning systems based on Encoder-Attention-Decoder architectures, through four aspects. First, it explores the concept of the latent space, marking a departure from traditional approaches relying on the original representation space. Second, it introduces the notion of decisiveness, leading to the formulation of a new definition for the concept of component influence/decisiveness in the context of explainable image captioning, as well as a perturbation-based approach to capturing decisiveness. The third aspect aims to elucidate the factors influencing explanation quality, in particular the scope of explanation methods. Accordingly, latent-based variants of well-established explanation methods such as LRP and LIME have been developed, along with the introduction of a latent-centered evaluation approach called Latent Ablation. The fourth aspect of this work involves investigating what we call saliency and the representation of certain visual concepts, such as object quantity, at different levels of the captioning architecture
Actas de conferencias sobre el tema "Image Captioning (IC)"
Guo, Qilin, Yajing Xu y Sheng Gao. "Recorrect Net: Visual Guidance for Image Captioning". En 2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC). IEEE, 2021. http://dx.doi.org/10.1109/ic-nidc54101.2021.9660494.
Texto completoLi, Jingyu, Zhendong Mao, Shancheng Fang y Hao Li. "ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning". En Thirty-First International Joint Conference on Artificial Intelligence {IJCAI-22}. California: International Joint Conferences on Artificial Intelligence Organization, 2022. http://dx.doi.org/10.24963/ijcai.2022/151.
Texto completo