Letteratura scientifica selezionata sul tema "Image Captioning (IC)"
Cita una fonte nei formati APA, MLA, Chicago, Harvard e in molti altri stili
Consulta la lista di attuali articoli, libri, tesi, atti di convegni e altre fonti scientifiche attinenti al tema "Image Captioning (IC)".
Accanto a ogni fonte nell'elenco di riferimenti c'è un pulsante "Aggiungi alla bibliografia". Premilo e genereremo automaticamente la citazione bibliografica dell'opera scelta nello stile citazionale di cui hai bisogno: APA, MLA, Harvard, Chicago, Vancouver ecc.
Puoi anche scaricare il testo completo della pubblicazione scientifica nel formato .pdf e leggere online l'abstract (il sommario) dell'opera se è presente nei metadati.
Articoli di riviste sul tema "Image Captioning (IC)"
Li, Jingyu, Zhendong Mao, Hao Li, Weidong Chen e Yongdong Zhang. "Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning". ACM Transactions on Multimedia Computing, Communications, and Applications, 25 dicembre 2023. http://dx.doi.org/10.1145/3638558.
Testo completoYu, Mengying, e Aixin Sun. "Dataset versus reality: Understanding model performance from the perspective of information need". Journal of the Association for Information Science and Technology, 18 agosto 2023. http://dx.doi.org/10.1002/asi.24825.
Testo completoTesi sul tema "Image Captioning (IC)"
Elguendouze, Sofiane. "Explainable Artificial Intelligence approaches for Image Captioning". Electronic Thesis or Diss., Orléans, 2024. http://www.theses.fr/2024ORLE1003.
Testo completoThe rapid advancement of image captioning models, driven by the integration of deep learning techniques that combine image and text modalities, has resulted in increasingly complex systems. However, these models often operate as black boxes, lacking the ability to provide transparent explanations for their decisions. This thesis addresses the explainability of image captioning systems based on Encoder-Attention-Decoder architectures, through four aspects. First, it explores the concept of the latent space, marking a departure from traditional approaches relying on the original representation space. Second, it introduces the notion of decisiveness, leading to the formulation of a new definition for the concept of component influence/decisiveness in the context of explainable image captioning, as well as a perturbation-based approach to capturing decisiveness. The third aspect aims to elucidate the factors influencing explanation quality, in particular the scope of explanation methods. Accordingly, latent-based variants of well-established explanation methods such as LRP and LIME have been developed, along with the introduction of a latent-centered evaluation approach called Latent Ablation. The fourth aspect of this work involves investigating what we call saliency and the representation of certain visual concepts, such as object quantity, at different levels of the captioning architecture
Atti di convegni sul tema "Image Captioning (IC)"
Guo, Qilin, Yajing Xu e Sheng Gao. "Recorrect Net: Visual Guidance for Image Captioning". In 2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC). IEEE, 2021. http://dx.doi.org/10.1109/ic-nidc54101.2021.9660494.
Testo completoLi, Jingyu, Zhendong Mao, Shancheng Fang e Hao Li. "ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning". In Thirty-First International Joint Conference on Artificial Intelligence {IJCAI-22}. California: International Joint Conferences on Artificial Intelligence Organization, 2022. http://dx.doi.org/10.24963/ijcai.2022/151.
Testo completo