Добірка наукової літератури з теми "Image Captioning (IC)"
Оформте джерело за APA, MLA, Chicago, Harvard та іншими стилями
Ознайомтеся зі списками актуальних статей, книг, дисертацій, тез та інших наукових джерел на тему "Image Captioning (IC)".
Біля кожної праці в переліку літератури доступна кнопка «Додати до бібліографії». Скористайтеся нею – і ми автоматично оформимо бібліографічне посилання на обрану працю в потрібному вам стилі цитування: APA, MLA, «Гарвард», «Чикаго», «Ванкувер» тощо.
Також ви можете завантажити повний текст наукової публікації у форматі «.pdf» та прочитати онлайн анотацію до роботи, якщо відповідні параметри наявні в метаданих.
Статті в журналах з теми "Image Captioning (IC)":
Li, Jingyu, Zhendong Mao, Hao Li, Weidong Chen, and Yongdong Zhang. "Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning." ACM Transactions on Multimedia Computing, Communications, and Applications, December 25, 2023. http://dx.doi.org/10.1145/3638558.
Yu, Mengying, and Aixin Sun. "Dataset versus reality: Understanding model performance from the perspective of information need." Journal of the Association for Information Science and Technology, August 18, 2023. http://dx.doi.org/10.1002/asi.24825.
Дисертації з теми "Image Captioning (IC)":
Elguendouze, Sofiane. "Explainable Artificial Intelligence approaches for Image Captioning." Electronic Thesis or Diss., Orléans, 2024. http://www.theses.fr/2024ORLE1003.
The rapid advancement of image captioning models, driven by the integration of deep learning techniques that combine image and text modalities, has resulted in increasingly complex systems. However, these models often operate as black boxes, lacking the ability to provide transparent explanations for their decisions. This thesis addresses the explainability of image captioning systems based on Encoder-Attention-Decoder architectures, through four aspects. First, it explores the concept of the latent space, marking a departure from traditional approaches relying on the original representation space. Second, it introduces the notion of decisiveness, leading to the formulation of a new definition for the concept of component influence/decisiveness in the context of explainable image captioning, as well as a perturbation-based approach to capturing decisiveness. The third aspect aims to elucidate the factors influencing explanation quality, in particular the scope of explanation methods. Accordingly, latent-based variants of well-established explanation methods such as LRP and LIME have been developed, along with the introduction of a latent-centered evaluation approach called Latent Ablation. The fourth aspect of this work involves investigating what we call saliency and the representation of certain visual concepts, such as object quantity, at different levels of the captioning architecture
Тези доповідей конференцій з теми "Image Captioning (IC)":
Guo, Qilin, Yajing Xu, and Sheng Gao. "Recorrect Net: Visual Guidance for Image Captioning." In 2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC). IEEE, 2021. http://dx.doi.org/10.1109/ic-nidc54101.2021.9660494.
Li, Jingyu, Zhendong Mao, Shancheng Fang, and Hao Li. "ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning." In Thirty-First International Joint Conference on Artificial Intelligence {IJCAI-22}. California: International Joint Conferences on Artificial Intelligence Organization, 2022. http://dx.doi.org/10.24963/ijcai.2022/151.