Journal articles on the topic 'Video question answering'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Video question answering.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Lei, Chenyi, Lei Wu, Dong Liu, Zhao Li, Guoxin Wang, Haihong Tang, and Houqiang Li. "Multi-Question Learning for Visual Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (April 3, 2020): 11328–35. http://dx.doi.org/10.1609/aaai.v34i07.6794.
Full textRuwa, Nelson, Qirong Mao, Liangjun Wang, and Jianping Gou. "Affective question answering on video." Neurocomputing 363 (October 2019): 125–39. http://dx.doi.org/10.1016/j.neucom.2019.06.046.
Full textWang, Yueqian, Yuxuan Wang, Kai Chen, and Dongyan Zhao. "STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 17 (March 24, 2024): 19215–23. http://dx.doi.org/10.1609/aaai.v38i17.29890.
Full textZong, Linlin, Jiahui Wan, Xianchao Zhang, Xinyue Liu, Wenxin Liang, and Bo Xu. "Video-Context Aligned Transformer for Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 17 (March 24, 2024): 19795–803. http://dx.doi.org/10.1609/aaai.v38i17.29954.
Full textHuang, Deng, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan, and Chuang Gan. "Location-Aware Graph Convolutional Networks for Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (April 3, 2020): 11021–28. http://dx.doi.org/10.1609/aaai.v34i07.6737.
Full textGao, Lianli, Pengpeng Zeng, Jingkuan Song, Yuan-Fang Li, Wu Liu, Tao Mei, and Heng Tao Shen. "Structured Two-Stream Attention Network for Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 6391–98. http://dx.doi.org/10.1609/aaai.v33i01.33016391.
Full textKumar, Krishnamoorthi Magesh, and P. Valarmathie. "Domain and Intelligence Based Multimedia Question Answering System." International Journal of Evaluation and Research in Education (IJERE) 5, no. 3 (September 1, 2016): 227. http://dx.doi.org/10.11591/ijere.v5i3.4544.
Full textXue, Hongyang, Zhou Zhao, and Deng Cai. "Unifying the Video and Question Attentions for Open-Ended Video Question Answering." IEEE Transactions on Image Processing 26, no. 12 (December 2017): 5656–66. http://dx.doi.org/10.1109/tip.2017.2746267.
Full textJang, Yunseok, Yale Song, Chris Dongjoo Kim, Youngjae Yu, Youngjin Kim, and Gunhee Kim. "Video Question Answering with Spatio-Temporal Reasoning." International Journal of Computer Vision 127, no. 10 (June 18, 2019): 1385–412. http://dx.doi.org/10.1007/s11263-019-01189-x.
Full textZhuang, Yueting, Dejing Xu, Xin Yan, Wenzhuo Cheng, Zhou Zhao, Shiliang Pu, and Jun Xiao. "Multichannel Attention Refinement for Video Question Answering." ACM Transactions on Multimedia Computing, Communications, and Applications 16, no. 1s (April 28, 2020): 1–23. http://dx.doi.org/10.1145/3366710.
Full textGarcia, Noa, Mayu Otani, Chenhui Chu, and Yuta Nakashima. "KnowIT VQA: Answering Knowledge-Based Questions about Videos." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (April 3, 2020): 10826–34. http://dx.doi.org/10.1609/aaai.v34i07.6713.
Full textMao, Jianguo, Wenbin Jiang, Hong Liu, Xiangdong Wang, and Yajuan Lyu. "Inferential Knowledge-Enhanced Integrated Reasoning for Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 11 (June 26, 2023): 13380–88. http://dx.doi.org/10.1609/aaai.v37i11.26570.
Full textJiang, Jianwen, Ziqiang Chen, Haojie Lin, Xibin Zhao, and Yue Gao. "Divide and Conquer: Question-Guided Spatio-Temporal Contextual Attention for Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (April 3, 2020): 11101–8. http://dx.doi.org/10.1609/aaai.v34i07.6766.
Full textYang, Saelyne, Sunghyun Park, Yunseok Jang, and Moontae Lee. "YTCommentQA: Video Question Answerability in Instructional Videos." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 17 (March 24, 2024): 19359–67. http://dx.doi.org/10.1609/aaai.v38i17.29906.
Full textYu, Zhou, Dejing Xu, Jun Yu, Ting Yu, Zhou Zhao, Yueting Zhuang, and Dacheng Tao. "ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 9127–34. http://dx.doi.org/10.1609/aaai.v33i01.33019127.
Full textCherian, Anoop, Chiori Hori, Tim K. Marks, and Jonathan Le Roux. "(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 1 (June 28, 2022): 444–53. http://dx.doi.org/10.1609/aaai.v36i1.19922.
Full textChu, Wenqing, Hongyang Xue, Zhou Zhao, Deng Cai, and Chengwei Yao. "The forgettable-watcher model for video question answering." Neurocomputing 314 (November 2018): 386–93. http://dx.doi.org/10.1016/j.neucom.2018.06.069.
Full textZhu, Linchao, Zhongwen Xu, Yi Yang, and Alexander G. Hauptmann. "Uncovering the Temporal Context for Video Question Answering." International Journal of Computer Vision 124, no. 3 (July 13, 2017): 409–21. http://dx.doi.org/10.1007/s11263-017-1033-7.
Full textLee, Yue-Shi, Yu-Chieh Wu, and Jie-Chi Yang. "BVideoQA: Online English/Chinese bilingual video question answering." Journal of the American Society for Information Science and Technology 60, no. 3 (March 2009): 509–25. http://dx.doi.org/10.1002/asi.21002.
Full textXiao, Junbin, Angela Yao, Zhiyuan Liu, Yicong Li, Wei Ji, and Tat-Seng Chua. "Video as Conditional Graph Hierarchy for Multi-Granular Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 3 (June 28, 2022): 2804–12. http://dx.doi.org/10.1609/aaai.v36i3.20184.
Full textLee, Kyungjae, Nan Duan, Lei Ji, Jason Li, and Seung-won Hwang. "Segment-Then-Rank: Non-Factoid Question Answering on Instructional Videos." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 05 (April 3, 2020): 8147–54. http://dx.doi.org/10.1609/aaai.v34i05.6327.
Full textGao, Feng, Yuanyuan Ge, and Yongge Liu. "Remember and forget: video and text fusion for video question answering." Multimedia Tools and Applications 77, no. 22 (March 27, 2018): 29269–82. http://dx.doi.org/10.1007/s11042-018-5868-x.
Full textLi, Zhangbin, Dan Guo, Jinxing Zhou, Jing Zhang, and Meng Wang. "Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 4 (March 24, 2024): 3306–14. http://dx.doi.org/10.1609/aaai.v38i4.28116.
Full textShao, Zhuang, Jiahui Wan, and Linlin Zong. "A Video Question Answering Model Based on Knowledge Distillation." Information 14, no. 6 (June 12, 2023): 328. http://dx.doi.org/10.3390/info14060328.
Full textLi, Xiangpeng, Jingkuan Song, Lianli Gao, Xianglong Liu, Wenbing Huang, Xiangnan He, and Chuang Gan. "Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 8658–65. http://dx.doi.org/10.1609/aaai.v33i01.33018658.
Full textJiang, Pin, and Yahong Han. "Reasoning with Heterogeneous Graph Alignment for Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (April 3, 2020): 11109–16. http://dx.doi.org/10.1609/aaai.v34i07.6767.
Full textGu, Mao, Zhou Zhao, Weike Jin, Richang Hong, and Fei Wu. "Graph-Based Multi-Interaction Network for Video Question Answering." IEEE Transactions on Image Processing 30 (2021): 2758–70. http://dx.doi.org/10.1109/tip.2021.3051756.
Full textYu-Chieh Wu and Jie-Chi Yang. "A Robust Passage Retrieval Algorithm for Video Question Answering." IEEE Transactions on Circuits and Systems for Video Technology 18, no. 10 (October 2008): 1411–21. http://dx.doi.org/10.1109/tcsvt.2008.2002831.
Full textWang, Weining, Yan Huang, and Liang Wang. "Long video question answering: A Matching-guided Attention Model." Pattern Recognition 102 (June 2020): 107248. http://dx.doi.org/10.1016/j.patcog.2020.107248.
Full textYe, Yunan, Shifeng Zhang, Yimeng Li, Xufeng Qian, Siliang Tang, Shiliang Pu, and Jun Xiao. "Video question answering via grounded cross-attention network learning." Information Processing & Management 57, no. 4 (July 2020): 102265. http://dx.doi.org/10.1016/j.ipm.2020.102265.
Full textZhang, Wenqiao, Siliang Tang, Yanpeng Cao, Shiliang Pu, Fei Wu, and Yueting Zhuang. "Frame Augmented Alternating Attention Network for Video Question Answering." IEEE Transactions on Multimedia 22, no. 4 (April 2020): 1032–41. http://dx.doi.org/10.1109/tmm.2019.2935678.
Full textZha, Zheng-Jun, Jiawei Liu, Tianhao Yang, and Yongdong Zhang. "Spatiotemporal-Textual Co-Attention Network for Video Question Answering." ACM Transactions on Multimedia Computing, Communications, and Applications 15, no. 2s (August 12, 2019): 1–18. http://dx.doi.org/10.1145/3320061.
Full textJiang, Yimin, Tingfei Yan, Mingze Yao, Huibing Wang, and Wenzhe Liu. "Cascade transformers with dynamic attention for video question answering." Computer Vision and Image Understanding 242 (May 2024): 103983. http://dx.doi.org/10.1016/j.cviu.2024.103983.
Full textJiao, Guie. "Realization of Video Question Answering System Based on Flash under RIA." Applied Mechanics and Materials 411-414 (September 2013): 970–73. http://dx.doi.org/10.4028/www.scientific.net/amm.411-414.970.
Full textLiu, Mingyang, Ruomei Wang, Fan Zhou, and Ge Lin. "Temporally Multi-Modal Semantic Reasoning with Spatial Language Constraints for Video Question Answering." Symmetry 14, no. 6 (May 31, 2022): 1133. http://dx.doi.org/10.3390/sym14061133.
Full textJin, Yao, Guocheng Niu, Xinyan Xiao, Jian Zhang, Xi Peng, and Jun Yu. "Knowledge-Constrained Answer Generation for Open-Ended Video Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 7 (June 26, 2023): 8141–49. http://dx.doi.org/10.1609/aaai.v37i7.25983.
Full textPatel, Hardik Bhikhabhai, and Sailesh Suryanarayan Iyer. "Comparative Study of Multimedia Question Answering System Models." ECS Transactions 107, no. 1 (April 24, 2022): 2033–42. http://dx.doi.org/10.1149/10701.2033ecst.
Full textGao, Lianli, Yu Lei, Pengpeng Zeng, Jingkuan Song, Meng Wang, and Heng Tao Shen. "Hierarchical Representation Network With Auxiliary Tasks for Video Captioning and Video Question Answering." IEEE Transactions on Image Processing 31 (2022): 202–15. http://dx.doi.org/10.1109/tip.2021.3120867.
Full textYang, Zekun, Noa Garcia, Chenhui Chu, Mayu Otani, Yuta Nakashima, and Haruo Takemura. "A comparative study of language transformers for video question answering." Neurocomputing 445 (July 2021): 121–33. http://dx.doi.org/10.1016/j.neucom.2021.02.092.
Full textZhao, Zhou, Zhu Zhang, Shuwen Xiao, Zhenxin Xiao, Xiaohui Yan, Jun Yu, Deng Cai, and Fei Wu. "Long-Form Video Question Answering via Dynamic Hierarchical Reinforced Networks." IEEE Transactions on Image Processing 28, no. 12 (December 2019): 5939–52. http://dx.doi.org/10.1109/tip.2019.2922062.
Full textYin, Chengxiang, Jian Tang, Zhiyuan Xu, and Yanzhi Wang. "Memory Augmented Deep Recurrent Neural Network for Video Question Answering." IEEE Transactions on Neural Networks and Learning Systems 31, no. 9 (September 2020): 3159–67. http://dx.doi.org/10.1109/tnnls.2019.2938015.
Full textWang, Zheng, Fangtao Li, Kaoru Ota, Mianxiong Dong, and Bin Wu. "ReGR: Relation-aware graph reasoning framework for video question answering." Information Processing & Management 60, no. 4 (July 2023): 103375. http://dx.doi.org/10.1016/j.ipm.2023.103375.
Full textAl Mehmadi, Shima M., Yakoub Bazi, Mohamad M. Al Rahhal, and Mansour Zuair. "Learning to enhance areal video captioning with visual question answering." International Journal of Remote Sensing 45, no. 18 (August 30, 2024): 6395–407. http://dx.doi.org/10.1080/01431161.2024.2388875.
Full textZhuang, Xuqiang, Fang’ai Liu, Jian Hou, Jianhua Hao, and Xiaohong Cai. "Modality attention fusion model with hybrid multi-head self-attention for video understanding." PLOS ONE 17, no. 10 (October 6, 2022): e0275156. http://dx.doi.org/10.1371/journal.pone.0275156.
Full textPeng, Min, Chongyang Wang, Yu Shi, and Xiang-Dong Zhou. "Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer." Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 2 (June 26, 2023): 2038–46. http://dx.doi.org/10.1609/aaai.v37i2.25296.
Full textKim, Seonhoon, Seohyeong Jeong, Eunbyul Kim, Inho Kang, and Nojun Kwak. "Self-supervised Pre-training and Contrastive Representation Learning for Multiple-choice Video QA." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 14 (May 18, 2021): 13171–79. http://dx.doi.org/10.1609/aaai.v35i14.17556.
Full textPark, Gyu-Min, A.-Yeong Kim, and Seong-Bae Park. "Confident Multiple Choice Learning-based Ensemble Model for Video Question-Answering." Journal of KIISE 49, no. 4 (April 30, 2022): 284–90. http://dx.doi.org/10.5626/jok.2022.49.4.284.
Full textLiu, Yun, Xiaoming Zhang, Feiran Huang, Bo Zhang, and Zhoujun Li. "Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering." IEEE Transactions on Image Processing 31 (2022): 1684–96. http://dx.doi.org/10.1109/tip.2022.3142526.
Full textZhao, Zhou, Zhu Zhang, Xinghua Jiang, and Deng Cai. "Multi-Turn Video Question Answering via Hierarchical Attention Context Reinforced Networks." IEEE Transactions on Image Processing 28, no. 8 (August 2019): 3860–72. http://dx.doi.org/10.1109/tip.2019.2902106.
Full textYu, Ting, Jun Yu, Zhou Yu, and Dacheng Tao. "Compositional Attention Networks With Two-Stream Fusion for Video Question Answering." IEEE Transactions on Image Processing 29 (2020): 1204–18. http://dx.doi.org/10.1109/tip.2019.2940677.
Full text