Artigos de revistas sobre o tema "Video question answering"
Crie uma referência precisa em APA, MLA, Chicago, Harvard, e outros estilos
Veja os 50 melhores artigos de revistas para estudos sobre o assunto "Video question answering".
Ao lado de cada fonte na lista de referências, há um botão "Adicionar à bibliografia". Clique e geraremos automaticamente a citação bibliográfica do trabalho escolhido no estilo de citação de que você precisa: APA, MLA, Harvard, Chicago, Vancouver, etc.
Você também pode baixar o texto completo da publicação científica em formato .pdf e ler o resumo do trabalho online se estiver presente nos metadados.
Veja os artigos de revistas das mais diversas áreas científicas e compile uma bibliografia correta.
Lei, Chenyi, Lei Wu, Dong Liu, Zhao Li, Guoxin Wang, Haihong Tang e Houqiang Li. "Multi-Question Learning for Visual Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 34, n.º 07 (3 de abril de 2020): 11328–35. http://dx.doi.org/10.1609/aaai.v34i07.6794.
Texto completo da fonteRuwa, Nelson, Qirong Mao, Liangjun Wang e Jianping Gou. "Affective question answering on video". Neurocomputing 363 (outubro de 2019): 125–39. http://dx.doi.org/10.1016/j.neucom.2019.06.046.
Texto completo da fonteWang, Yueqian, Yuxuan Wang, Kai Chen e Dongyan Zhao. "STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 17 (24 de março de 2024): 19215–23. http://dx.doi.org/10.1609/aaai.v38i17.29890.
Texto completo da fonteZong, Linlin, Jiahui Wan, Xianchao Zhang, Xinyue Liu, Wenxin Liang e Bo Xu. "Video-Context Aligned Transformer for Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 17 (24 de março de 2024): 19795–803. http://dx.doi.org/10.1609/aaai.v38i17.29954.
Texto completo da fonteHuang, Deng, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan e Chuang Gan. "Location-Aware Graph Convolutional Networks for Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 34, n.º 07 (3 de abril de 2020): 11021–28. http://dx.doi.org/10.1609/aaai.v34i07.6737.
Texto completo da fonteGao, Lianli, Pengpeng Zeng, Jingkuan Song, Yuan-Fang Li, Wu Liu, Tao Mei e Heng Tao Shen. "Structured Two-Stream Attention Network for Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 33 (17 de julho de 2019): 6391–98. http://dx.doi.org/10.1609/aaai.v33i01.33016391.
Texto completo da fonteKumar, Krishnamoorthi Magesh, e P. Valarmathie. "Domain and Intelligence Based Multimedia Question Answering System". International Journal of Evaluation and Research in Education (IJERE) 5, n.º 3 (1 de setembro de 2016): 227. http://dx.doi.org/10.11591/ijere.v5i3.4544.
Texto completo da fonteXue, Hongyang, Zhou Zhao e Deng Cai. "Unifying the Video and Question Attentions for Open-Ended Video Question Answering". IEEE Transactions on Image Processing 26, n.º 12 (dezembro de 2017): 5656–66. http://dx.doi.org/10.1109/tip.2017.2746267.
Texto completo da fonteJang, Yunseok, Yale Song, Chris Dongjoo Kim, Youngjae Yu, Youngjin Kim e Gunhee Kim. "Video Question Answering with Spatio-Temporal Reasoning". International Journal of Computer Vision 127, n.º 10 (18 de junho de 2019): 1385–412. http://dx.doi.org/10.1007/s11263-019-01189-x.
Texto completo da fonteZhuang, Yueting, Dejing Xu, Xin Yan, Wenzhuo Cheng, Zhou Zhao, Shiliang Pu e Jun Xiao. "Multichannel Attention Refinement for Video Question Answering". ACM Transactions on Multimedia Computing, Communications, and Applications 16, n.º 1s (28 de abril de 2020): 1–23. http://dx.doi.org/10.1145/3366710.
Texto completo da fonteGarcia, Noa, Mayu Otani, Chenhui Chu e Yuta Nakashima. "KnowIT VQA: Answering Knowledge-Based Questions about Videos". Proceedings of the AAAI Conference on Artificial Intelligence 34, n.º 07 (3 de abril de 2020): 10826–34. http://dx.doi.org/10.1609/aaai.v34i07.6713.
Texto completo da fonteMao, Jianguo, Wenbin Jiang, Hong Liu, Xiangdong Wang e Yajuan Lyu. "Inferential Knowledge-Enhanced Integrated Reasoning for Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 11 (26 de junho de 2023): 13380–88. http://dx.doi.org/10.1609/aaai.v37i11.26570.
Texto completo da fonteJiang, Jianwen, Ziqiang Chen, Haojie Lin, Xibin Zhao e Yue Gao. "Divide and Conquer: Question-Guided Spatio-Temporal Contextual Attention for Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 34, n.º 07 (3 de abril de 2020): 11101–8. http://dx.doi.org/10.1609/aaai.v34i07.6766.
Texto completo da fonteYang, Saelyne, Sunghyun Park, Yunseok Jang e Moontae Lee. "YTCommentQA: Video Question Answerability in Instructional Videos". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 17 (24 de março de 2024): 19359–67. http://dx.doi.org/10.1609/aaai.v38i17.29906.
Texto completo da fonteYu, Zhou, Dejing Xu, Jun Yu, Ting Yu, Zhou Zhao, Yueting Zhuang e Dacheng Tao. "ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 33 (17 de julho de 2019): 9127–34. http://dx.doi.org/10.1609/aaai.v33i01.33019127.
Texto completo da fonteCherian, Anoop, Chiori Hori, Tim K. Marks e Jonathan Le Roux. "(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 36, n.º 1 (28 de junho de 2022): 444–53. http://dx.doi.org/10.1609/aaai.v36i1.19922.
Texto completo da fonteChu, Wenqing, Hongyang Xue, Zhou Zhao, Deng Cai e Chengwei Yao. "The forgettable-watcher model for video question answering". Neurocomputing 314 (novembro de 2018): 386–93. http://dx.doi.org/10.1016/j.neucom.2018.06.069.
Texto completo da fonteZhu, Linchao, Zhongwen Xu, Yi Yang e Alexander G. Hauptmann. "Uncovering the Temporal Context for Video Question Answering". International Journal of Computer Vision 124, n.º 3 (13 de julho de 2017): 409–21. http://dx.doi.org/10.1007/s11263-017-1033-7.
Texto completo da fonteLee, Yue-Shi, Yu-Chieh Wu e Jie-Chi Yang. "BVideoQA: Online English/Chinese bilingual video question answering". Journal of the American Society for Information Science and Technology 60, n.º 3 (março de 2009): 509–25. http://dx.doi.org/10.1002/asi.21002.
Texto completo da fonteXiao, Junbin, Angela Yao, Zhiyuan Liu, Yicong Li, Wei Ji e Tat-Seng Chua. "Video as Conditional Graph Hierarchy for Multi-Granular Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 36, n.º 3 (28 de junho de 2022): 2804–12. http://dx.doi.org/10.1609/aaai.v36i3.20184.
Texto completo da fonteLee, Kyungjae, Nan Duan, Lei Ji, Jason Li e Seung-won Hwang. "Segment-Then-Rank: Non-Factoid Question Answering on Instructional Videos". Proceedings of the AAAI Conference on Artificial Intelligence 34, n.º 05 (3 de abril de 2020): 8147–54. http://dx.doi.org/10.1609/aaai.v34i05.6327.
Texto completo da fonteGao, Feng, Yuanyuan Ge e Yongge Liu. "Remember and forget: video and text fusion for video question answering". Multimedia Tools and Applications 77, n.º 22 (27 de março de 2018): 29269–82. http://dx.doi.org/10.1007/s11042-018-5868-x.
Texto completo da fonteLi, Zhangbin, Dan Guo, Jinxing Zhou, Jing Zhang e Meng Wang. "Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 4 (24 de março de 2024): 3306–14. http://dx.doi.org/10.1609/aaai.v38i4.28116.
Texto completo da fonteShao, Zhuang, Jiahui Wan e Linlin Zong. "A Video Question Answering Model Based on Knowledge Distillation". Information 14, n.º 6 (12 de junho de 2023): 328. http://dx.doi.org/10.3390/info14060328.
Texto completo da fonteLi, Xiangpeng, Jingkuan Song, Lianli Gao, Xianglong Liu, Wenbing Huang, Xiangnan He e Chuang Gan. "Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 33 (17 de julho de 2019): 8658–65. http://dx.doi.org/10.1609/aaai.v33i01.33018658.
Texto completo da fonteJiang, Pin, e Yahong Han. "Reasoning with Heterogeneous Graph Alignment for Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 34, n.º 07 (3 de abril de 2020): 11109–16. http://dx.doi.org/10.1609/aaai.v34i07.6767.
Texto completo da fonteGu, Mao, Zhou Zhao, Weike Jin, Richang Hong e Fei Wu. "Graph-Based Multi-Interaction Network for Video Question Answering". IEEE Transactions on Image Processing 30 (2021): 2758–70. http://dx.doi.org/10.1109/tip.2021.3051756.
Texto completo da fonteYu-Chieh Wu e Jie-Chi Yang. "A Robust Passage Retrieval Algorithm for Video Question Answering". IEEE Transactions on Circuits and Systems for Video Technology 18, n.º 10 (outubro de 2008): 1411–21. http://dx.doi.org/10.1109/tcsvt.2008.2002831.
Texto completo da fonteWang, Weining, Yan Huang e Liang Wang. "Long video question answering: A Matching-guided Attention Model". Pattern Recognition 102 (junho de 2020): 107248. http://dx.doi.org/10.1016/j.patcog.2020.107248.
Texto completo da fonteYe, Yunan, Shifeng Zhang, Yimeng Li, Xufeng Qian, Siliang Tang, Shiliang Pu e Jun Xiao. "Video question answering via grounded cross-attention network learning". Information Processing & Management 57, n.º 4 (julho de 2020): 102265. http://dx.doi.org/10.1016/j.ipm.2020.102265.
Texto completo da fonteZhang, Wenqiao, Siliang Tang, Yanpeng Cao, Shiliang Pu, Fei Wu e Yueting Zhuang. "Frame Augmented Alternating Attention Network for Video Question Answering". IEEE Transactions on Multimedia 22, n.º 4 (abril de 2020): 1032–41. http://dx.doi.org/10.1109/tmm.2019.2935678.
Texto completo da fonteZha, Zheng-Jun, Jiawei Liu, Tianhao Yang e Yongdong Zhang. "Spatiotemporal-Textual Co-Attention Network for Video Question Answering". ACM Transactions on Multimedia Computing, Communications, and Applications 15, n.º 2s (12 de agosto de 2019): 1–18. http://dx.doi.org/10.1145/3320061.
Texto completo da fonteJiang, Yimin, Tingfei Yan, Mingze Yao, Huibing Wang e Wenzhe Liu. "Cascade transformers with dynamic attention for video question answering". Computer Vision and Image Understanding 242 (maio de 2024): 103983. http://dx.doi.org/10.1016/j.cviu.2024.103983.
Texto completo da fonteJiao, Guie. "Realization of Video Question Answering System Based on Flash under RIA". Applied Mechanics and Materials 411-414 (setembro de 2013): 970–73. http://dx.doi.org/10.4028/www.scientific.net/amm.411-414.970.
Texto completo da fonteLiu, Mingyang, Ruomei Wang, Fan Zhou e Ge Lin. "Temporally Multi-Modal Semantic Reasoning with Spatial Language Constraints for Video Question Answering". Symmetry 14, n.º 6 (31 de maio de 2022): 1133. http://dx.doi.org/10.3390/sym14061133.
Texto completo da fonteJin, Yao, Guocheng Niu, Xinyan Xiao, Jian Zhang, Xi Peng e Jun Yu. "Knowledge-Constrained Answer Generation for Open-Ended Video Question Answering". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 7 (26 de junho de 2023): 8141–49. http://dx.doi.org/10.1609/aaai.v37i7.25983.
Texto completo da fontePatel, Hardik Bhikhabhai, e Sailesh Suryanarayan Iyer. "Comparative Study of Multimedia Question Answering System Models". ECS Transactions 107, n.º 1 (24 de abril de 2022): 2033–42. http://dx.doi.org/10.1149/10701.2033ecst.
Texto completo da fonteGao, Lianli, Yu Lei, Pengpeng Zeng, Jingkuan Song, Meng Wang e Heng Tao Shen. "Hierarchical Representation Network With Auxiliary Tasks for Video Captioning and Video Question Answering". IEEE Transactions on Image Processing 31 (2022): 202–15. http://dx.doi.org/10.1109/tip.2021.3120867.
Texto completo da fonteYang, Zekun, Noa Garcia, Chenhui Chu, Mayu Otani, Yuta Nakashima e Haruo Takemura. "A comparative study of language transformers for video question answering". Neurocomputing 445 (julho de 2021): 121–33. http://dx.doi.org/10.1016/j.neucom.2021.02.092.
Texto completo da fonteZhao, Zhou, Zhu Zhang, Shuwen Xiao, Zhenxin Xiao, Xiaohui Yan, Jun Yu, Deng Cai e Fei Wu. "Long-Form Video Question Answering via Dynamic Hierarchical Reinforced Networks". IEEE Transactions on Image Processing 28, n.º 12 (dezembro de 2019): 5939–52. http://dx.doi.org/10.1109/tip.2019.2922062.
Texto completo da fonteYin, Chengxiang, Jian Tang, Zhiyuan Xu e Yanzhi Wang. "Memory Augmented Deep Recurrent Neural Network for Video Question Answering". IEEE Transactions on Neural Networks and Learning Systems 31, n.º 9 (setembro de 2020): 3159–67. http://dx.doi.org/10.1109/tnnls.2019.2938015.
Texto completo da fonteWang, Zheng, Fangtao Li, Kaoru Ota, Mianxiong Dong e Bin Wu. "ReGR: Relation-aware graph reasoning framework for video question answering". Information Processing & Management 60, n.º 4 (julho de 2023): 103375. http://dx.doi.org/10.1016/j.ipm.2023.103375.
Texto completo da fonteAl Mehmadi, Shima M., Yakoub Bazi, Mohamad M. Al Rahhal e Mansour Zuair. "Learning to enhance areal video captioning with visual question answering". International Journal of Remote Sensing 45, n.º 18 (30 de agosto de 2024): 6395–407. http://dx.doi.org/10.1080/01431161.2024.2388875.
Texto completo da fonteZhuang, Xuqiang, Fang’ai Liu, Jian Hou, Jianhua Hao e Xiaohong Cai. "Modality attention fusion model with hybrid multi-head self-attention for video understanding". PLOS ONE 17, n.º 10 (6 de outubro de 2022): e0275156. http://dx.doi.org/10.1371/journal.pone.0275156.
Texto completo da fontePeng, Min, Chongyang Wang, Yu Shi e Xiang-Dong Zhou. "Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 2 (26 de junho de 2023): 2038–46. http://dx.doi.org/10.1609/aaai.v37i2.25296.
Texto completo da fonteKim, Seonhoon, Seohyeong Jeong, Eunbyul Kim, Inho Kang e Nojun Kwak. "Self-supervised Pre-training and Contrastive Representation Learning for Multiple-choice Video QA". Proceedings of the AAAI Conference on Artificial Intelligence 35, n.º 14 (18 de maio de 2021): 13171–79. http://dx.doi.org/10.1609/aaai.v35i14.17556.
Texto completo da fontePark, Gyu-Min, A.-Yeong Kim e Seong-Bae Park. "Confident Multiple Choice Learning-based Ensemble Model for Video Question-Answering". Journal of KIISE 49, n.º 4 (30 de abril de 2022): 284–90. http://dx.doi.org/10.5626/jok.2022.49.4.284.
Texto completo da fonteLiu, Yun, Xiaoming Zhang, Feiran Huang, Bo Zhang e Zhoujun Li. "Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering". IEEE Transactions on Image Processing 31 (2022): 1684–96. http://dx.doi.org/10.1109/tip.2022.3142526.
Texto completo da fonteZhao, Zhou, Zhu Zhang, Xinghua Jiang e Deng Cai. "Multi-Turn Video Question Answering via Hierarchical Attention Context Reinforced Networks". IEEE Transactions on Image Processing 28, n.º 8 (agosto de 2019): 3860–72. http://dx.doi.org/10.1109/tip.2019.2902106.
Texto completo da fonteYu, Ting, Jun Yu, Zhou Yu e Dacheng Tao. "Compositional Attention Networks With Two-Stream Fusion for Video Question Answering". IEEE Transactions on Image Processing 29 (2020): 1204–18. http://dx.doi.org/10.1109/tip.2019.2940677.
Texto completo da fonte