Articoli di riviste sul tema "Video Vision Transformer"
Cita una fonte nei formati APA, MLA, Chicago, Harvard e in molti altri stili
Vedi i top-50 articoli di riviste per l'attività di ricerca sul tema "Video Vision Transformer".
Accanto a ogni fonte nell'elenco di riferimenti c'è un pulsante "Aggiungi alla bibliografia". Premilo e genereremo automaticamente la citazione bibliografica dell'opera scelta nello stile citazionale di cui hai bisogno: APA, MLA, Harvard, Chicago, Vancouver ecc.
Puoi anche scaricare il testo completo della pubblicazione scientifica nel formato .pdf e leggere online l'abstract (il sommario) dell'opera se è presente nei metadati.
Vedi gli articoli di riviste di molte aree scientifiche e compila una bibliografia corretta.
Naikwadi, Sanket Shashikant. "Video Summarization Using Vision and Language Transformer Models". International Journal of Research Publication and Reviews 6, n. 6 (gennaio 2025): 5217–21. https://doi.org/10.55248/gengpi.6.0125.0654.
Testo completoMoutik, Oumaima, Hiba Sekkat, Smail Tigani, Abdellah Chehri, Rachid Saadane, Taha Ait Tchakoucht e Anand Paul. "Convolutional Neural Networks or Vision Transformers: Who Will Win the Race for Action Recognitions in Visual Data?" Sensors 23, n. 2 (9 gennaio 2023): 734. http://dx.doi.org/10.3390/s23020734.
Testo completoYuan, Hongchun, Zhenyu Cai, Hui Zhou, Yue Wang e Xiangzhi Chen. "TransAnomaly: Video Anomaly Detection Using Video Vision Transformer". IEEE Access 9 (2021): 123977–86. http://dx.doi.org/10.1109/access.2021.3109102.
Testo completoSarraf, Saman, e Milton Kabia. "Optimal Topology of Vision Transformer for Real-Time Video Action Recognition in an End-To-End Cloud Solution". Machine Learning and Knowledge Extraction 5, n. 4 (29 settembre 2023): 1320–39. http://dx.doi.org/10.3390/make5040067.
Testo completoZhao, Hong, Zhiwen Chen, Lan Guo e Zeyu Han. "Video captioning based on vision transformer and reinforcement learning". PeerJ Computer Science 8 (16 marzo 2022): e916. http://dx.doi.org/10.7717/peerj-cs.916.
Testo completoIm, Heeju, e Yong Suk Choi. "A Full Transformer Video Captioning Model via Vision Transformer". KIISE Transactions on Computing Practices 29, n. 8 (31 agosto 2023): 378–83. http://dx.doi.org/10.5626/ktcp.2023.29.8.378.
Testo completoUgile, Tukaram, e Dr Nilesh Uke. "TRANSFORMER ARCHITECTURES FOR COMPUTER VISION: A COMPREHENSIVE REVIEW AND FUTURE RESEARCH DIRECTIONS". Journal of Dynamics and Control 9, n. 3 (15 marzo 2025): 70–79. https://doi.org/10.71058/jodac.v9i3005.
Testo completoWu, Pengfei, Le Wang, Sanping Zhou, Gang Hua e Changyin Sun. "Temporal Correlation Vision Transformer for Video Person Re-Identification". Proceedings of the AAAI Conference on Artificial Intelligence 38, n. 6 (24 marzo 2024): 6083–91. http://dx.doi.org/10.1609/aaai.v38i6.28424.
Testo completoJin, Yanxiu, e Rulin Ma. "Applications of transformers in computer vision". Applied and Computational Engineering 16, n. 1 (23 ottobre 2023): 234–41. http://dx.doi.org/10.54254/2755-2721/16/20230898.
Testo completoPei, Pengfei, Xianfeng Zhao, Jinchuan Li, Yun Cao e Xuyuan Lai. "Vision Transformer-Based Video Hashing Retrieval for Tracing the Source of Fake Videos". Security and Communication Networks 2023 (28 giugno 2023): 1–16. http://dx.doi.org/10.1155/2023/5349392.
Testo completoWang, Hao, Wenjia Zhang e Guohua Liu. "TSNet: Token Sparsification for Efficient Video Transformer". Applied Sciences 13, n. 19 (24 settembre 2023): 10633. http://dx.doi.org/10.3390/app131910633.
Testo completoKim, Dahyun, e Myung Hwan Na. "Rice yield prediction and self-attention visualization using Video Vision Transformer". Korean Data Analysis Society 25, n. 4 (31 agosto 2023): 1249–59. http://dx.doi.org/10.37727/jkdas.2023.25.4.1249.
Testo completoLee, Jaewoo, Sungjun Lee, Wonki Cho, Zahid Ali Siddiqui e Unsang Park. "Vision Transformer-Based Tailing Detection in Videos". Applied Sciences 11, n. 24 (7 dicembre 2021): 11591. http://dx.doi.org/10.3390/app112411591.
Testo completoAbdlrazg, Bassma A. Awad, Sumaia Masoud e Mnal M. Ali. "Human Action Detection Using A hybrid Architecture of CNN and Transformer". International Science and Technology Journal 34, n. 1 (25 gennaio 2024): 1–15. http://dx.doi.org/10.62341/bsmh2119.
Testo completoLi, Xue, Huibo Zhou e Ming Zhao. "Transformer-based cascade networks with spatial and channel reconstruction convolution for deepfake detection". Mathematical Biosciences and Engineering 21, n. 3 (2024): 4142–64. http://dx.doi.org/10.3934/mbe.2024183.
Testo completoZhou, Siyuan, Chunru Zhan, Biao Wang, Tiezheng Ge, Yuning Jiang e Li Niu. "Video Object of Interest Segmentation". Proceedings of the AAAI Conference on Artificial Intelligence 37, n. 3 (26 giugno 2023): 3805–13. http://dx.doi.org/10.1609/aaai.v37i3.25493.
Testo completoHuo, Hua, e Bingjie Li. "MgMViT: Multi-Granularity and Multi-Scale Vision Transformer for Efficient Action Recognition". Electronics 13, n. 5 (29 febbraio 2024): 948. http://dx.doi.org/10.3390/electronics13050948.
Testo completoKumar, Pavan. "Revolutionizing Deepfake Detection and Realtime Video Vision with CNN-based Deep Learning Model". International Journal of Innovative Research in Information Security 10, n. 04 (8 maggio 2024): 173–77. http://dx.doi.org/10.26562/ijiris.2024.v1004.10.
Testo completoReddy, Sai Krishna. "Advancements in Video Deblurring: A Comprehensive Review". INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 08, n. 05 (7 maggio 2024): 1–5. http://dx.doi.org/10.55041/ijsrem32759.
Testo completoIm, Heeju, e Yong-Suk Choi. "UAT: Universal Attention Transformer for Video Captioning". Sensors 22, n. 13 (25 giugno 2022): 4817. http://dx.doi.org/10.3390/s22134817.
Testo completoYamazaki, Kashu, Khoa Vo, Quang Sang Truong, Bhiksha Raj e Ngan Le. "VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning". Proceedings of the AAAI Conference on Artificial Intelligence 37, n. 3 (26 giugno 2023): 3081–90. http://dx.doi.org/10.1609/aaai.v37i3.25412.
Testo completoChoksi, Sarah, Sanjeev Narasimhan, Mattia Ballo, Mehmet Turkcan, Yiran Hu, Chengbo Zang, Alex Farrell et al. "Automatic assessment of robotic suturing utilizing computer vision in a dry-lab simulation". Artificial Intelligence Surgery 5, n. 2 (1 aprile 2025): 160–9. https://doi.org/10.20517/ais.2024.84.
Testo completoNarsina, Deekshith, Nicholas Richardson, Arjun Kamisetty, Jaya Chandra Srikanth Gummadi e Krishna Devarapu. "Neural Network Architectures for Real-Time Image and Video Processing Applications". Engineering International 10, n. 2 (2022): 131–44. https://doi.org/10.18034/ei.v10i2.735.
Testo completoHan, Xiao, Yongbin Wang, Shouxun Liu e Cong Jin. "Online Multiplayer Tracking by Extracting Temporal Contexts with Transformer". Wireless Communications and Mobile Computing 2022 (11 ottobre 2022): 1–10. http://dx.doi.org/10.1155/2022/6177973.
Testo completoZhang, Fan, Jiawei Tian, Jianhao Wang, Guanyou Liu e Ying Liu. "ECViST: Mine Intelligent Monitoring Based on Edge Computing and Vision Swin Transformer-YOLOv5". Energies 15, n. 23 (29 novembre 2022): 9015. http://dx.doi.org/10.3390/en15239015.
Testo completoMardani, Konstantina, Nicholas Vretos e Petros Daras. "Transformer-Based Fire Detection in Videos". Sensors 23, n. 6 (11 marzo 2023): 3035. http://dx.doi.org/10.3390/s23063035.
Testo completoPeng, Pengfei, Guoqing Liang e Tao Luan. "Multi-View Inconsistency Analysis for Video Object-Level Splicing Localization". International Journal of Emerging Technologies and Advanced Applications 1, n. 3 (24 aprile 2024): 1–5. http://dx.doi.org/10.62677/ijetaa.2403111.
Testo completoWang, Jing, e ZongJu Yang. "Transformer-Guided Video Inpainting Algorithm Based on Local Spatial-Temporal joint". EAI Endorsed Transactions on e-Learning 8, n. 4 (15 agosto 2023): e2. http://dx.doi.org/10.4108/eetel.3156.
Testo completoLe, Viet-Tuan, Kiet Tran-Trung e Vinh Truong Hoang. "A Comprehensive Review of Recent Deep Learning Techniques for Human Activity Recognition". Computational Intelligence and Neuroscience 2022 (20 aprile 2022): 1–17. http://dx.doi.org/10.1155/2022/8323962.
Testo completoHong, Jiuk, Chaehyeon Lee e Heechul Jung. "Late Fusion-Based Video Transformer for Facial Micro-Expression Recognition". Applied Sciences 12, n. 3 (23 gennaio 2022): 1169. http://dx.doi.org/10.3390/app12031169.
Testo completoD, Mrs Srivalli, e Divya Sri V. "Video Inpainting with Local and Global Refinement". INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 08, n. 03 (17 marzo 2024): 1–5. http://dx.doi.org/10.55041/ijsrem29385.
Testo completoHabeb, Mohamed H., May Salama e Lamiaa A. Elrefaei. "Enhancing Video Anomaly Detection Using a Transformer Spatiotemporal Attention Unsupervised Framework for Large Datasets". Algorithms 17, n. 7 (1 luglio 2024): 286. http://dx.doi.org/10.3390/a17070286.
Testo completoUsmani, Shaheen, Sunil Kumar e Debanjan Sadhya. "Spatio-temporal knowledge distilled video vision transformer (STKD-VViT) for multimodal deepfake detection". Neurocomputing 620 (marzo 2025): 129256. https://doi.org/10.1016/j.neucom.2024.129256.
Testo completoKumar, Yulia, Kuan Huang, Chin-Chien Lin, Annaliese Watson, J. Jenny Li, Patricia Morreale e Justin Delgado. "Applying Swin Architecture to Diverse Sign Language Datasets". Electronics 13, n. 8 (16 aprile 2024): 1509. http://dx.doi.org/10.3390/electronics13081509.
Testo completoLi, Yixiao, Lixiang Li, Zirui Zhuang, Yuan Fang, Haipeng Peng e Nam Ling. "Transformer-Based Data-Driven Video Coding Acceleration for Industrial Applications". Mathematical Problems in Engineering 2022 (27 settembre 2022): 1–11. http://dx.doi.org/10.1155/2022/1440323.
Testo completoNikulina, Olena, Valerii Severyn, Oleksii Kondratov e Oleksii Olhovoy. "MODELS OF REMOTE IDENTIFICATION OF PARAMETERS OF DYNAMIC OBJECTS USING DETECTION TRANSFORMERS AND OPTICAL FLOW". Bulletin of National Technical University "KhPI". Series: System Analysis, Control and Information Technologies, n. 1 (11) (30 luglio 2024): 52–57. http://dx.doi.org/10.20998/2079-0023.2024.01.08.
Testo completoEl Moaqet, Hisham, Rami Janini, Tamer Abdulbaki Alshirbaji, Nour Aldeen Jalal e Knut Möller. "Using Vision Transformers for Classifying Surgical Tools in Computer Aided Surgeries". Current Directions in Biomedical Engineering 10, n. 4 (1 dicembre 2024): 232–35. https://doi.org/10.1515/cdbme-2024-2056.
Testo completoJang, Hee-Deok, Seokjoon Kwon, Hyunwoo Nam e Dong Eui Chang. "Chemical Gas Source Localization with Synthetic Time Series Diffusion Data Using Video Vision Transformer". Applied Sciences 14, n. 11 (23 maggio 2024): 4451. http://dx.doi.org/10.3390/app14114451.
Testo completoMozaffari, M. Hamed, Yuchuan Li, Niloofar Hooshyaripour e Yoon Ko. "Vision-Based Prediction of Flashover Using Transformers and Convolutional Long Short-Term Memory Model". Electronics 13, n. 23 (3 dicembre 2024): 4776. https://doi.org/10.3390/electronics13234776.
Testo completoGeng, Xiaozhong, Cheng Chen, Ping Yu, Baijin Liu, Weixin Hu, Qipeng Liang e Xintong Zhang. "OM-VST: A video action recognition model based on optimized downsampling module combined with multi-scale feature fusion". PLOS ONE 20, n. 3 (6 marzo 2025): e0318884. https://doi.org/10.1371/journal.pone.0318884.
Testo completoKim, Nayeon, Sukhee Cho e Byungjun Bae. "SMaTE: A Segment-Level Feature Mixing and Temporal Encoding Framework for Facial Expression Recognition". Sensors 22, n. 15 (1 agosto 2022): 5753. http://dx.doi.org/10.3390/s22155753.
Testo completoLai, Derek Ka-Hei, Ethan Shiu-Wang Cheng, Bryan Pak-Hei So, Ye-Jiao Mao, Sophia Ming-Yan Cheung, Daphne Sze Ki Cheung, Duo Wai-Chi Wong e James Chung-Wai Cheung. "Transformer Models and Convolutional Networks with Different Activation Functions for Swallow Classification Using Depth Video Data". Mathematics 11, n. 14 (12 luglio 2023): 3081. http://dx.doi.org/10.3390/math11143081.
Testo completoLiu, Yuqi, Luhui Xu, Pengfei Xiong e Qin Jin. "Token Mixing: Parameter-Efficient Transfer Learning from Image-Language to Video-Language". Proceedings of the AAAI Conference on Artificial Intelligence 37, n. 2 (26 giugno 2023): 1781–89. http://dx.doi.org/10.1609/aaai.v37i2.25267.
Testo completoLorenzo, Javier, Ignacio Parra Alonso, Rubén Izquierdo, Augusto Luis Ballardini, Álvaro Hernández Saz, David Fernández Llorca e Miguel Ángel Sotelo. "CAPformer: Pedestrian Crossing Action Prediction Using Transformer". Sensors 21, n. 17 (24 agosto 2021): 5694. http://dx.doi.org/10.3390/s21175694.
Testo completoGuo, Zizhao, e Sancong Ying. "Whole-Body Keypoint and Skeleton Augmented RGB Networks for Video Action Recognition". Applied Sciences 12, n. 12 (18 giugno 2022): 6215. http://dx.doi.org/10.3390/app12126215.
Testo completoZhang, Renhong, Tianheng Cheng, Shusheng Yang, Haoyi Jiang, Shuai Zhang, Jiancheng Lyu, Xin Li et al. "MobileInst: Video Instance Segmentation on the Mobile". Proceedings of the AAAI Conference on Artificial Intelligence 38, n. 7 (24 marzo 2024): 7260–68. http://dx.doi.org/10.1609/aaai.v38i7.28555.
Testo completoZang, Chengbo, Mehmet Kerem Turkcan, Sanjeev Narasimhan, Yuqing Cao, Kaan Yarali, Zixuan Xiang, Skyler Szot et al. "Surgical Phase Recognition in Inguinal Hernia Repair—AI-Based Confirmatory Baseline and Exploration of Competitive Models". Bioengineering 10, n. 6 (27 maggio 2023): 654. http://dx.doi.org/10.3390/bioengineering10060654.
Testo completoLiu, Hao, Jiwen Lu, Jianjiang Feng e Jie Zhou. "Two-Stream Transformer Networks for Video-Based Face Alignment". IEEE Transactions on Pattern Analysis and Machine Intelligence 40, n. 11 (1 novembre 2018): 2546–54. http://dx.doi.org/10.1109/tpami.2017.2734779.
Testo completoKhan, Salman, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan e Mubarak Shah. "Transformers in Vision: A Survey". ACM Computing Surveys, 6 gennaio 2022. http://dx.doi.org/10.1145/3505244.
Testo completoHsu, Tzu-Chun, Yi-Sheng Liao e Chun-Rong Huang. "Video Summarization With Spatiotemporal Vision Transformer". IEEE Transactions on Image Processing, 2023, 1. http://dx.doi.org/10.1109/tip.2023.3275069.
Testo completo