Artículos de revistas sobre el tema "Audio speech recognition"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte los 50 mejores artículos de revistas para su investigación sobre el tema "Audio speech recognition".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Explore artículos de revistas sobre una amplia variedad de disciplinas y organice su bibliografía correctamente.
Beadles, Robert L. "Audio visual speech recognition". Journal of the Acoustical Society of America 87, n.º 5 (mayo de 1990): 2274. http://dx.doi.org/10.1121/1.399137.
Texto completoBahal, Akriti. "Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Speech Recognition". IOSR Journal of Computer Engineering 5, n.º 1 (2012): 31–36. http://dx.doi.org/10.9790/0661-0513136.
Texto completoHwang, Jung-Wook, Jeongkyun Park, Rae-Hong Park y Hyung-Min Park. "Audio-visual speech recognition based on joint training with audio-visual speech enhancement for robust speech recognition". Applied Acoustics 211 (agosto de 2023): 109478. http://dx.doi.org/10.1016/j.apacoust.2023.109478.
Texto completoNakadai, Kazuhiro y Tomoaki Koiwa. "Psychologically-Inspired Audio-Visual Speech Recognition Using Coarse Speech Recognition and Missing Feature Theory". Journal of Robotics and Mechatronics 29, n.º 1 (20 de febrero de 2017): 105–13. http://dx.doi.org/10.20965/jrm.2017.p0105.
Texto completoBASYSTIUK, Oleh y Nataliia MELNYKOVA. "MULTIMODAL SPEECH RECOGNITION BASED ON AUDIO AND TEXT DATA". Herald of Khmelnytskyi National University. Technical sciences 313, n.º 5 (27 de octubre de 2022): 22–25. http://dx.doi.org/10.31891/2307-5732-2022-313-5-22-25.
Texto completoDupont, S. y J. Luettin. "Audio-visual speech modeling for continuous speech recognition". IEEE Transactions on Multimedia 2, n.º 3 (2000): 141–51. http://dx.doi.org/10.1109/6046.865479.
Texto completoKubanek, M., J. Bobulski y L. Adrjanowicz. "Characteristics of the use of coupled hidden Markov models for audio-visual polish speech recognition". Bulletin of the Polish Academy of Sciences: Technical Sciences 60, n.º 2 (1 de octubre de 2012): 307–16. http://dx.doi.org/10.2478/v10175-012-0041-6.
Texto completoKacur, Juraj, Boris Puterka, Jarmila Pavlovicova y Milos Oravec. "Frequency, Time, Representation and Modeling Aspects for Major Speech and Audio Processing Applications". Sensors 22, n.º 16 (22 de agosto de 2022): 6304. http://dx.doi.org/10.3390/s22166304.
Texto completoShowkat Ahmad Dar, Showkat Ahmad Dar. "Emotion Recognition Based On Audio Speech". IOSR Journal of Computer Engineering 11, n.º 6 (2013): 46–50. http://dx.doi.org/10.9790/0661-1164650.
Texto completoAucouturier, Jean-Julien y Laurent Daudet. "Pattern recognition of non-speech audio". Pattern Recognition Letters 31, n.º 12 (septiembre de 2010): 1487–88. http://dx.doi.org/10.1016/j.patrec.2010.05.003.
Texto completoChaturvedi, Iti, Tim Noel y Ranjan Satapathy. "Speech Emotion Recognition Using Audio Matching". Electronics 11, n.º 23 (29 de noviembre de 2022): 3943. http://dx.doi.org/10.3390/electronics11233943.
Texto completoGnanamanickam, Jenifa, Yuvaraj Natarajan y Sri Preethaa K. R. "A Hybrid Speech Enhancement Algorithm for Voice Assistance Application". Sensors 21, n.º 21 (23 de octubre de 2021): 7025. http://dx.doi.org/10.3390/s21217025.
Texto completoConnell, Jonathan H. "Audio-only backoff in audio-visual speech recognition system". Journal of the Acoustical Society of America 125, n.º 6 (2009): 4109. http://dx.doi.org/10.1121/1.3155497.
Texto completoHazra, Sumon Kumar, Romana Rahman Ema, Syed Md Galib, Shalauddin Kabir y Nasim Adnan. "Emotion recognition of human speech using deep learning method and MFCC features". Radioelectronic and Computer Systems, n.º 4 (29 de noviembre de 2022): 161–72. http://dx.doi.org/10.32620/reks.2022.4.13.
Texto completoRyumin, Dmitry, Denis Ivanko y Elena Ryumina. "Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices". Sensors 23, n.º 4 (17 de febrero de 2023): 2284. http://dx.doi.org/10.3390/s23042284.
Texto completoJeon, Sanghun y Mun Sang Kim. "Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based Interaction Applications". Sensors 22, n.º 20 (12 de octubre de 2022): 7738. http://dx.doi.org/10.3390/s22207738.
Texto completoS.Salama, Elham, Reda A. El-Khoribi y Mahmoud E. Shoman. "Audio-Visual Speech Recognition for People with Speech Disorders". International Journal of Computer Applications 96, n.º 2 (18 de junio de 2014): 51–56. http://dx.doi.org/10.5120/16770-6337.
Texto completoReggiswarashari, Fauzivy y Sari Widya Sihwi. "Speech emotion recognition using 2D-convolutional neural network". International Journal of Electrical and Computer Engineering (IJECE) 12, n.º 6 (1 de diciembre de 2022): 6594. http://dx.doi.org/10.11591/ijece.v12i6.pp6594-6601.
Texto completoS*, Manisha, Nafisa H. Saida, Nandita Gopal y Roshni P. Anand. "Bimodal Emotion Recognition using Machine Learning". International Journal of Engineering and Advanced Technology 10, n.º 4 (30 de abril de 2021): 189–94. http://dx.doi.org/10.35940/ijeat.d2451.0410421.
Texto completoCAO, JIANGTAO, NAOYUKI KUBOTA, PING LI y HONGHAI LIU. "THE VISUAL-AUDIO INTEGRATED RECOGNITION METHOD FOR USER AUTHENTICATION SYSTEM OF PARTNER ROBOTS". International Journal of Humanoid Robotics 08, n.º 04 (diciembre de 2011): 691–705. http://dx.doi.org/10.1142/s0219843611002678.
Texto completoStewart, Darryl, Rowan Seymour, Adrian Pass y Ji Ming. "Robust Audio-Visual Speech Recognition Under Noisy Audio-Video Conditions". IEEE Transactions on Cybernetics 44, n.º 2 (febrero de 2014): 175–84. http://dx.doi.org/10.1109/tcyb.2013.2250954.
Texto completoGornostal, Alexandr y Yaroslaw Dorogyy. "Development of audio-visual speech recognition system". ScienceRise 12, n.º 1 (30 de diciembre de 2017): 42–47. http://dx.doi.org/10.15587/2313-8416.2017.118212.
Texto completoMishra, Saumya, Anup Kumar Gupta y Puneet Gupta. "DARE: Deceiving Audio–Visual speech Recognition model". Knowledge-Based Systems 232 (noviembre de 2021): 107503. http://dx.doi.org/10.1016/j.knosys.2021.107503.
Texto completoHasegawa-Johnson, Mark A., Jui-Ting Huang, Sarah King y Xi Zhou. "Normalized recognition of speech and audio events". Journal of the Acoustical Society of America 130, n.º 4 (octubre de 2011): 2524. http://dx.doi.org/10.1121/1.3655075.
Texto completoZick, Gregory L. y Lawrence Yapp. "Speech recognition of MPEG/audio encoded files". Journal of the Acoustical Society of America 112, n.º 6 (2002): 2520. http://dx.doi.org/10.1121/1.1536509.
Texto completoNoda, Kuniaki, Yuki Yamaguchi, Kazuhiro Nakadai, Hiroshi G. Okuno y Tetsuya Ogata. "Audio-visual speech recognition using deep learning". Applied Intelligence 42, n.º 4 (20 de diciembre de 2014): 722–37. http://dx.doi.org/10.1007/s10489-014-0629-7.
Texto completoUpadhyaya, Prashant, Omar Farooq, M. R. Abidi y Priyanka Varshney. "Comparative Study of Visual Feature for Bimodal Hindi Speech Recognition". Archives of Acoustics 40, n.º 4 (1 de diciembre de 2015): 609–19. http://dx.doi.org/10.1515/aoa-2015-0061.
Texto completoSalian, Beenaa, Omkar Narvade, Rujuta Tambewagh y Smita Bharne. "Speech Emotion Recognition using Time Distributed CNN and LSTM". ITM Web of Conferences 40 (2021): 03006. http://dx.doi.org/10.1051/itmconf/20214003006.
Texto completoMuhammad, Ghulam y Khalid Alghathbar. "Environment Recognition for Digital Audio Forensics Using MPEG-7 and MEL Cepstral Features". Journal of Electrical Engineering 62, n.º 4 (1 de julio de 2011): 199–205. http://dx.doi.org/10.2478/v10187-011-0032-0.
Texto completoWolfe, Jace y Erin C. Schafer. "Optimizing The Benefit of Sound Processors Coupled to Personal FM Systems". Journal of the American Academy of Audiology 19, n.º 08 (septiembre de 2008): 585–94. http://dx.doi.org/10.3766/jaaa.19.8.2.
Texto completoSaitoh, Takeshi. "Research on multi-modal silent speech recognition technology". Impact 2018, n.º 3 (15 de junio de 2018): 47–49. http://dx.doi.org/10.21820/23987073.2018.3.47.
Texto completoYang, Wenfeng, Pengyi Li, Wei Yang, Yuxing Liu, Yulong He, Ovanes Petrosian y Aleksandr Davydenko. "Research on Robust Audio-Visual Speech Recognition Algorithms". Mathematics 11, n.º 7 (5 de abril de 2023): 1733. http://dx.doi.org/10.3390/math11071733.
Texto completoGavali, A. B., Ghugarkar Pooja S., Khatake Supriya R. y Kothawale Rajnandini A. "Visual Speech Recognition Using Lips Movement". Journal of Signal Processing 9, n.º 2 (29 de mayo de 2023): 1–7. http://dx.doi.org/10.46610/josp.2023.v09i02.001.
Texto completoHe, Yibo, Kah Phooi Seng y Li Minn Ang. "Multimodal Sensor-Input Architecture with Deep Learning for Audio-Visual Speech Recognition in Wild". Sensors 23, n.º 4 (7 de febrero de 2023): 1834. http://dx.doi.org/10.3390/s23041834.
Texto completoKozma-Spytek, Linda y Christian Vogler. "Factors Affecting the Accessibility of Voice Telephony for People with Hearing Loss: Audio Encoding, Network Impairments, Video and Environmental Noise". ACM Transactions on Accessible Computing 14, n.º 4 (31 de diciembre de 2021): 1–35. http://dx.doi.org/10.1145/3479160.
Texto completoAuti, Dr Nisha, Atharva Pujari, Anagha Desai, Shreya Patil, Sanika Kshirsagar y Rutika Rindhe. "Advanced Audio Signal Processing for Speaker Recognition and Sentiment Analysis". International Journal for Research in Applied Science and Engineering Technology 11, n.º 5 (31 de mayo de 2023): 1717–24. http://dx.doi.org/10.22214/ijraset.2023.51825.
Texto completoYin, Bing, Shutong Niu, Haitao Tang, Lei Sun, Jun Du, Zhenhua Ling y Cong Liu. "An Investigation into Audio–Visual Speech Recognition under a Realistic Home–TV Scenario". Applied Sciences 13, n.º 7 (23 de marzo de 2023): 4100. http://dx.doi.org/10.3390/app13074100.
Texto completoOng, Kah Liang, Chin Poo Lee, Heng Siong Lim y Kian Ming Lim. "Speech emotion recognition with light gradient boosting decision trees machine". International Journal of Electrical and Computer Engineering (IJECE) 13, n.º 4 (1 de agosto de 2023): 4020. http://dx.doi.org/10.11591/ijece.v13i4.pp4020-4028.
Texto completoA, Prof Swethashree. "Speech Emotion Recognition". International Journal for Research in Applied Science and Engineering Technology 9, n.º 8 (31 de agosto de 2021): 2637–40. http://dx.doi.org/10.22214/ijraset.2021.37375.
Texto completoYu, Wentao, Steffen Zeiler y Dorothea Kolossa. "Reliability-Based Large-Vocabulary Audio-Visual Speech Recognition". Sensors 22, n.º 15 (23 de julio de 2022): 5501. http://dx.doi.org/10.3390/s22155501.
Texto completoWang, Junyi, Bingyao Li y Jiahong Zhang. "Use Brain-Like Audio Features to Improve Speech Recognition Performance". Journal of Sensors 2022 (19 de septiembre de 2022): 1–12. http://dx.doi.org/10.1155/2022/6742474.
Texto completoSeong, Thum Wei, M. Z. Ibrahim y D. J. Mulvaney. "WADA-W: A Modified WADA SNR Estimator for Audio-Visual Speech Recognition". International Journal of Machine Learning and Computing 9, n.º 4 (agosto de 2019): 446–51. http://dx.doi.org/10.18178/ijmlc.2019.9.4.824.
Texto completoEt. al., D. N. V. S. L. S. Indira,. "An Enhanced CNN-2D for Audio-Visual Emotion Recognition (AVER) Using ADAM Optimizer". Turkish Journal of Computer and Mathematics Education (TURCOMAT) 12, n.º 5 (11 de abril de 2021): 1378–88. http://dx.doi.org/10.17762/turcomat.v12i5.2030.
Texto completoTiwari, Rishin, Saloni Birthare y Mr Mayank Lovanshi. "Audio to Sign Language Converter". International Journal for Research in Applied Science and Engineering Technology 10, n.º 11 (30 de noviembre de 2022): 206–11. http://dx.doi.org/10.22214/ijraset.2022.47271.
Texto completoAxyonov, A. A., D. V. Ivanko, I. B. Lashkov, D. A. Ryumin, A. M. Kashevnik y A. A. Karpov. "A methodology of multimodal corpus creation for audio-visual speech recognition in assistive transport systems". Informatization and communication 5 (diciembre de 2020): 87–93. http://dx.doi.org/10.34219/2078-8320-2020-11-5-87-93.
Texto completoIvanko, Denis, Dmitry Ryumin y Alexey Karpov. "A Review of Recent Advances on Deep Learning Methods for Audio-Visual Speech Recognition". Mathematics 11, n.º 12 (12 de junio de 2023): 2665. http://dx.doi.org/10.3390/math11122665.
Texto completoWu, Xuan, Silong Zhou, Mingwei Chen, Yihang Zhao, Yifei Wang, Xianmeng Zhao, Danyang Li y Haibo Pu. "Combined spectral and speech features for pig speech recognition". PLOS ONE 17, n.º 12 (1 de diciembre de 2022): e0276778. http://dx.doi.org/10.1371/journal.pone.0276778.
Texto completoReddy, P. Deepak. "Multilingual Speech to Text using Deep Learning based on MFCC Features". Machine Learning and Applications: An International Journal 9, n.º 02 (30 de junio de 2022): 21–30. http://dx.doi.org/10.5121/mlaij.2022.9202.
Texto completoAiman, Aisha, Yao Shen, Malika Bendechache, Irum Inayat y Teerath Kumar. "AUDD: Audio Urdu Digits Dataset for Automatic Audio Urdu Digit Recognition". Applied Sciences 11, n.º 19 (23 de septiembre de 2021): 8842. http://dx.doi.org/10.3390/app11198842.
Texto completoHASHIMOTO, Masahiro y Masaharu KUMASHIRO. "Intermodal Timing Cues for Audio-Visual Speech Recognition". Journal of UOEH 26, n.º 2 (2004): 215–25. http://dx.doi.org/10.7888/juoeh.26.215.
Texto completo