Literatura académica sobre el tema "Audio speech recognition"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte las listas temáticas de artículos, libros, tesis, actas de conferencias y otras fuentes académicas sobre el tema "Audio speech recognition".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Artículos de revistas sobre el tema "Audio speech recognition"
Beadles, Robert L. "Audio visual speech recognition". Journal of the Acoustical Society of America 87, n.º 5 (mayo de 1990): 2274. http://dx.doi.org/10.1121/1.399137.
Texto completoBahal, Akriti. "Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Speech Recognition". IOSR Journal of Computer Engineering 5, n.º 1 (2012): 31–36. http://dx.doi.org/10.9790/0661-0513136.
Texto completoHwang, Jung-Wook, Jeongkyun Park, Rae-Hong Park y Hyung-Min Park. "Audio-visual speech recognition based on joint training with audio-visual speech enhancement for robust speech recognition". Applied Acoustics 211 (agosto de 2023): 109478. http://dx.doi.org/10.1016/j.apacoust.2023.109478.
Texto completoNakadai, Kazuhiro y Tomoaki Koiwa. "Psychologically-Inspired Audio-Visual Speech Recognition Using Coarse Speech Recognition and Missing Feature Theory". Journal of Robotics and Mechatronics 29, n.º 1 (20 de febrero de 2017): 105–13. http://dx.doi.org/10.20965/jrm.2017.p0105.
Texto completoBASYSTIUK, Oleh y Nataliia MELNYKOVA. "MULTIMODAL SPEECH RECOGNITION BASED ON AUDIO AND TEXT DATA". Herald of Khmelnytskyi National University. Technical sciences 313, n.º 5 (27 de octubre de 2022): 22–25. http://dx.doi.org/10.31891/2307-5732-2022-313-5-22-25.
Texto completoDupont, S. y J. Luettin. "Audio-visual speech modeling for continuous speech recognition". IEEE Transactions on Multimedia 2, n.º 3 (2000): 141–51. http://dx.doi.org/10.1109/6046.865479.
Texto completoKubanek, M., J. Bobulski y L. Adrjanowicz. "Characteristics of the use of coupled hidden Markov models for audio-visual polish speech recognition". Bulletin of the Polish Academy of Sciences: Technical Sciences 60, n.º 2 (1 de octubre de 2012): 307–16. http://dx.doi.org/10.2478/v10175-012-0041-6.
Texto completoKacur, Juraj, Boris Puterka, Jarmila Pavlovicova y Milos Oravec. "Frequency, Time, Representation and Modeling Aspects for Major Speech and Audio Processing Applications". Sensors 22, n.º 16 (22 de agosto de 2022): 6304. http://dx.doi.org/10.3390/s22166304.
Texto completoShowkat Ahmad Dar, Showkat Ahmad Dar. "Emotion Recognition Based On Audio Speech". IOSR Journal of Computer Engineering 11, n.º 6 (2013): 46–50. http://dx.doi.org/10.9790/0661-1164650.
Texto completoAucouturier, Jean-Julien y Laurent Daudet. "Pattern recognition of non-speech audio". Pattern Recognition Letters 31, n.º 12 (septiembre de 2010): 1487–88. http://dx.doi.org/10.1016/j.patrec.2010.05.003.
Texto completoTesis sobre el tema "Audio speech recognition"
Miyajima, C., D. Negi, Y. Ninomiya, M. Sano, K. Mori, K. Itou, K. Takeda y Y. Suenaga. "Audio-Visual Speech Database for Bimodal Speech Recognition". INTELLIGENT MEDIA INTEGRATION NAGOYA UNIVERSITY / COE, 2005. http://hdl.handle.net/2237/10460.
Texto completoSeymour, R. "Audio-visual speech and speaker recognition". Thesis, Queen's University Belfast, 2008. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.492489.
Texto completoPachoud, Samuel. "Audio-visual speech and emotion recognition". Thesis, Queen Mary, University of London, 2010. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.528923.
Texto completoMatthews, Iain. "Features for audio-visual speech recognition". Thesis, University of East Anglia, 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.266736.
Texto completoKaucic, Robert August. "Lip tracking for audio-visual speech recognition". Thesis, University of Oxford, 1997. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.360392.
Texto completoLucey, Simon. "Audio-visual speech processing". Thesis, Queensland University of Technology, 2002. https://eprints.qut.edu.au/36172/7/SimonLuceyPhDThesis.pdf.
Texto completoEriksson, Mattias. "Speech recognition availability". Thesis, Linköping University, Department of Computer and Information Science, 2004. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-2651.
Texto completoThis project investigates the importance of availability in the scope of dictation programs. Using speech recognition technology for dictating has not reached the public, and that may very well be a result of poor availability in today’s technical solutions.
I have constructed a persona character, Johanna, who personalizes the target user. I have also developed a solution that streams audio into a speech recognition server and sends back interpreted text. Johanna affirmed that the solution was successful in theory.
I then incorporated test users that tried out the solution in practice. Half of them do indeed claim that their usage has been and will continue to be increased thanks to the new level of availability.
Rao, Ram Raghavendra. "Audio-visual interaction in multimedia". Diss., Georgia Institute of Technology, 1998. http://hdl.handle.net/1853/13349.
Texto completoDean, David Brendan. "Synchronous HMMs for audio-visual speech processing". Thesis, Queensland University of Technology, 2008. https://eprints.qut.edu.au/17689/3/David_Dean_Thesis.pdf.
Texto completoDean, David Brendan. "Synchronous HMMs for audio-visual speech processing". Queensland University of Technology, 2008. http://eprints.qut.edu.au/17689/.
Texto completoLibros sobre el tema "Audio speech recognition"
Sen, Soumya, Anjan Dutta y Nilanjan Dey. Audio Processing and Speech Recognition. Singapore: Springer Singapore, 2019. http://dx.doi.org/10.1007/978-981-13-6098-5.
Texto completoOgunfunmi, Tokunbo, Roberto Togneri y Madihally Narasimha, eds. Speech and Audio Processing for Coding, Enhancement and Recognition. New York, NY: Springer New York, 2015. http://dx.doi.org/10.1007/978-1-4939-1456-2.
Texto completoJunqua, Jean-Claude. Robustness in automatic speech recognition: Fundamentals and applications. Boston: Kluwer Academic Publishers, 1996.
Buscar texto completoHarrison, Mark. The use of interactive audio and speech recognition techniques in training. [U.K.]: [s.n.], 1993.
Buscar texto completoAVBPA '97 ((1st 1997 Montana,Switzerland). Audio- and video-based biometric person authentication: First International Conference, AVBPA '97, Crans-Montana, Switzerland, March 1997 : proceedings. Berlin: Springer, 1997.
Buscar texto completo1946-, Kittler Josef y Nixon Mark S, eds. Audio-and video-based biometric person authentication: 4th International Conference, AVBPA 2003, Guildford, UK, June 2003 : proceedings. Berlin: Springer, 2003.
Buscar texto completoInternational Conference, AVBPA (1st 1997 Montana, Switzerland). Audio- and video-based biometric person authentication: First International Conference, AVBPA '97, Crans-Montana, Switzerland, March 12-14, 1997 : proceedings. Berlin: Springer, 1997.
Buscar texto completoIEEE Workshop on Automatic Speech Recognition and Understanding (1997 Santa Barbara, Calif.). 1997 IEEE Workshop on Automatic Speech Recognition and Understanding proceedings. Piscataway, NJ: Published under the sponsorship of the IEEE Signal Processing Society, 1997.
Buscar texto completoMinker, Wolfgang. Speech and human-machine dialog. Boston: Kluwer Academic Publishers, 2004.
Buscar texto completoMinker, Wolfgang. Speech and human-machine dialog. Boston: Kluwer Academic Publishers, 2004.
Buscar texto completoCapítulos de libros sobre el tema "Audio speech recognition"
Sen, Soumya, Anjan Dutta y Nilanjan Dey. "Audio Indexing". En Audio Processing and Speech Recognition, 1–11. Singapore: Springer Singapore, 2019. http://dx.doi.org/10.1007/978-981-13-6098-5_1.
Texto completoSen, Soumya, Anjan Dutta y Nilanjan Dey. "Audio Classification". En Audio Processing and Speech Recognition, 67–93. Singapore: Springer Singapore, 2019. http://dx.doi.org/10.1007/978-981-13-6098-5_4.
Texto completoRichter, Michael M., Sheuli Paul, Veton Këpuska y Marius Silaghi. "Audio Signals and Speech Recognition". En Signal Processing and Machine Learning with Applications, 345–68. Cham: Springer International Publishing, 2022. http://dx.doi.org/10.1007/978-3-319-45372-9_18.
Texto completoLuettin, Juergen y Stéphane Dupont. "Continuous audio-visual speech recognition". En Lecture Notes in Computer Science, 657–73. Berlin, Heidelberg: Springer Berlin Heidelberg, 1998. http://dx.doi.org/10.1007/bfb0054771.
Texto completoSen, Soumya, Anjan Dutta y Nilanjan Dey. "Speech Processing and Recognition System". En Audio Processing and Speech Recognition, 13–43. Singapore: Springer Singapore, 2019. http://dx.doi.org/10.1007/978-981-13-6098-5_2.
Texto completoSen, Soumya, Anjan Dutta y Nilanjan Dey. "Feature Extraction". En Audio Processing and Speech Recognition, 45–66. Singapore: Springer Singapore, 2019. http://dx.doi.org/10.1007/978-981-13-6098-5_3.
Texto completoSen, Soumya, Anjan Dutta y Nilanjan Dey. "Conclusion". En Audio Processing and Speech Recognition, 95–96. Singapore: Springer Singapore, 2019. http://dx.doi.org/10.1007/978-981-13-6098-5_5.
Texto completoSethu, Vidhyasaharan, Julien Epps y Eliathamby Ambikairajah. "Speech Based Emotion Recognition". En Speech and Audio Processing for Coding, Enhancement and Recognition, 197–228. New York, NY: Springer New York, 2014. http://dx.doi.org/10.1007/978-1-4939-1456-2_7.
Texto completoKarpov, Alexey, Alexander Ronzhin, Irina Kipyatkova, Andrey Ronzhin, Vasilisa Verkhodanova, Anton Saveliev y Milos Zelezny. "Bimodal Speech Recognition Fusing Audio-Visual Modalities". En Lecture Notes in Computer Science, 170–79. Cham: Springer International Publishing, 2016. http://dx.doi.org/10.1007/978-3-319-39516-6_16.
Texto completoKratt, Jan, Florian Metze, Rainer Stiefelhagen y Alex Waibel. "Large Vocabulary Audio-Visual Speech Recognition Using the Janus Speech Recognition Toolkit". En Lecture Notes in Computer Science, 488–95. Berlin, Heidelberg: Springer Berlin Heidelberg, 2004. http://dx.doi.org/10.1007/978-3-540-28649-3_60.
Texto completoActas de conferencias sobre el tema "Audio speech recognition"
Ko, Tom, Vijayaditya Peddinti, Daniel Povey y Sanjeev Khudanpur. "Audio augmentation for speech recognition". En Interspeech 2015. ISCA: ISCA, 2015. http://dx.doi.org/10.21437/interspeech.2015-711.
Texto completoLi, Xinyu, Venkata Chebiyyam y Katrin Kirchhoff. "Speech Audio Super-Resolution for Speech Recognition". En Interspeech 2019. ISCA: ISCA, 2019. http://dx.doi.org/10.21437/interspeech.2019-3043.
Texto completoPalecek, Karel y Josef Chaloupka. "Audio-visual speech recognition in noisy audio environments". En 2013 36th International Conference on Telecommunications and Signal Processing (TSP). IEEE, 2013. http://dx.doi.org/10.1109/tsp.2013.6613979.
Texto completoFook, C. Y., M. Hariharan, Sazali Yaacob y AH Adom. "A review: Malay speech recognition and audio visual speech recognition". En 2012 International Conference on Biomedical Engineering (ICoBE). IEEE, 2012. http://dx.doi.org/10.1109/icobe.2012.6179063.
Texto completoSinha, Arryan y G. Suseela. "Deep Learning-Based Speech Emotion Recognition". En International Research Conference on IOT, Cloud and Data Science. Switzerland: Trans Tech Publications Ltd, 2023. http://dx.doi.org/10.4028/p-0892re.
Texto completoBenhaim, Eric, Hichem Sahbi y Guillaume Vittey. "Continuous visual speech recognition for audio speech enhancement". En ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2015. http://dx.doi.org/10.1109/icassp.2015.7178370.
Texto completoReikeras, Helge, Ben Herbst, Johan du Preez y Herman Engelbrecht. "Audio-Visual Speech Recognition using SciPy". En Python in Science Conference. SciPy, 2010. http://dx.doi.org/10.25080/majora-92bf1922-010.
Texto completoTan, Hao, Chenwei Liu, Yinyu Lyu, Xiao Zhang, Denghui Zhang y Zhaoquan Gu. "Audio Steganography with Speech Recognition System". En 2021 IEEE Sixth International Conference on Data Science in Cyberspace (DSC). IEEE, 2021. http://dx.doi.org/10.1109/dsc53577.2021.00042.
Texto completoNarisetty, Chaitanya, Emiru Tsunoo, Xuankai Chang, Yosuke Kashiwagi, Michael Hentschel y Shinji Watanabe. "Joint Speech Recognition and Audio Captioning". En ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. http://dx.doi.org/10.1109/icassp43922.2022.9746601.
Texto completoYang, Karren, Dejan Markovic, Steven Krenn, Vasu Agrawal y Alexander Richard. "Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis". En 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2022. http://dx.doi.org/10.1109/cvpr52688.2022.00805.
Texto completoInformes sobre el tema "Audio speech recognition"
STANDARD OBJECT SYSTEMS INC. Advanced Audio Interface for Phonetic Speech Recognition in a High Noise Environment. Fort Belvoir, VA: Defense Technical Information Center, enero de 2000. http://dx.doi.org/10.21236/ada373461.
Texto completoIssues in Data Processing and Relevant Population Selection. OSAC Speaker Recognition Subcommittee, noviembre de 2022. http://dx.doi.org/10.29325/osac.tg.0006.
Texto completo