Academic literature on the topic 'Visual speech recognition'
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Visual speech recognition.'
Journal articles on the topic "Visual speech recognition"
Beadles, Robert L. "Audio visual speech recognition." Journal of the Acoustical Society of America 87, no. 5 (May 1990): 2274. http://dx.doi.org/10.1121/1.399137.
Dupont, S., and J. Luettin. "Audio-visual speech modeling for continuous speech recognition." IEEE Transactions on Multimedia 2, no. 3 (2000): 141–51. http://dx.doi.org/10.1109/6046.865479.
Brahme, Aparna, and Umesh Bhadade. "Effect of Various Visual Speech Units on Language Identification Using Visual Speech Recognition." International Journal of Image and Graphics 20, no. 04 (October 2020): 2050029. http://dx.doi.org/10.1142/s0219467820500291.
Elrefaei, Lamiaa A., Tahani Q. Alhassan, and Shefaa S. Omar. "An Arabic Visual Dataset for Visual Speech Recognition." Procedia Computer Science 163 (2019): 400–409. http://dx.doi.org/10.1016/j.procs.2019.12.122.
Rosenblum, Lawrence D., Deborah A. Yakel, Naser Baseer, Anjani Panchal, Brynn C. Nodarse, and Ryan P. Niehus. "Visual speech information for face recognition." Perception & Psychophysics 64, no. 2 (February 2002): 220–29. http://dx.doi.org/10.3758/bf03195788.
Yu, Dahai, Ovidiu Ghita, Alistair Sutherland, and Paul F. Whelan. "A Novel Visual Speech Representation and HMM Classification for Visual Speech Recognition." IPSJ Transactions on Computer Vision and Applications 2 (2010): 25–38. http://dx.doi.org/10.2197/ipsjtcva.2.25.
Salama, Elham S., Reda A. El-Khoribi, and Mahmoud E. Shoman. "Audio-Visual Speech Recognition for People with Speech Disorders." International Journal of Computer Applications 96, no. 2 (June 18, 2014): 51–56. http://dx.doi.org/10.5120/16770-6337.
Nakadai, Kazuhiro, and Tomoaki Koiwa. "Psychologically-Inspired Audio-Visual Speech Recognition Using Coarse Speech Recognition and Missing Feature Theory." Journal of Robotics and Mechatronics 29, no. 1 (February 20, 2017): 105–13. http://dx.doi.org/10.20965/jrm.2017.p0105.
Bahal, Akriti. "Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Speech Recognition." IOSR Journal of Computer Engineering 5, no. 1 (2012): 31–36. http://dx.doi.org/10.9790/0661-0513136.
Seong, Thum Wei, M. Z. Ibrahim, and D. J. Mulvaney. "WADA-W: A Modified WADA SNR Estimator for Audio-Visual Speech Recognition." International Journal of Machine Learning and Computing 9, no. 4 (August 2019): 446–51. http://dx.doi.org/10.18178/ijmlc.2019.9.4.824.
Dissertations / Theses on the topic "Visual speech recognition"
Luettin, Juergen. "Visual speech and speaker recognition." Thesis, University of Sheffield, 1997. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.264432.
Miyajima, C., D. Negi, Y. Ninomiya, M. Sano, K. Mori, K. Itou, K. Takeda, and Y. Suenaga. "Audio-Visual Speech Database for Bimodal Speech Recognition." INTELLIGENT MEDIA INTEGRATION NAGOYA UNIVERSITY / COE, 2005. http://hdl.handle.net/2237/10460.
Pachoud, Samuel. "Audio-visual speech and emotion recognition." Thesis, Queen Mary, University of London, 2010. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.528923.
Matthews, Iain. "Features for audio-visual speech recognition." Thesis, University of East Anglia, 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.266736.
Seymour, R. "Audio-visual speech and speaker recognition." Thesis, Queen's University Belfast, 2008. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.492489.
Rabi, Gihad. "Visual speech recognition by recurrent neural networks." Thesis, National Library of Canada = Bibliothèque nationale du Canada, 1997. http://www.collectionscanada.ca/obj/s4/f2/dsk2/tape16/PQDD_0010/MQ36169.pdf.
Kaucic, Robert August. "Lip tracking for audio-visual speech recognition." Thesis, University of Oxford, 1997. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.360392.
Saeed, Mehreen. "Soft AI methods and visual speech recognition." Thesis, University of Bristol, 1999. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.299270.
Saenko, Ekaterina 1976. "Articulatory features for robust visual speech recognition." Thesis, Massachusetts Institute of Technology, 2004. http://hdl.handle.net/1721.1/28736.
Includes bibliographical references (p. 99-105).
This thesis explores a novel approach to visual speech modeling. Visual speech, or a sequence of images of the speaker's face, is traditionally viewed as a single stream of contiguous units, each corresponding to a phonetic segment. These units are defined heuristically by mapping several visually similar phonemes to one visual phoneme, sometimes referred to as a viseme. However, experimental evidence shows that phonetic models trained from visual data are not synchronous in time with acoustic phonetic models, indicating that visemes may not be the most natural building blocks of visual speech. Instead, we propose to model the visual signal in terms of the underlying articulatory features. This approach is a natural extension of feature-based modeling of acoustic speech, which has been shown to increase robustness of audio-based speech recognition systems. We start by exploring ways of defining visual articulatory features: first in a data-driven manner, using a large, multi-speaker visual speech corpus, and then in a knowledge-driven manner, using the rules of speech production. Based on these studies, we propose a set of articulatory features, and describe a computational framework for feature-based visual speech recognition. Multiple feature streams are detected in the input image sequence using Support Vector Machines, and then incorporated in a Dynamic Bayesian Network to obtain the final word hypothesis. Preliminary experiments show that our approach increases viseme classification rates in visually noisy conditions, and improves visual word recognition through feature-based context modeling.
by Ekaterina Saenko.
S.M.
Pass, A. R. "Towards pose invariant visual speech processing." Thesis, Queen's University Belfast, 2013. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.580170.
Books on the topic "Visual speech recognition"
Liew, Alan Wee-Chung. Visual speech recognition: Lip segmentation and mapping. Edited by Wang Shilin. Hershey PA: Medical Information Science Reference, 2009.
Hornegger, Joachim, ed. Pattern recognition and image processing in C++. Wiesbaden: Vieweg, 1995.
Windows speech recognition programming: With Visual Basic and ActiveX voice controls; exploring Speech API (SAPI) & Software Developer Kit (SDK) for voice input & output enabling of Windows applications. New York: IUniverse, Inc., 2004.
Cranton, Wayne, Mark Fihn, and SpringerLink (Online service), eds. Handbook of Visual Display Technology. Berlin, Heidelberg: Springer Berlin Heidelberg, 2012.
Lip Tracking for Audio-Visual Speech Recognition. Storming Media, 1997.
Schwab, Eileen C., and Howard C. Nusbaum, eds. Pattern recognition by humans and machines. Orlando, Fla: Academic Press, 1986.
Hornegger, Joachim, and Dietrich W. R. Paulus. Pattern Recognition and Image Processing in C++ (Vieweg Advanced Studies in Computer Science). Friedrich Vieweg & Sohn Verlag, 1995.
Integrating Face and Voice in Person Perception. Springer, 2012.
Cranton, Wayne, Janglin Chen, and Mark Fihn. Handbook of Visual Display Technology. Springer, 2016.
Cranton, Wayne, Janglin Chen, and Mark Fihn. Handbook of Visual Display Technology. Springer, 2012.
Book chapters on the topic "Visual speech recognition"
Luettin, Juergen, and Stéphane Dupont. "Continuous audio-visual speech recognition." In Lecture Notes in Computer Science, 657–73. Berlin, Heidelberg: Springer Berlin Heidelberg, 1998. http://dx.doi.org/10.1007/bfb0054771.
Mahadevaswamy, U. B., M. Shashank Rao, S. Vrushab, C. Anagha, and V. Sangameshwar. "Visual Speech Processing and Recognition." In Advances in Intelligent Systems and Computing, 481–91. Singapore: Springer Singapore, 2020. http://dx.doi.org/10.1007/978-981-15-3383-9_44.
Wachsmuth, Sven, Gernot A. Fink, Franz Kümmert, and Gerhard Sagerer. "Using Speech in Visual Object Recognition." In Informatik aktuell, 428–35. Berlin, Heidelberg: Springer Berlin Heidelberg, 2000. http://dx.doi.org/10.1007/978-3-642-59802-9_54.
Gupta, Deepika, Preety Singh, V. Laxmi, and Manoj S. Gaur. "Boundary Descriptors for Visual Speech Recognition." In Computer and Information Sciences II, 307–13. London: Springer London, 2011. http://dx.doi.org/10.1007/978-1-4471-2155-8_39.
Yu, Dahai, Ovidiu Ghita, Alistair Sutherland, and Paul F. Whelan. "A Novel Visual Speech Representation and HMM Classification for Visual Speech Recognition." In Advances in Image and Video Technology, 398–409. Berlin, Heidelberg: Springer Berlin Heidelberg, 2009. http://dx.doi.org/10.1007/978-3-540-92957-4_35.
Kratt, Jan, Florian Metze, Rainer Stiefelhagen, and Alex Waibel. "Large Vocabulary Audio-Visual Speech Recognition Using the Janus Speech Recognition Toolkit." In Lecture Notes in Computer Science, 488–95. Berlin, Heidelberg: Springer Berlin Heidelberg, 2004. http://dx.doi.org/10.1007/978-3-540-28649-3_60.
Ivanko, Denis, Dmitry Ryumin, Alexandr Axyonov, and Miloš Železný. "Designing Advanced Geometric Features for Automatic Russian Visual Speech Recognition." In Speech and Computer, 245–54. Cham: Springer International Publishing, 2018. http://dx.doi.org/10.1007/978-3-319-99579-3_26.
Anwar, M. A., Jim F. Baldwin, and Trevor P. Martin. "Learning Fuzzy Rules for Visual Speech Recognition." In Adaptive Multimedia Retrieval, 164–75. Berlin, Heidelberg: Springer Berlin Heidelberg, 2004. http://dx.doi.org/10.1007/978-3-540-25981-7_11.
Karpov, Alexey, Alexander Ronzhin, Irina Kipyatkova, Andrey Ronzhin, Vasilisa Verkhodanova, Anton Saveliev, and Milos Zelezny. "Bimodal Speech Recognition Fusing Audio-Visual Modalities." In Lecture Notes in Computer Science, 170–79. Cham: Springer International Publishing, 2016. http://dx.doi.org/10.1007/978-3-319-39516-6_16.
Singh, Preety, Vijay Laxmi, and Manoj Singh Gaur. "Visual Speech Recognition with Selected Boundary Descriptors." In Image Feature Detectors and Descriptors, 367–83. Cham: Springer International Publishing, 2016. http://dx.doi.org/10.1007/978-3-319-28854-3_14.
Conference papers on the topic "Visual speech recognition"
Devi, Sulochana, Siddhi Chokshi, Kritika Kotian, and Juili Warwatkar. "Visual Speech Recognition." In 2021 4th Biennial International Conference on Nascent Technologies in Engineering (ICNTE). IEEE, 2021. http://dx.doi.org/10.1109/icnte51185.2021.9487784.
Shillingford, Brendan, Yannis Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, et al. "Large-Scale Visual Speech Recognition." In Interspeech 2019. ISCA, 2019. http://dx.doi.org/10.21437/interspeech.2019-1669.
Fook, C. Y., M. Hariharan, Sazali Yaacob, and AH Adom. "A review: Malay speech recognition and audio visual speech recognition." In 2012 International Conference on Biomedical Engineering (ICoBE). IEEE, 2012. http://dx.doi.org/10.1109/icobe.2012.6179063.
Galatas, Georgios, Gerasimos Potamianos, Alexandros Papangelis, and Fillia Makedon. "Audio visual speech recognition in noisy visual environments." In the 4th International Conference. New York, New York, USA: ACM Press, 2011. http://dx.doi.org/10.1145/2141622.2141646.
Zhang, X., R. M. Mersereau, M. Clements, and C. C. Broun. "Visual speech feature extraction for improved speech recognition." In Proceedings of ICASSP '02. IEEE, 2002. http://dx.doi.org/10.1109/icassp.2002.5745022.
Zhang, Mersereau, Clements, and Broun. "Visual speech feature extraction for improved speech recognition." In IEEE International Conference on Acoustics Speech and Signal Processing ICASSP-02. IEEE, 2002. http://dx.doi.org/10.1109/icassp.2002.1006162.
Benhaim, Eric, Hichem Sahbi, and Guillaume Vittey. "Continuous visual speech recognition for audio speech enhancement." In ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2015. http://dx.doi.org/10.1109/icassp.2015.7178370.
Silsbee, Peter L., and Alan C. Bovik. "Audio-visual speech recognition for a vowel discrimination task." In Visual Communications '93, edited by Barry G. Haskell and Hsueh-Ming Hang. SPIE, 1993. http://dx.doi.org/10.1117/12.157855.
Reikeras, Helge, Ben Herbst, Johan du Preez, and Herman Engelbrecht. "Audio-Visual Speech Recognition using SciPy." In Python in Science Conference. SciPy, 2010. http://dx.doi.org/10.25080/majora-92bf1922-010.
Frisky, Aufaclav Zatu Kusuma, Chien-Yao Wang, Andri Santoso, and Jia-Ching Wang. "Lip-based visual speech recognition system." In 2015 International Carnahan Conference on Security Technology (ICCST). IEEE, 2015. http://dx.doi.org/10.1109/ccst.2015.7389703.