Literatura académica sobre el tema "Speech recognition"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte las listas temáticas de artículos, libros, tesis, actas de conferencias y otras fuentes académicas sobre el tema "Speech recognition".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Artículos de revistas sobre el tema "Speech recognition"
Miyazaki, Toshiyuki y Yoji Ishikawa. "Speech recognition device and speech recognition method". Journal of the Acoustical Society of America 126, n.º 3 (2009): 1648. http://dx.doi.org/10.1121/1.3230481.
Texto completoShinotsuka, Hiroshi y Noritoshi Hino. "Speech recognition method and speech recognition device". Journal of the Acoustical Society of America 111, n.º 4 (2002): 1518. http://dx.doi.org/10.1121/1.1479008.
Texto completoMohamad Nasir, A. B., N. R. M. Nasir y F. H. M. Salleh. "SPEESH: speech-based mobile application for dysarthric speech recognition". Journal of Physics: Conference Series 1860, n.º 1 (1 de marzo de 2021): 012003. http://dx.doi.org/10.1088/1742-6596/1860/1/012003.
Texto completoDowney, Simon N. "Speech Recognition". Journal of the Acoustical Society of America 130, n.º 6 (2011): 4183. http://dx.doi.org/10.1121/1.3669379.
Texto completoLennig, Matthew. "Speech recognition". Journal of the Acoustical Society of America 91, n.º 1 (enero de 1992): 546. http://dx.doi.org/10.1121/1.402661.
Texto completoAlotaibi, Y. A. y M. M. Shahsavari. "Speech recognition". IEEE Potentials 17, n.º 1 (1998): 23–28. http://dx.doi.org/10.1109/45.652853.
Texto completoFeldman, Joel A. "Speech recognition". Journal of the Acoustical Society of America 86, n.º 6 (diciembre de 1989): 2478. http://dx.doi.org/10.1121/1.398356.
Texto completoGadbois, Gregory J. y Stijn A. Van Even. "Speech recognition". Journal of the Acoustical Society of America 107, n.º 5 (2000): 2325. http://dx.doi.org/10.1121/1.428607.
Texto completoCameron, Ian R. y Paul C. Millar. "Speech recognition". Journal of the Acoustical Society of America 95, n.º 2 (febrero de 1994): 1185. http://dx.doi.org/10.1121/1.408412.
Texto completoAbe, Kenji. "SPEECH RECOGNITION SYSTEM AND METHOD FOR SPEECH RECOGNITION". Journal of the Acoustical Society of America 134, n.º 1 (2013): 738. http://dx.doi.org/10.1121/1.4813052.
Texto completoTesis sobre el tema "Speech recognition"
Chuchilina, L. M. y I. E. Yeskov. "Speech recognition". Thesis, Видавництво СумДУ, 2008. http://essuir.sumdu.edu.ua/handle/123456789/15995.
Texto completoAlcaraz, Meseguer Noelia. "Speech Analysis for Automatic Speech Recognition". Thesis, Norwegian University of Science and Technology, Department of Electronics and Telecommunications, 2009. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-9092.
Texto completoThe classical front end analysis in speech recognition is a spectral analysis which parametrizes the speech signal into feature vectors; the most popular set of them is the Mel Frequency Cepstral Coefficients (MFCC). They are based on a standard power spectrum estimate which is first subjected to a log-based transform of the frequency axis (mel- frequency scale), and then decorrelated by using a modified discrete cosine transform. Following a focused introduction on speech production, perception and analysis, this paper gives a study of the implementation of a speech generative model; whereby the speech is synthesized and recovered back from its MFCC representations. The work has been developed into two steps: first, the computation of the MFCC vectors from the source speech files by using HTK Software; and second, the implementation of the generative model in itself, which, actually, represents the conversion chain from HTK-generated MFCC vectors to speech reconstruction. In order to know the goodness of the speech coding into feature vectors and to evaluate the generative model, the spectral distance between the original speech signal and the one produced from the MFCC vectors has been computed. For that, spectral models based on Linear Prediction Coding (LPC) analysis have been used. During the implementation of the generative model some results have been obtained in terms of the reconstruction of the spectral representation and the quality of the synthesized speech.
Kleinschmidt, Tristan Friedrich. "Robust speech recognition using speech enhancement". Thesis, Queensland University of Technology, 2010. https://eprints.qut.edu.au/31895/1/Tristan_Kleinschmidt_Thesis.pdf.
Texto completoEriksson, Mattias. "Speech recognition availability". Thesis, Linköping University, Department of Computer and Information Science, 2004. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-2651.
Texto completoThis project investigates the importance of availability in the scope of dictation programs. Using speech recognition technology for dictating has not reached the public, and that may very well be a result of poor availability in today’s technical solutions.
I have constructed a persona character, Johanna, who personalizes the target user. I have also developed a solution that streams audio into a speech recognition server and sends back interpreted text. Johanna affirmed that the solution was successful in theory.
I then incorporated test users that tried out the solution in practice. Half of them do indeed claim that their usage has been and will continue to be increased thanks to the new level of availability.
Uebler, Ulla. "Multilingual speech recognition /". Berlin : Logos Verlag, 2000. http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&doc_number=009117880&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA.
Texto completoWang, Yonglian. "Speech Recognition under Stress". Available to subscribers only, 2009. http://proquest.umi.com/pqdweb?did=1968468151&sid=9&Fmt=2&clientId=1509&RQT=309&VName=PQD.
Texto completoLucas, Adrian Edward. "Acoustic level speech recognition". Thesis, University of Surrey, 1991. http://epubs.surrey.ac.uk/2819/.
Texto completoŽmolíková, Kateřina. "Far-Field Speech Recognition". Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2016. http://www.nusl.cz/ntk/nusl-255331.
Texto completoSun, Felix (Felix W. ). "Speech Representation Models for Speech Synthesis and Multimodal Speech Recognition". Thesis, Massachusetts Institute of Technology, 2016. http://hdl.handle.net/1721.1/106378.
Texto completoThis electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
Cataloged from student-submitted PDF version of thesis.
Includes bibliographical references (pages 59-63).
The field of speech recognition has seen steady advances over the last two decades, leading to the accurate, real-time recognition systems available on mobile phones today. In this thesis, I apply speech modeling techniques developed for recognition to two other speech problems: speech synthesis and multimodal speech recognition with images. In both problems, there is a need to learn a relationship between speech sounds and another source of information. For speech synthesis, I show that using a neural network acoustic model results in a synthesizer that is more tolerant of noisy training data than previous work. For multimodal recognition, I show how information from images can be effectively integrated into the recognition search framework, resulting in improved accuracy when image data is available.
by Felix Sun.
M. Eng.
Miyajima, C., D. Negi, Y. Ninomiya, M. Sano, K. Mori, K. Itou, K. Takeda y Y. Suenaga. "Audio-Visual Speech Database for Bimodal Speech Recognition". INTELLIGENT MEDIA INTEGRATION NAGOYA UNIVERSITY / COE, 2005. http://hdl.handle.net/2237/10460.
Texto completoLibros sobre el tema "Speech recognition"
Yu, Dong y Li Deng. Automatic Speech Recognition. London: Springer London, 2015. http://dx.doi.org/10.1007/978-1-4471-5779-3.
Texto completoLee, Kai-Fu. Automatic Speech Recognition. Boston, MA: Springer US, 1989. http://dx.doi.org/10.1007/978-1-4615-3650-5.
Texto completoBourlard, Hervé A. y Nelson Morgan. Connectionist Speech Recognition. Boston, MA: Springer US, 1994. http://dx.doi.org/10.1007/978-1-4615-3210-1.
Texto completoMarkowitz, Judith A. Using speech recognition. Upper Saddle River, N.J: Prentice Hall PTR, 1996.
Buscar texto completoWoelfel, Matthias. Distant speech recognition. Chichester, West Sussex, U.K: Wiley, 2009.
Buscar texto completoBuydos, John F. Speech recognition and processing. Washington, D.C: Science Reference Section, Science and Technology Division, Library of Congress, 1994.
Buscar texto completoHuang, X. D. Hidden Markov models for speech recognition. Edinburgh: Edinburgh University Press, 1990.
Buscar texto completoHolmes, J. N. Speech synthesis and recognition. 2a ed. New York: Taylor & Francis, 2001.
Buscar texto completoLaface, Pietro y Renato De Mori, eds. Speech Recognition and Understanding. Berlin, Heidelberg: Springer Berlin Heidelberg, 1992. http://dx.doi.org/10.1007/978-3-642-76626-8.
Texto completoNeustein, Amy, ed. Advances in Speech Recognition. Boston, MA: Springer US, 2010. http://dx.doi.org/10.1007/978-1-4419-5951-5.
Texto completoCapítulos de libros sobre el tema "Speech recognition"
Fink, Gernot A. "Speech Recognition". En Markov Models for Pattern Recognition, 229–36. London: Springer London, 2014. http://dx.doi.org/10.1007/978-1-4471-6308-4_13.
Texto completoMorris, Tim. "Speech Recognition". En Multimedia Systems, 89–100. London: Springer London, 2000. http://dx.doi.org/10.1007/978-1-4471-0455-1_7.
Texto completoXin, Jack y Yingyong Qi. "Speech Recognition". En Mathematical Modeling and Signal Processing in Speech and Hearing Sciences, 115–39. Cham: Springer International Publishing, 2014. http://dx.doi.org/10.1007/978-3-319-03086-9_4.
Texto completoPaulus, Dietrich W. R. y Joachim Hornegger. "Speech Recognition". En Pattern Recognition of Images and Speech in C++, 329–53. Wiesbaden: Vieweg+Teubner Verlag, 1997. http://dx.doi.org/10.1007/978-3-663-13991-1_25.
Texto completoFarouk, Mohamed Hesham. "Speech Recognition". En SpringerBriefs in Electrical and Computer Engineering, 27–29. Cham: Springer International Publishing, 2013. http://dx.doi.org/10.1007/978-3-319-02732-6_6.
Texto completoStölzle, Anton. "Speech Recognition". En The Kluwer International Series in Engineering and Computer Science, 321–38. Boston, MA: Springer US, 1992. http://dx.doi.org/10.1007/978-1-4615-3570-6_21.
Texto completoFarouk, Mohamed Hesham. "Speech Recognition". En SpringerBriefs in Electrical and Computer Engineering, 41–46. Cham: Springer International Publishing, 2017. http://dx.doi.org/10.1007/978-3-319-69002-5_7.
Texto completoWeik, Martin H. "speech recognition". En Computer Science and Communications Dictionary, 1636. Boston, MA: Springer US, 2000. http://dx.doi.org/10.1007/1-4020-0613-6_17919.
Texto completoSinha, Priyabrata. "Speech Recognition". En Speech Processing in Embedded Systems, 143–55. Boston, MA: Springer US, 2009. http://dx.doi.org/10.1007/978-0-387-75581-6_10.
Texto completoRenals, Steve y Thomas Hain. "Speech Recognition". En The Handbook of Computational Linguistics and Natural Language Processing, 297–332. Oxford, UK: Wiley-Blackwell, 2010. http://dx.doi.org/10.1002/9781444324044.ch12.
Texto completoActas de conferencias sobre el tema "Speech recognition"
Manabe, Hiroyuki, Akira Hiraiwa y Toshiaki Sugimura. "Speech recognition using EMG; mime speech recognition". En 8th European Conference on Speech Communication and Technology (Eurospeech 2003). ISCA: ISCA, 2003. http://dx.doi.org/10.21437/eurospeech.2003-524.
Texto completoNegoita, Alexandru, George Suciu, Svetlana Segarceanu y Dan Trufin. "SPEECH RECOGNITION SYSTEM". En eLSE 2021. ADL Romania, 2021. http://dx.doi.org/10.12753/2066-026x-21-095.
Texto completoWoodland, P. "Speech recognition". En IEE Colloquium Speech and Language Engineering - State of the Art. IEE, 1998. http://dx.doi.org/10.1049/ic:19980956.
Texto completoManabe, Hiroyuki, Akira Hiraiwa y Toshiaki Sugimura. ""Unvoiced speech recognition using EMG - mime speech recognition"". En CHI '03 extended abstracts. New York, New York, USA: ACM Press, 2003. http://dx.doi.org/10.1145/765891.765996.
Texto completoDong Wang, Lie Lu y Hong-Jiang Zhang. "Speech segmentation without speech recognition". En 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698). IEEE, 2003. http://dx.doi.org/10.1109/icme.2003.1220940.
Texto completoNewell, Alan F., John L. Arnott y R. Dye. "A full speed speech simulation of speech recognition machines". En European Conference on Speech Technology. ISCA: ISCA, 1987. http://dx.doi.org/10.21437/ecst.1987-207.
Texto completoPaulose, Supriya, Shikhamoni Nath y Samudravijaya K. "Marathi Speech Recognition". En The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages. ISCA: ISCA, 2018. http://dx.doi.org/10.21437/sltu.2018-48.
Texto completoThorat, Roopa A. y Ruchira A. Jadhav. "Speech recognition system". En the International Conference. New York, New York, USA: ACM Press, 2009. http://dx.doi.org/10.1145/1523103.1523226.
Texto completoHui Lin y Jeff Bilmes. "Polyphase speech recognition". En ICASSP 2008 - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2008. http://dx.doi.org/10.1109/icassp.2008.4518558.
Texto completoDevi, Sulochana, Siddhi Chokshi, Kritika Kotian y Juili Warwatkar. "Visual Speech Recognition". En 2021 4th Biennial International Conference on Nascent Technologies in Engineering (ICNTE). IEEE, 2021. http://dx.doi.org/10.1109/icnte51185.2021.9487784.
Texto completoInformes sobre el tema "Speech recognition"
Hoeferlin, David M., Brian M. Ore, Stephen A. Thorn y David Snyder. Speech Processing and Recognition (SPaRe). Fort Belvoir, VA: Defense Technical Information Center, enero de 2011. http://dx.doi.org/10.21236/ada540142.
Texto completoKubala, F., S. Austin, C. Barry, J. Makhoul, P. Placeway y R. Schwartz. Byblos Speech Recognition Benchmark Results. Fort Belvoir, VA: Defense Technical Information Center, enero de 1991. http://dx.doi.org/10.21236/ada459943.
Texto completoSchwartz, Richard y Owen Kimball. Toward Real-Time Continuous Speech Recognition. Fort Belvoir, VA: Defense Technical Information Center, marzo de 1989. http://dx.doi.org/10.21236/ada208196.
Texto completoLiu, Fu-Hua, Pedro J. Moreno, Richard M. Stern y Alejandro Acero. Signal Processing for Robust Speech Recognition. Fort Belvoir, VA: Defense Technical Information Center, enero de 1994. http://dx.doi.org/10.21236/ada457798.
Texto completoSchwartz, R., Y.-L. Chow, A. Derr, M.-W. Feng y O. Kimball. Statistical Modeling for Continuous Speech Recognition. Fort Belvoir, VA: Defense Technical Information Center, febrero de 1988. http://dx.doi.org/10.21236/ada192054.
Texto completoSTANDARD OBJECT SYSTEMS INC SHALIMAR FL. Auditory Modeling for Noisy Speech Recognition. Fort Belvoir, VA: Defense Technical Information Center, enero de 2000. http://dx.doi.org/10.21236/ada373379.
Texto completoPfister, M. Software Package for Speaker Independent or Dependent Speech Recognition Using Standard Objects for Phonetic Speech Recognition. Fort Belvoir, VA: Defense Technical Information Center, febrero de 1998. http://dx.doi.org/10.21236/ada341198.
Texto completoOre, Brian M. Speech Recognition, Articulatory Feature Detection, and Speech Synthesis in Multiple Languages. Fort Belvoir, VA: Defense Technical Information Center, noviembre de 2009. http://dx.doi.org/10.21236/ada519140.
Texto completoDraelos, Timothy J., Stephen Heck, Jennifer Galasso y Ronald Brogan. Seismic Phase Identification with Speech Recognition Algorithms. Office of Scientific and Technical Information (OSTI), septiembre de 2018. http://dx.doi.org/10.2172/1474260.
Texto completoSchwartz, Richard y John Makhoul. Combining Multiple Knowledge Sources for Speech Recognition. Fort Belvoir, VA: Defense Technical Information Center, septiembre de 1988. http://dx.doi.org/10.21236/ada198928.
Texto completo