Academic literature on the topic 'Speech waveforms'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Speech waveforms.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "Speech waveforms"
Milenkovic, Paul. "Least Mean Square Measures of Voice Perturbation." Journal of Speech, Language, and Hearing Research 30, no. 4 (December 1987): 529–38. http://dx.doi.org/10.1044/jshr.3004.529.
Full textCoorman, Geert. "Speech synthesis using concatenation of speech waveforms." Journal of the Acoustical Society of America 124, no. 6 (2008): 3371. http://dx.doi.org/10.1121/1.3047443.
Full textFerit Gigi, Ercan. "Speech Synthesis Using Concatenation Of Speech Waveforms." Journal of the Acoustical Society of America 129, no. 1 (2011): 545. http://dx.doi.org/10.1121/1.3554813.
Full textCoorman, Geert. "Speech synthesis using concatenation of speech waveforms." Journal of the Acoustical Society of America 116, no. 3 (2004): 1331. http://dx.doi.org/10.1121/1.1809938.
Full textXiong, Yan, Fang Xu, Qiang Chen, and Jun Zhang. "Speech Enhancement Using Heterogeneous Information." International Journal of Grid and High Performance Computing 10, no. 3 (July 2018): 46–59. http://dx.doi.org/10.4018/ijghpc.2018070104.
Full textKleijn, W. B. "Encoding speech using prototype waveforms." IEEE Transactions on Speech and Audio Processing 1, no. 4 (1993): 386–99. http://dx.doi.org/10.1109/89.242484.
Full textMaddieson, Ian. "Commentary on ‘Reading waveforms’." Journal of the International Phonetic Association 21, no. 2 (December 1991): 89–91. http://dx.doi.org/10.1017/s0025100300004436.
Full textYohanes, Banu W. "Linear Prediction and Long Term Predictor Analysis and Synthesis." Techné : Jurnal Ilmiah Elektroteknika 16, no. 01 (April 3, 2017): 49–58. http://dx.doi.org/10.31358/techne.v16i01.158.
Full textArda, Betul, Daniel Rudoy, and Patrick J. Wolfe. "Testing for periodicity in speech waveforms." Journal of the Acoustical Society of America 125, no. 4 (April 2009): 2699. http://dx.doi.org/10.1121/1.4784326.
Full textTerzopoulos, D. "Co-occurrence analysis of speech waveforms." IEEE Transactions on Acoustics, Speech, and Signal Processing 33, no. 1 (February 1985): 5–30. http://dx.doi.org/10.1109/tassp.1985.1164511.
Full textDissertations / Theses on the topic "Speech waveforms"
Lowry, Andrew. "Efficient structures for vector quantisation of speech waveforms." Thesis, Queen's University Belfast, 1988. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.292596.
Full textCarandang, Alfonso B., and n/a. "Recognition of phonemes using shapes of speech waveforms in WAL." University of Canberra. Information Sciences & Engineering, 1994. http://erl.canberra.edu.au./public/adt-AUC20060626.144432.
Full textDeivard, Johannes. "How accuracy of estimated glottal flow waveforms affects spoofed speech detection performance." Thesis, Mälardalens högskola, Akademin för innovation, design och teknik, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-48414.
Full textBamini, Praveen Kumar. "FPGA-based Implementation of Concatenative Speech Synthesis Algorithm." [Tampa, Fla.] : University of South Florida, 2003. http://purl.fcla.edu/fcla/etd/SFE0000187.
Full textChoy, Eddie L. T. "Waveform interpolation speech coder at 4 kbs." Thesis, McGill University, 1998. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=20901.
Full textLeong, Michael. "Representing voiced speech using prototype waveform interpolation for low-rate speech coding." Thesis, McGill University, 1992. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=56796.
Full textIn examining the PWI method, it was found that although the method generally works very well there are occasional sections of the reconstructed voiced speech where audible distortion can be heard, even when the prototypes are not quantized. The research undertaken in this thesis focuses on the fundamental principles behind modelling voiced speech using PWI instead of focusing on bit allocation for encoding the prototypes. Problems in the PWI method are found that may be have been overlooked as encoding error if full encoding were implemented.
Kleijn uses PWI to represent voiced sections of the excitation signal which is the residual obtained after the removal of short-term redundancies by a linear predictive filter. The problem with this method is that when the PWI reconstructed excitation is passed through the inverse filter to synthesize the speech undesired effects occur due to the time-varying nature of the filter. The reconstructed speech may have undesired envelope variations which result in audible warble.
This thesis proposes an energy fixup to smoothen the synthesized speech envelope when the interpolation procedure fails to provide the smooth linear result that is desired. Further investigation, however, leads to the final proposal in this thesis that PWI should he performed on the clean speech signal instead of the excitation to achieve consistently reliable results for all voiced frames.
Choy, Eddie L. T. "Waveform interpolation speech coder at 4 kb/s." Thesis, National Library of Canada = Bibliothèque nationale du Canada, 1998. http://www.collectionscanada.ca/obj/s4/f2/dsk1/tape11/PQDD_0028/MQ50596.pdf.
Full textDavis, Andrew J. "Waveform coding of speech and voiceband data signals." Thesis, University of Liverpool, 1988. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.232946.
Full textEide, Ellen Marie. "A linguistic feature representation of the speech waveform." Thesis, Massachusetts Institute of Technology, 1993. http://hdl.handle.net/1721.1/12510.
Full textIncludes bibliographical references (leaves 95-97).
by Ellen Marie Eide.
Ph.D.
Zeghidour, Neil. "Learning representations of speech from the raw waveform." Thesis, Paris Sciences et Lettres (ComUE), 2019. http://www.theses.fr/2019PSLEE004/document.
Full textWhile deep neural networks are now used in almost every component of a speech recognition system, from acoustic to language modeling, the input to such systems are still fixed, handcrafted, spectral features such as mel-filterbanks. This contrasts with computer vision, in which a deep neural network is now trained on raw pixels. Mel-filterbanks contain valuable and documented prior knowledge from human auditory perception as well as signal processing, and are the input to state-of-the-art speech recognition systems that are now on par with human performance in certain conditions. However, mel-filterbanks, as any fixed representation, are inherently limited by the fact that they are not fine-tuned for the task at hand. We hypothesize that learning the low-level representation of speech with the rest of the model, rather than using fixed features, could push the state-of-the art even further. We first explore a weakly-supervised setting and show that a single neural network can learn to separate phonetic information and speaker identity from mel-filterbanks or the raw waveform, and that these representations are robust across languages. Moreover, learning from the raw waveform provides significantly better speaker embeddings than learning from mel-filterbanks. These encouraging results lead us to develop a learnable alternative to mel-filterbanks, that can be directly used in replacement of these features. In the second part of this thesis we introduce Time-Domain filterbanks, a lightweight neural network that takes the waveform as input, can be initialized as an approximation of mel-filterbanks, and then learned with the rest of the neural architecture. Across extensive and systematic experiments, we show that Time-Domain filterbanks consistently outperform melfilterbanks and can be integrated into a new state-of-the-art speech recognition system, trained directly from the raw audio signal. Fixed speech features being also used for non-linguistic classification tasks for which they are even less optimal, we perform dysarthria detection from the waveform with Time-Domain filterbanks and show that it significantly improves over mel-filterbanks or low-level descriptors. Finally, we discuss how our contributions fall within a broader shift towards fully learnable audio understanding systems
Books on the topic "Speech waveforms"
Yaghmaie, Khashayar. Prototype waveform interpolation based low bit rate speech coding. 1997.
Find full textLamel, Lori, and Jean-Luc Gauvain. Speech Recognition. Edited by Ruslan Mitkov. Oxford University Press, 2012. http://dx.doi.org/10.1093/oxfordhb/9780199276349.013.0016.
Full textDutoit, Thierry, and Yannis Stylianou. Text-to-Speech Synthesis. Edited by Ruslan Mitkov. Oxford University Press, 2012. http://dx.doi.org/10.1093/oxfordhb/9780199276349.013.0017.
Full textGhaidan, K. A. A study of the application of modern techniques to speech waveform analysis. 1986.
Find full textBook chapters on the topic "Speech waveforms"
Kittler, J., and A. E. Lucas. "A New Method for Dynamic Time Alignment of Speech Waveforms." In Speech Recognition and Understanding, 537–42. Berlin, Heidelberg: Springer Berlin Heidelberg, 1992. http://dx.doi.org/10.1007/978-3-642-76626-8_53.
Full textSinha, Priyabrata. "Waveform Coders." In Speech Processing in Embedded Systems, 101–12. Boston, MA: Springer US, 2009. http://dx.doi.org/10.1007/978-0-387-75581-6_7.
Full textHaagen, Jesper, and W. Bastiaan Kleijn. "Waveform Interpolation." In Modern Methods of Speech Processing, 75–99. Boston, MA: Springer US, 1995. http://dx.doi.org/10.1007/978-1-4615-2281-2_4.
Full textKleijn, W. Bastiaan, and Wolfgang Granzow. "Waveform Interpolation in Speech Coding." In Speech and Audio Coding for Wireless and Network Applications, 111–18. Boston, MA: Springer US, 1993. http://dx.doi.org/10.1007/978-1-4615-3232-3_15.
Full textTóth, Bálint Pál, Kornél István Kis, György Szaszák, and Géza Németh. "Ensemble Deep Neural Network Based Waveform-Driven Stress Model for Speech Synthesis." In Speech and Computer, 271–78. Cham: Springer International Publishing, 2016. http://dx.doi.org/10.1007/978-3-319-43958-7_32.
Full textde Mori, R. "Working Group B: Waveform and Speech Recognition." In Syntactic and Structural Pattern Recognition, 447–51. Berlin, Heidelberg: Springer Berlin Heidelberg, 1988. http://dx.doi.org/10.1007/978-3-642-83462-2_27.
Full textWeiner, A. M., P. S. D. Lin, and R. B. Marcus. "Photoemissive Testing of High-Speed Electrical Waveforms." In Picosecond Electronics and Optoelectronics II, 56–60. Berlin, Heidelberg: Springer Berlin Heidelberg, 1987. http://dx.doi.org/10.1007/978-3-642-72970-6_12.
Full textPaul, Subham, and Debashis Chakraborty. "Efficient Speech Compression Using Waveform Coding in Time Domain." In Emerging Research in Computing, Information, Communication and Applications, 415–30. Singapore: Springer Singapore, 2017. http://dx.doi.org/10.1007/978-981-10-4741-1_37.
Full textSmidi, Ghaya, Aicha Bouzid, and Noureddine Ellouze. "Glottal Closure Instant Detection by the Multi-scale Product of the Derivative Glottal Waveform Signal." In Recent Advances in Nonlinear Speech Processing, 191–98. Cham: Springer International Publishing, 2016. http://dx.doi.org/10.1007/978-3-319-28109-4_19.
Full textPanda, Soumya Priyadarsini, and Ajit Kumar Nayak. "Spectral Smoothening Based Waveform Concatenation Technique for Speech Quality Enhancement in Text-to-Speech Systems." In Advances in Intelligent Systems and Computing, 425–32. Singapore: Springer Singapore, 2020. http://dx.doi.org/10.1007/978-981-15-1081-6_36.
Full textConference papers on the topic "Speech waveforms"
Hoshen, Yedid, Ron J. Weiss, and Kevin W. Wilson. "Speech acoustic modeling from raw multichannel waveforms." In ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2015. http://dx.doi.org/10.1109/icassp.2015.7178847.
Full textCvetkovic, Z. "Modulating waveforms for OFDM." In 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258). IEEE, 1999. http://dx.doi.org/10.1109/icassp.1999.760629.
Full textTokuday, Keiichi, and Heiga Zen. "Directly modeling speech waveforms by neural networks for statistical parametric speech synthesis." In ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2015. http://dx.doi.org/10.1109/icassp.2015.7178765.
Full textCvetkovic, Beferull-Lozano, and Buja. "Robust phoneme discrimination using acoustic waveforms." In IEEE International Conference on Acoustics Speech and Signal Processing ICASSP-02. IEEE, 2002. http://dx.doi.org/10.1109/icassp.2002.1005740.
Full textSujatha, B. K., P. S. Satyanarayana, and K. N. Haribhat. "Digital Coding of Speech Waveforms Using the Proposed ADM." In 2008 Second International Conference on Future Generation Communication and Networking (FGCN). IEEE, 2008. http://dx.doi.org/10.1109/fgcn.2008.160.
Full textMporas, Iosif, Todor Ganchev, and Nikos Fakotakis. "A hybrid architecture for automatic segmentation of speech waveforms." In ICASSP 2008 - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2008. http://dx.doi.org/10.1109/icassp.2008.4518645.
Full textFainberg, Joachim, Ondrej Klejch, Erfan Loweimi, Peter Bell, and Steve Renals. "Acoustic Model Adaptation from Raw Waveforms with Sincnet." In 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE, 2019. http://dx.doi.org/10.1109/asru46091.2019.9003974.
Full textHongbing Zhang, Fan, and Lindsey. "Wavelet packet waveforms for multicarrier CDMA communications." In IEEE International Conference on Acoustics Speech and Signal Processing ICASSP-02. IEEE, 2002. http://dx.doi.org/10.1109/icassp.2002.1005207.
Full textLourey, Simon J. "Frequency hopping waveforms for continuous active sonar." In ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2015. http://dx.doi.org/10.1109/icassp.2015.7178287.
Full textDai, Wei, Chia Dai, Shuhui Qu, Juncheng Li, and Samarjit Das. "Very deep convolutional neural networks for raw waveforms." In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2017. http://dx.doi.org/10.1109/icassp.2017.7952190.
Full textReports on the topic "Speech waveforms"
Nehl, Albert. Investigation of techniques for high speed CMOS arbitrary waveform generation. Portland State University Library, January 2000. http://dx.doi.org/10.15760/etd.5993.
Full textHaller, Gunther. High-Speed, High-Resolution Analog Waveform Sampling in VLSI Technology. Office of Scientific and Technical Information (OSTI), October 1998. http://dx.doi.org/10.2172/10132.
Full textDrive modelling and performance estimation of IPM motor using SVPWM and Six-step Control Strategy. SAE International, April 2021. http://dx.doi.org/10.4271/2021-01-0775.
Full text