Literatura científica selecionada sobre o tema "Distant speech"

Crie uma referência precisa em APA, MLA, Chicago, Harvard, e outros estilos

Selecione um tipo de fonte:

Consulte a lista de atuais artigos, livros, teses, anais de congressos e outras fontes científicas relevantes para o tema "Distant speech".

Ao lado de cada fonte na lista de referências, há um botão "Adicionar à bibliografia". Clique e geraremos automaticamente a citação bibliográfica do trabalho escolhido no estilo de citação de que você precisa: APA, MLA, Harvard, Chicago, Vancouver, etc.

Você também pode baixar o texto completo da publicação científica em formato .pdf e ler o resumo do trabalho online se estiver presente nos metadados.

Artigos de revistas sobre o assunto "Distant speech"

1

Sorokin, V. N. "Distant Speech Detection". Acoustical Physics 69, n.º 4 (agosto de 2023): 565–73. http://dx.doi.org/10.1134/s1063771023600250.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
2

Ashwini, Jaya Kumar, e Ramaswamy Kumaraswamy. "Single-Channel Speech Enhancement Techniques for Distant Speech Recognition". Journal of Intelligent Systems 22, n.º 2 (1 de junho de 2013): 81–93. http://dx.doi.org/10.1515/jisys-2012-0051.

Texto completo da fonte
Resumo:
AbstractThis article presents an overview of the single-channel dereverberation methods suitable for distant speech recognition (DSR) application. The dereverberation methods are mainly classified based on the domain of enhancement of speech signal captured by a distant microphone. Many single-channel speech enhancement methods focus on either denoising or dereverberating the distorted speech signal. There are very few methods that consider both noise and reverberation effects. Such methods are discussed under a multistage approach in this article. The article concludes with a hypothesis that the methods that do not require an a priori reverberation impulse response is desirable in varying the environmental conditions for DSR applications such as intelligent home and office environments, humanoid robots, and automobiles rather than the methods that require an a priori reverberation impulse response.
Estilos ABNT, Harvard, Vancouver, APA, etc.
3

Guerrero Flores, Cristina, Georgina Tryfou e Maurizio Omologo. "Cepstral distance based channel selection for distant speech recognition". Computer Speech & Language 47 (janeiro de 2018): 314–32. http://dx.doi.org/10.1016/j.csl.2017.08.003.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
4

Nematollahi, Mohammad Ali, e S. A. R. Al-Haddad. "Distant Speaker Recognition: An Overview". International Journal of Humanoid Robotics 13, n.º 02 (25 de maio de 2016): 1550032. http://dx.doi.org/10.1142/s0219843615500322.

Texto completo da fonte
Resumo:
Distant speaker recognition (DSR) system assumes the microphones are far away from the speaker’s mouth. Also, the position of microphones can vary. Furthermore, various challenges and limitation in terms of coloration, ambient noise and reverberation can bring some difficulties for recognition of the speaker. Although, applying speech enhancement techniques can attenuate speech distortion components, it may remove speaker-specific information and increase the processing time in real-time application. Currently, many efforts have been investigated to develop DSR for commercial viable systems. In this paper, state-of-the-art techniques in DSR such as robust feature extraction, feature normalization, robust speaker modeling, model compensation, dereverberation and score normalization are discussed to overcome the speech degradation components i.e., reverberation and ambient noise. Performance results on DSR show that whenever speaker to microphone distant increases, recognition rates decreases and equal error rate (EER) increases. Finally, the paper concludes that applying robust feature and robust speaker model varying lesser with distant, can improve the DSR performance.
Estilos ABNT, Harvard, Vancouver, APA, etc.
5

Swietojanski, Pawel, Arnab Ghoshal e Steve Renals. "Convolutional Neural Networks for Distant Speech Recognition". IEEE Signal Processing Letters 21, n.º 9 (setembro de 2014): 1120–24. http://dx.doi.org/10.1109/lsp.2014.2325781.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
6

Pardede, Hilman F., Asri R. Yuliani e Rika Sustika. "Convolutional Neural Network and Feature Transformation for Distant Speech Recognition". International Journal of Electrical and Computer Engineering (IJECE) 8, n.º 6 (1 de dezembro de 2018): 5381. http://dx.doi.org/10.11591/ijece.v8i6.pp5381-5388.

Texto completo da fonte
Resumo:
In many applications, speech recognition must operate in conditions where there are some distances between speakers and the microphones. This is called distant speech recognition (DSR). In this condition, speech recognition must deal with reverberation. Nowadays, deep learning technologies are becoming the the main technologies for speech recognition. Deep Neural Network (DNN) in hybrid with Hidden Markov Model (HMM) is the commonly used architecture. However, this system is still not robust against reverberation. Previous studies use Convolutional Neural Networks (CNN), which is a variation of neural network, to improve the robustness of speech recognition against noise. CNN has the properties of pooling which is used to find local correlation between neighboring dimensions in the features. With this property, CNN could be used as feature learning emphasizing the information on neighboring frames. In this study we use CNN to deal with reverberation. We also propose to use feature transformation techniques: linear discriminat analysis (LDA) and maximum likelihood linear transformation (MLLT), on mel frequency cepstral coefficient (MFCC) before feeding them to CNN. We argue that transforming features could produce more discriminative features for CNN, and hence improve the robustness of speech recognition against reverberation. Our evaluations on Meeting Recorder Digits (MRD) subset of Aurora-5 database confirm that the use of LDA and MLLT transformations improve the robustness of speech recognition. It is better by 20% relative error reduction on compared to a standard DNN based speech recognition using the same number of hidden layers.
Estilos ABNT, Harvard, Vancouver, APA, etc.
7

NAKANO, Alberto Yoshihiro, Seiichi NAKAGAWA e Kazumasa YAMAMOTO. "Distant Speech Recognition Using a Microphone Array Network". IEICE Transactions on Information and Systems E93-D, n.º 9 (2010): 2451–62. http://dx.doi.org/10.1587/transinf.e93.d.2451.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
8

Ravanelli, Mirco, e Maurizio Omologo. "Automatic context window composition for distant speech recognition". Speech Communication 101 (julho de 2018): 34–44. http://dx.doi.org/10.1016/j.specom.2018.05.001.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
9

Matassoni, Marco, Maurizio Omologo, Diego Giuliani e Piergiorgio Svaizer. "Hidden Markov model training with contaminated speech material for distant-talking speech recognition". Computer Speech & Language 16, n.º 2 (abril de 2002): 205–23. http://dx.doi.org/10.1006/csla.2002.0191.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
10

Mustafaev, M. Sh, V. A. Vissarionov, E. M. Tarchokova e S. A. Dyshekova. "Basics of complex rehabilitation of patients with speech disorders after uranoplasty". Medical alphabet, n.º 3 (12 de junho de 2020): 40–42. http://dx.doi.org/10.33667/2078-5631-2020-3-40-42.

Texto completo da fonte
Resumo:
Result of the study a theoretical overview technologies for eliminating cleft of the palate and their distant results for the formation of correct speech. The process of speech formation and the influence of technical features of first and reconstructive uranoplastics using pharyngeal flap on this process was analyzed.
Estilos ABNT, Harvard, Vancouver, APA, etc.

Teses / dissertações sobre o assunto "Distant speech"

1

Ravanelli, Mirco. "Deep Learning for Distant Speech Recognition". Doctoral thesis, Università degli studi di Trento, 2017. https://hdl.handle.net/11572/368959.

Texto completo da fonte
Resumo:
Deep learning is an emerging technology that is considered one of the most promising directions for reaching higher levels of artificial intelligence. Among the other achievements, building computers that understand speech represents a crucial leap towards intelligent machines. Despite the great efforts of the past decades, however, a natural and robust human-machine speech interaction still appears to be out of reach, especially when users interact with a distant microphone in noisy and reverberant environments. The latter disturbances severely hamper the intelligibility of a speech signal, making Distant Speech Recognition (DSR) one of the major open challenges in the field. This thesis addresses the latter scenario and proposes some novel techniques, architectures, and algorithms to improve the robustness of distant-talking acoustic models. We first elaborate on methodologies for realistic data contamination, with a particular emphasis on DNN training with simulated data. We then investigate on approaches for better exploiting speech contexts, proposing some original methodologies for both feed-forward and recurrent neural networks. Lastly, inspired by the idea that cooperation across different DNNs could be the key for counteracting the harmful effects of noise and reverberation, we propose a novel deep learning paradigm called “network of deep neural networks†. The analysis of the original concepts were based on extensive experimental validations conducted on both real and simulated data, considering different corpora, microphone configurations, environments, noisy conditions, and ASR tasks.
Estilos ABNT, Harvard, Vancouver, APA, etc.
2

Ravanelli, Mirco. "Deep Learning for Distant Speech Recognition". Doctoral thesis, University of Trento, 2017. http://eprints-phd.biblio.unitn.it/2693/1/PhD_thesis_Ravanelli_Mirco_archived.pdf.

Texto completo da fonte
Resumo:
Deep learning is an emerging technology that is considered one of the most promising directions for reaching higher levels of artificial intelligence. Among the other achievements, building computers that understand speech represents a crucial leap towards intelligent machines. Despite the great efforts of the past decades, however, a natural and robust human-machine speech interaction still appears to be out of reach, especially when users interact with a distant microphone in noisy and reverberant environments. The latter disturbances severely hamper the intelligibility of a speech signal, making Distant Speech Recognition (DSR) one of the major open challenges in the field. This thesis addresses the latter scenario and proposes some novel techniques, architectures, and algorithms to improve the robustness of distant-talking acoustic models. We first elaborate on methodologies for realistic data contamination, with a particular emphasis on DNN training with simulated data. We then investigate on approaches for better exploiting speech contexts, proposing some original methodologies for both feed-forward and recurrent neural networks. Lastly, inspired by the idea that cooperation across different DNNs could be the key for counteracting the harmful effects of noise and reverberation, we propose a novel deep learning paradigm called “network of deep neural networks”. The analysis of the original concepts were based on extensive experimental validations conducted on both real and simulated data, considering different corpora, microphone configurations, environments, noisy conditions, and ASR tasks.
Estilos ABNT, Harvard, Vancouver, APA, etc.
3

Liu, Yulan. "Distant speech recognition of natural spontaneous multi-party conversations". Thesis, University of Sheffield, 2017. http://etheses.whiterose.ac.uk/17691/.

Texto completo da fonte
Resumo:
Distant speech recognition (DSR) has gained wide interest recently. While deep networks keep improving ASR overall, the performance gap remains between using close-talking recordings and distant recordings. Therefore the work in this thesis aims at providing some insights for further improvement of DSR performance. The investigation starts with collecting the first multi-microphone and multi-media corpus of natural spontaneous multi-party conversations in native English with the speaker location tracked, i.e. the Sheffield Wargame Corpus (SWC). The state-of-the-art recognition systems with the acoustic models trained standalone and adapted both show word error rates (WERs) above 40% on headset recordings and above 70% on distant recordings. A comparison between SWC and AMI corpus suggests a few unique properties in the real natural spontaneous conversations, e.g. the very short utterances and the emotional speech. Further experimental analysis based on simulated data and real data quantifies the impact of such influence factors on DSR performance, and illustrates the complex interaction among multiple factors which makes the treatment of each influence factor much more difficult. The reverberation factor is studied further. It is shown that the reverberation effect on speech features could be accurately modelled with a temporal convolution in the complex spectrogram domain. Based on that a polynomial reverberation score is proposed to measure the distortion level of short utterances. Compared to existing reverberation metrics like C50, it avoids a rigid early-late-reverberation partition without compromising the performance on ranking the reverberation level of recording environments and channels. Furthermore, the existing reverberation measurement is signal independent thus unable to accurately estimate the reverberation distortion level in short recordings. Inspired by the phonetic analysis on the reverberation distortion via self-masking and overlap-masking, a novel partition of reverberation distortion into the intra-phone smearing and the inter-phone smearing is proposed, so that the reverberation distortion level is first estimated on each part and then combined.
Estilos ABNT, Harvard, Vancouver, APA, etc.
4

Mahdian, Toroghi Rahil [Verfasser], e Dietrich [Akademischer Betreuer] Klakow. "Blind speech separation in distant speech recognition front-end processing / Rahil Mahdian Toroghi ; Betreuer: Dietrich Klakow". Saarbrücken : Saarländische Universitäts- und Landesbibliothek, 2016. http://d-nb.info/1119802229/34.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
5

Eldridge, Larry D. "A distant heritage : the growth of free speech in early America /". New York, NY [u.a.] : New York Univ. Press, 1994. http://www.gbv.de/dms/spk/sbb/recht/toc/277767857.pdf.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
6

Titus, Andrew Richard. "A study of adaptive enhancement methods for improved distant speech recognition". Thesis, Massachusetts Institute of Technology, 2018. http://hdl.handle.net/1721.1/119706.

Texto completo da fonte
Resumo:
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018.
This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
Cataloged from student-submitted PDF version of thesis.
Includes bibliographical references (pages 65-68).
Automatic speech recognition systems trained on speech data recorded by microphones placed close to the speaker tend to perform poorly on speech recorded by microphones placed farther away from the speaker due to reverberation effects and background noise. I designed and implemented a variety of machine learning models to improve distant speech recognition performance by adaptively enhancing incoming speech to appear as if it was recorded in a close-talking environment, regardless of whether it was originally recorded in a close-talking or distant environment. These were evaluated by passing the enhanced speech to acoustic models trained on only close-talking speech and comparing error rates to those achieved without speech enhancement. Experiments conducted on the AMI, TIMIT and TED-LIUM datasets indicate that decreases in error rate on distant speech of up to 33% relative can be achieved by these with only minor increases (1% relative) on clean speech.
by Andrew Richard Titus.
M. Eng.
Estilos ABNT, Harvard, Vancouver, APA, etc.
7

Guerrero, Flores Cristina Maritza. "Information Fusion Approaches for Distant Speech Recognition in a Multi-microphone Setting". Doctoral thesis, Università degli studi di Trento, 2016. https://hdl.handle.net/11572/368955.

Texto completo da fonte
Resumo:
It is a well known fact that high quality Automatic Speech Recognition is still difficult to guarantee under conditions in which the speaker is distant from the microphone due to the distortions caused by acoustic phenomena, such as noise and reverberation. Among the different research directions pursued around this problem, the adoption of multi-channel approaches is of great interest to the community given the potential of taking advantage of information diversity. In this thesis we elaborate on approaches that exploit different instances of a sound source, captured by various largely spaced microphones, in order to extract a Distant Speech Recognition hypothesis. Two original solutions are presented, based on information fusion approaches at different levels of the recognition system, one at front-end stage and one at post-decoding stage, namely for the problems of channel selection (CS) and hypothesis combination. First, a new CS framework is proposed. Cepstral distance (CD), which is effectively applied in other acoustic processing fields, is the basis of the CS method developed. Experimental results confirmed the advantages of a CD-based selection schema under different scenarios. The second contribution concerns the combination of information extracted from the individual decoding processes performed over the multiple captured signals. It is shown how temporal cues can be identified in the hypothesis space, and be beneficial for the elaboration of a multi-microphone confusion network, from which the final speech transcription is derived. The proposed methods are applicable in a setting equipped with synchronized distributed microphones, independently of the proximity between the sensors. Analysis of the novel concepts were performed over synthetic and real-captured data. Both approaches achieved positive results at the different assessment tasks they were exposed to.
Estilos ABNT, Harvard, Vancouver, APA, etc.
8

Guerrero, Flores Cristina Maritza. "Information Fusion Approaches for Distant Speech Recognition in a Multi-microphone Setting". Doctoral thesis, University of Trento, 2016. http://eprints-phd.biblio.unitn.it/1853/1/160830_cguerrero_phd-thesis.pdf.

Texto completo da fonte
Resumo:
It is a well known fact that high quality Automatic Speech Recognition is still difficult to guarantee under conditions in which the speaker is distant from the microphone due to the distortions caused by acoustic phenomena, such as noise and reverberation. Among the different research directions pursued around this problem, the adoption of multi-channel approaches is of great interest to the community given the potential of taking advantage of information diversity. In this thesis we elaborate on approaches that exploit different instances of a sound source, captured by various largely spaced microphones, in order to extract a Distant Speech Recognition hypothesis. Two original solutions are presented, based on information fusion approaches at different levels of the recognition system, one at front-end stage and one at post-decoding stage, namely for the problems of channel selection (CS) and hypothesis combination. First, a new CS framework is proposed. Cepstral distance (CD), which is effectively applied in other acoustic processing fields, is the basis of the CS method developed. Experimental results confirmed the advantages of a CD-based selection schema under different scenarios. The second contribution concerns the combination of information extracted from the individual decoding processes performed over the multiple captured signals. It is shown how temporal cues can be identified in the hypothesis space, and be beneficial for the elaboration of a multi-microphone confusion network, from which the final speech transcription is derived. The proposed methods are applicable in a setting equipped with synchronized distributed microphones, independently of the proximity between the sensors. Analysis of the novel concepts were performed over synthetic and real-captured data. Both approaches achieved positive results at the different assessment tasks they were exposed to.
Estilos ABNT, Harvard, Vancouver, APA, etc.
9

Swietojanski, Paweł. "Learning representations for speech recognition using artificial neural networks". Thesis, University of Edinburgh, 2016. http://hdl.handle.net/1842/22835.

Texto completo da fonte
Resumo:
Learning representations is a central challenge in machine learning. For speech recognition, we are interested in learning robust representations that are stable across different acoustic environments, recording equipment and irrelevant inter– and intra– speaker variabilities. This thesis is concerned with representation learning for acoustic model adaptation to speakers and environments, construction of acoustic models in low-resource settings, and learning representations from multiple acoustic channels. The investigations are primarily focused on the hybrid approach to acoustic modelling based on hidden Markov models and artificial neural networks (ANN). The first contribution concerns acoustic model adaptation. This comprises two new adaptation transforms operating in ANN parameters space. Both operate at the level of activation functions and treat a trained ANN acoustic model as a canonical set of fixed-basis functions, from which one can later derive variants tailored to the specific distribution present in adaptation data. The first technique, termed Learning Hidden Unit Contributions (LHUC), depends on learning distribution-dependent linear combination coefficients for hidden units. This technique is then extended to altering groups of hidden units with parametric and differentiable pooling operators. We found the proposed adaptation techniques pose many desirable properties: they are relatively low-dimensional, do not overfit and can work in both a supervised and an unsupervised manner. For LHUC we also present extensions to speaker adaptive training and environment factorisation. On average, depending on the characteristics of the test set, 5-25% relative word error rate (WERR) reductions are obtained in an unsupervised two-pass adaptation setting. The second contribution concerns building acoustic models in low-resource data scenarios. In particular, we are concerned with insufficient amounts of transcribed acoustic material for estimating acoustic models in the target language – thus assuming resources like lexicons or texts to estimate language models are available. First we proposed an ANN with a structured output layer which models both context–dependent and context–independent speech units, with the context-independent predictions used at runtime to aid the prediction of context-dependent states. We also propose to perform multi-task adaptation with a structured output layer. We obtain consistent WERR reductions up to 6.4% in low-resource speaker-independent acoustic modelling. Adapting those models in a multi-task manner with LHUC decreases WERRs by an additional 13.6%, compared to 12.7% for non multi-task LHUC. We then demonstrate that one can build better acoustic models with unsupervised multi– and cross– lingual initialisation and find that pre-training is a largely language-independent. Up to 14.4% WERR reductions are observed, depending on the amount of the available transcribed acoustic data in the target language. The third contribution concerns building acoustic models from multi-channel acoustic data. For this purpose we investigate various ways of integrating and learning multi-channel representations. In particular, we investigate channel concatenation and the applicability of convolutional layers for this purpose. We propose a multi-channel convolutional layer with cross-channel pooling, which can be seen as a data-driven non-parametric auditory attention mechanism. We find that for unconstrained microphone arrays, our approach is able to match the performance of the comparable models trained on beamform-enhanced signals.
Estilos ABNT, Harvard, Vancouver, APA, etc.
10

KITAOKA, Norihide, Seiichi NAKAGAWA e Longbiao WANG. "Robust Speech Recognition by Combining Short-Term and Long-Term Spectrum Based Position-Dependent CMN with Conventional CMN". Institute of Electronics, Information and Communication Engineers, 2008. http://hdl.handle.net/2237/14966.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.

Livros sobre o assunto "Distant speech"

1

Woelfel, Matthias. Distant speech recognition. Chichester, West Sussex, U.K: Wiley, 2009.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
2

Eldridge, Larry D. A distant heritage: The growth of free speech in early America. New York: New York University Press, 1994.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
3

Smoothey, Marion. Time, distance, and speed. New York: Marshall Cavendish, 1993.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
4

Smoothey, Marion. Time, distance, and speed. New York: Marshall Cavendish, 1993.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
5

Humphreys, D. Russell. Starlight and time: Solving the puzzle of distant starlight in a young universe. Green Forest, AR: Master Books, 1994.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
6

Speed, Charles S. Call of a distant drum: The Speeds, the Crittendons, and the New Land. Arlington, VA (2000 South Eads St., Arlington 22202): C.S. Speed, 1986.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
7

Subhasis, Chaudhuri, e SpringerLink (Online service), eds. Video Analysis and Repackaging for Distance Education. New York, NY: Springer New York, 2012.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
8

Hunt, M. J. Distance measures for speech recognition =: Les distances spectrales pour la reconnaissance de la parole. Ottawa: National Research Council Canada, 1989.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
9

Brunner-Jass, Renata. Field of play: Measuring distance, rate, and time. Chicago, IL: Norwood House Press, 2013.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
10

DIMED 86 (1986 Algarve, Portugal). DIMED 86: Discurso dos media e ensino a distância = discours des médias et enseignement à distance = media speech and distance teaching : actas do colóquio, Algarve 10-15 março de 1986. Lisboa: Ministério da Educação e Cultura, Instituto Português de Ensino a Distância, 1986.

Encontre o texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.

Capítulos de livros sobre o assunto "Distant speech"

1

Kendall, Henry W. "Speech to the United Nations Intergovernmental Negotiating Committee". In A Distant Light, 189–91. New York, NY: Springer New York, 2000. http://dx.doi.org/10.1007/978-1-4419-8507-1_17.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
2

Zając, Marcin, e Adam Przepiórkowski. "Distant Supervision Learning of DBPedia Relations". In Text, Speech, and Dialogue, 193–200. Berlin, Heidelberg: Springer Berlin Heidelberg, 2013. http://dx.doi.org/10.1007/978-3-642-40585-3_25.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
3

Haderlein, Tino, Elmar Nöth, Wolfgang Herbordt, Walter Kellermann e Heinrich Niemann. "Using Artificially Reverberated Training Data in Distant-Talking ASR". In Text, Speech and Dialogue, 226–33. Berlin, Heidelberg: Springer Berlin Heidelberg, 2005. http://dx.doi.org/10.1007/11551874_29.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
4

Ji, Mikyong, Sungtak Kim e Hoirin Kim. "Composite Decision by Bayesian Inference in Distant-Talking Speech Recognition". In Text, Speech and Dialogue, 463–70. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006. http://dx.doi.org/10.1007/11846406_58.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
5

Radeck-Arneth, Stephan, Benjamin Milde, Arvid Lange, Evandro Gouvêa, Stefan Radomski, Max Mühlhäuser e Chris Biemann. "Open Source German Distant Speech Recognition: Corpus and Acoustic Model". In Text, Speech, and Dialogue, 480–88. Cham: Springer International Publishing, 2015. http://dx.doi.org/10.1007/978-3-319-24033-6_54.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
6

Vacher, Michel, Benjamin Lecouteux e François Portet. "On Distant Speech Recognition for Home Automation". In Smart Health, 161–88. Cham: Springer International Publishing, 2015. http://dx.doi.org/10.1007/978-3-319-16226-3_7.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
7

Golubev, Anton, e Natalia Loukachevitch. "Use of Augmentation and Distant Supervision for Sentiment Analysis in Russian". In Text, Speech, and Dialogue, 184–96. Cham: Springer International Publishing, 2021. http://dx.doi.org/10.1007/978-3-030-83527-9_16.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
8

Renals, Steve, e Pawel Swietojanski. "Distant Speech Recognition Experiments Using the AMI Corpus". In New Era for Robust Speech Recognition, 355–68. Cham: Springer International Publishing, 2017. http://dx.doi.org/10.1007/978-3-319-64680-0_16.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
9

Astapov, Sergei, Gleb Svirskiy, Aleksandr Lavrentyev, Tatyana Prisyach, Dmitriy Popov, Dmitriy Ubskiy e Vladimir Kabarov. "Acoustic Event Mixing to Multichannel AMI Data for Distant Speech Recognition and Acoustic Event Classification Benchmarking". In Speech and Computer, 31–42. Cham: Springer International Publishing, 2019. http://dx.doi.org/10.1007/978-3-030-26061-3_4.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
10

Agarwal, Nidhi, Amita Jain, Ayush Gupta e Devendra Kumar Tayal. "Applying XGBoost Machine Learning Model to Succor Astronomers Detect Exoplanets in Distant Galaxies". In Artificial Intelligence and Speech Technology, 385–404. Cham: Springer International Publishing, 2022. http://dx.doi.org/10.1007/978-3-030-95711-7_33.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.

Trabalhos de conferências sobre o assunto "Distant speech"

1

Lin, Q., C. Che, D. S. Yak, L. Jin, B. deVries, J. Pearson e J. Flanagan. "Robust distant-talking speech recognition". In 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings. IEEE, 1996. http://dx.doi.org/10.1109/icassp.1996.540280.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
2

Guerrero, Cristina, Georgina Tryfou e Maurizio Omologo. "Channel Selection for Distant Speech Recognition Exploiting Cepstral Distance". In Interspeech 2016. ISCA, 2016. http://dx.doi.org/10.21437/interspeech.2016-865.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
3

Thotappa, Deepak, e S. R. Mahadeva Prasanna. "Analysis of Foreground and Distant speech". In TENCON 2015 - 2015 IEEE Region 10 Conference. IEEE, 2015. http://dx.doi.org/10.1109/tencon.2015.7372808.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
4

Jin, Qin, Runxin Li, Qian Yang, Kornel Laskowski e Tanja Schultz. "Speaker identification with distant microphone speech". In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2010. http://dx.doi.org/10.1109/icassp.2010.5495590.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
5

Morales-Cordovilla, Juan A., Hannes Pessentheiner, Martin Hagmüller e Gernot Kubin. "Room localization for distant speech recognition". In Interspeech 2014. ISCA: ISCA, 2014. http://dx.doi.org/10.21437/interspeech.2014-238.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
6

McDonough, John, e Matthias Wolfel. "Distant Speech Recognition: Bridging the Gaps". In 2008 Hands-Free Speech Communication and Microphone Arrays (HSCMA 2008). IEEE, 2008. http://dx.doi.org/10.1109/hscma.2008.4538699.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
7

Renals, Steve, e Pawel Swietojanski. "Neural networks for distant speech recognition". In 2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA). IEEE, 2014. http://dx.doi.org/10.1109/hscma.2014.6843274.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
8

Zhao, Tuo, Yunxin Zhao, Shaojun Wang e Mei Han. "UNet++-Based Multi-Channel Speech Dereverberation and Distant Speech Recognition". In 2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP). IEEE, 2021. http://dx.doi.org/10.1109/iscslp49672.2021.9362064.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
9

Ravanelli, Mirco, e Maurizio Omologo. "Contaminated speech training methods for robust DNN-HMM distant speech recognition". In Interspeech 2015. ISCA: ISCA, 2015. http://dx.doi.org/10.21437/interspeech.2015-251.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
10

Zwyssig, Erich, Mike Lincoln e Steve Renals. "A digital microphone array for distant speech recognition". In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2010. http://dx.doi.org/10.1109/icassp.2010.5495040.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.

Relatórios de organizações sobre o assunto "Distant speech"

1

Zilberman, Mark. Methods to Test the “Dimming Effect” Produced by a Decrease in the Number of Photons Received from Receding Light Sources. Intellectual Archive, junho de 2021. http://dx.doi.org/10.32370/ia_2021_06_22.

Texto completo da fonte
Resumo:
The hypothetical “Dimming effect” describes the change of the number of photons arriving from a receding light source per unit of time. In non-relativistic systems,the "Dimming effect" occurs due to the fact that as light sources move away, the distance between the emitter and the receiver constantly increases, and the photons always take longer to reach the receiver. This reduces the number of photons received per time unit compared to the number of emitted photons per time unit. Negligible for speeds incomparable with the speed of light c, the "Dimming effect" can be very significant for speeds above 0.1c. “Dimming effect” can possibly be tested in a physics labor-atory using a moving light source (or mirror) and photon counters located in the travel direction and in opposite direction. It can possibly also be tested utilizing the orbital movement of the Earth around the Sun. If confirmed, “Dimming effect” would allow astronomers to adjust values of the "Standard Candles", which are critical in cosmological models. Absence of “Dimming effect” will mean that the number of photons arriving per time unit does not depend on the relative speed of light source and observer,which is not so apparent
Estilos ABNT, Harvard, Vancouver, APA, etc.
2

Stastny, Petr, Robert Roczniok, Daniel Cleather, Martin Musalek, Dominik Novak e Michal Vagner. Straight speed and acceleration optimal distances and reference values. A systematic review, and meta-analyses. INPLASY - International Platform of Registered Systematic Review and Meta-analysis Protocols, maio de 2022. http://dx.doi.org/10.37766/inplasy2022.5.0010.

Texto completo da fonte
Resumo:
Review question / Objective: To summarize the sprint reference acceleration and speed values for different sprint distances and suggest optimal unification of ice-hockey straight sprint testing. Eligibility criteria: The title and abstract screening was done by two researchers (PS and RR) who selected a set of articles for full text screening, where the inclusion criteria were: 1) male or female ice-hockey players; 2) any cross-sectional or intervention study; 3) tests of ice-hockey sprinting over any distance or any battery of conditioning tests that included straight-line sprints; and, 4) results reported straight-line sprint distance, speed, time, or acceleration. In the case of disagreement between the evaluating authors, the final decision was made by a third author (MV).The full text screening exclusion criteria were: 1) if the article was not in English; 2) the testing did not include straight-line sprinting; 3) the reported values did not include data distribution; 4) the study reported only maximum speed without skating time or average speed; 5) the end of the sprint was defined by the point the player stopped sprinting; 6) the measurement was made with a stopwatch; and, 7) the study had high bias estimation. The maximum speed test was not included due to the uncertain velocity conditions at beginning of testing distance. The bias estimation was performed using the JBI (Joanna Briggs Institute) Critical Appraisal Checklist for Analytical Cross Sectional Studies (supplementary material 1).
Estilos ABNT, Harvard, Vancouver, APA, etc.
3

Harris e Vaze. PR-185-0351-R05 Welding for Small to Medium Diameter Gas Pipelines - Real-Time Quality Monitoring. Chantilly, Virginia: Pipeline Research Council International, Inc. (PRCI), abril de 2006. http://dx.doi.org/10.55274/r0011071.

Texto completo da fonte
Resumo:
A literature review was conducted to establish and highlight the state of the art for real-time quality monitoring (RTQM). Work in this task advanced the state of the art in terms of achieving constant welding power input at constant contact tip-to-workpiece distance (CTWD), thus achieving constant welding heat input at constant welding travel speed. This was coupled with high speed on-board DAQ to achieve RTQM, in these terms.
Estilos ABNT, Harvard, Vancouver, APA, etc.
4

Bernau, Jeremiah. The Changing Bonneville Salt Flats. Utah Geological Survey, janeiro de 2024. http://dx.doi.org/10.34191/pi-106.

Texto completo da fonte
Resumo:
The Bonneville Salt Flats in western Utah are renowned worldwide for hosting land-speed racing events. They are easily accessible from Interstate 80 and are managed by the Bureau of Land Management. This distinct landscape spans approximately 5 miles wide by 12 miles long, featuring a smooth, hard salt crust that draws thousands of visitors each year.
Estilos ABNT, Harvard, Vancouver, APA, etc.
5

Nascimento, Lucas, Augusto Boening, Abílio Galli, Janaine Polese e Louise Ada. Mechanically-assisted walking training for improving walking speed, walking distance and social participation after stroke: a systematic review protocol. INPLASY - International Platform of Registered Systematic Review Protocols, março de 2020. http://dx.doi.org/10.37766/inplasy2020.3.0007.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
6

Rahai, Hamid, e Assma Begum. Numerical Investigations of Transient Wind Shear from Passing Vehicles Near a Road Structure (Part I: Unsteady Reynolds-Averaged Navier-Stokes Simulations). Mineta Transportation Institute, janeiro de 2021. http://dx.doi.org/10.31979/mti.2020.1933.

Texto completo da fonte
Resumo:
In this research, the authors performed unsteady numerical simulations of a moving Ahmed body under a freeway overpass at different distances from the bridge columns in order to evaluate transient wind shear and the wind load on these columns. Results have shown that when the vehicle is at 0.75W distance from the bridge columns, an unsteady wind speed of up to 24 m/s is observed at the columns with a pressure coefficient difference of 0.9. Here W is the width of the vehicle. These results indicate with an appropriate system for harnessing these wind energy potentials, significant renewable electric power could be generated with zero carbon footprint.
Estilos ABNT, Harvard, Vancouver, APA, etc.
7

Heron, Neil, e Sam McCormick. The Physical Demands of Wheelchair Tennis - a Systematic Review. INPLASY - International Platform of Registered Systematic Review and Meta-analysis Protocols, março de 2023. http://dx.doi.org/10.37766/inplasy2023.3.0060.

Texto completo da fonte
Resumo:
Review question / Objective: Our aim was to describe and synthesise the physical demands of wheelchair tennis. We reviewed the data across different playing surfaces, performance levels and sex of tennis players. Eligibility criteria: Studies had to meet the criteria below to be included in the review:i. The paper reported on participants playing singles or doubles wheelchair tennis matches (all ages, performance levels, quad or open category and court surfaces).ii. The data collected was related to the duration of play (e.g. length of match, effective playing time), on-court movement characteristics (e.g. distance covered, moving speed, accelerations), stroke characteristics (e.g. first serve %, count, frequency) or physiological response to match play (e.g. heart rate, oxygen uptake, energy expenditure) of wheelchair tennis.
Estilos ABNT, Harvard, Vancouver, APA, etc.
8

Idris, Iffat. Preventing Atrocities in Conflict and Non-conflict Settings. Institute of Development Studies, agosto de 2022. http://dx.doi.org/10.19088/k4d.2022.137.

Texto completo da fonte
Resumo:
Atrocity prevention refers to activities to prevent atrocity crimes against civilians. These include genocide, war crimes, crimes against humanity and ethnic cleansing, and can take place in both conflict and non-conflict settings. This points to the need to prioritise and implement atrocity prevention specifically, and not just as part of conflict prevention efforts. Atrocity prevention interventions are broadly of two types: operational (short-term responses) and structural (addressing underlying causes/drivers). These encompass a wide range of approaches including: acting locally (with local actors taking the lead in prevention activities); tackling hate speech, and promoting an independent and strong media; documenting human rights violations, and prosecuting and punishing those responsible (establishing rule of law). The international community should prioritise atrocity prevention, but work in a united manner, take a comprehensive approach, and give the lead to local actors. Atrocity crimes generally develop in a process over time, and risk factors can be identified; these traits make atrocity prevention possible. This rapid review looks at the concept of atrocity prevention, how it is distinct from conflict prevention, the different approaches taken to atrocity prevention, and the lessons learned from these. The review draws on a mixture of academic and grey literature, in particular reports produced by international development organisations such as the United Nations (UN) and USAID. The literature was largely gender-blind (with the exception of conflict-related sexual violence) and disability-blind.
Estilos ABNT, Harvard, Vancouver, APA, etc.
9

Kubica, Stefan, Tobias Peuschke-Bischof, Belinda Müller e Robin Avci. Fahrmanöver für Geradeausfahrt. Technische Hochschule Wildau, 2019. http://dx.doi.org/10.15771/1264.

Texto completo da fonte
Resumo:
This repository contains various short driving scenarios of our vehicle fleet in 1:14 scale. These are part of our digital learning factory "Wildauer Maschinen Werke" (www.th-wildau.de/wmw) and are used for the development of autonomous driving functions. The data is recorded in the form of Rosbags in the underlying robotic operating system (www.ros.org) and can be played in their own ROS server instances, whereby the recorded journeys of the vehicles can be simulated and used for their own developments. The scenarios contain short driving scenarios with so-called ROS topics. This includes engine and steering control and frontal distance measurements of an ultrasonic sensor. Videos are also provided for each scenario for a better overview. It is the 1st generation. Towards the end of 2020, data sets will follow that additionally contain camera data, GPS coordinates and speed as well as laser scanner data.
Estilos ABNT, Harvard, Vancouver, APA, etc.
10

Tao, Yang, Amos Mizrach, Victor Alchanatis, Nachshon Shamir e Tom Porter. Automated imaging broiler chicksexing for gender-specific and efficient production. United States Department of Agriculture, dezembro de 2014. http://dx.doi.org/10.32747/2014.7594391.bard.

Texto completo da fonte
Resumo:
Extending the previous two years of research results (Mizarch, et al, 2012, Tao, 2011, 2012), the third year’s efforts in both Maryland and Israel were directed towards the engineering of the system. The activities included the robust chick handling and its conveyor system development, optical system improvement, online dynamic motion imaging of chicks, multi-image sequence optimal feather extraction and detection, and pattern recognition. Mechanical System Engineering The third model of the mechanical chick handling system with high-speed imaging system was built as shown in Fig. 1. This system has the improved chick holding cups and motion mechanisms that enable chicks to open wings through the view section. The mechanical system has achieved the speed of 4 chicks per second which exceeds the design specs of 3 chicks per second. In the center of the conveyor, a high-speed camera with UV sensitive optical system, shown in Fig.2, was installed that captures chick images at multiple frames (45 images and system selectable) when the chick passing through the view area. Through intensive discussions and efforts, the PIs of Maryland and ARO have created the protocol of joint hardware and software that uses sequential images of chick in its fall motion to capture opening wings and extract the optimal opening positions. This approached enables the reliable feather feature extraction in dynamic motion and pattern recognition. Improving of Chick Wing Deployment The mechanical system for chick conveying and especially the section that cause chicks to deploy their wings wide open under the fast video camera and the UV light was investigated along the third study year. As a natural behavior, chicks tend to deploy their wings as a mean of balancing their body when a sudden change in the vertical movement was applied. In the latest two years, this was achieved by causing the chicks to move in a free fall, in the earth gravity (g) along short vertical distance. The chicks have always tended to deploy their wing but not always in wide horizontal open situation. Such position is requested in order to get successful image under the video camera. Besides, the cells with checks bumped suddenly at the end of the free falling path. That caused the chicks legs to collapse inside the cells and the image of wing become bluer. For improving the movement and preventing the chick legs from collapsing, a slowing down mechanism was design and tested. This was done by installing of plastic block, that was printed in a predesign variable slope (Fig. 3) at the end of the path of falling cells (Fig.4). The cells are moving down in variable velocity according the block slope and achieve zero velocity at the end of the path. The slop was design in a way that the deacceleration become 0.8g instead the free fall gravity (g) without presence of the block. The tests showed better deployment and wider chick's wing opening as well as better balance along the movement. Design of additional sizes of block slops is under investigation. Slops that create accelerations of 0.7g, 0.9g, and variable accelerations are designed for improving movement path and images.
Estilos ABNT, Harvard, Vancouver, APA, etc.
Oferecemos descontos em todos os planos premium para autores cujas obras estão incluídas em seleções literárias temáticas. Contate-nos para obter um código promocional único!

Vá para a bibliografia