Dissertations on the topic "Acoustic analysis of speech"
Explore the top 50 dissertations for research on the topic "Acoustic analysis of speech".
John, Jeeva. „Acoustic Analysis of Speech of Persons with Autistic Spectrum Disorders“. Bowling Green State University / OhioLINK, 2008. http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1206329066.
Nulsen, Susan. „Combining acoustic analysis and phonotactic analysis to improve automatic speech recognition“. University of Canberra. Information Sciences & Engineering, 1998. http://erl.canberra.edu.au./public/adt-AUC20060825.131042.
Brock, James L. „Acoustic classification using independent component analysis“. Link to online version, 2006. https://ritdml.rit.edu/dspace/handle/1850/2067.
Singh-Miller, Natasha 1981. „Neighborhood analysis methods in acoustic modeling for automatic speech recognition“. Thesis, Massachusetts Institute of Technology, 2010. http://hdl.handle.net/1721.1/62450.
Cataloged from PDF version of thesis.
Includes bibliographical references (p. 121-134).
This thesis investigates the problem of using nearest-neighbor based non-parametric methods for performing multi-class class-conditional probability estimation. The methods developed are applied to the problem of acoustic modeling for speech recognition. Neighborhood components analysis (NCA) (Goldberger et al. [2005]) serves as the departure point for this study. NCA is a non-parametric method that can be seen as providing two things: (1) low-dimensional linear projections of the feature space that allow nearest-neighbor algorithms to perform well, and (2) nearest-neighbor based class-conditional probability estimates. First, NCA is used to perform dimensionality reduction on acoustic vectors, a commonly addressed problem in speech recognition. NCA is shown to perform competitively with another commonly employed dimensionality reduction technique in speech known as heteroscedastic linear discriminant analysis (HLDA) (Kumar [1997]). Second, a nearest neighbor-based model related to NCA is created to provide a class-conditional estimate that is sensitive to the possible underlying relationship between the acoustic-phonetic labels. An embedding of the labels is learned that can be used to estimate the similarity or confusability between labels. This embedding is related to the concept of error-correcting output codes (ECOC) and therefore the proposed model is referred to as NCA-ECOC. The estimates provided by this method, along with nearest neighbor information, are shown to provide improvements in speech recognition performance (2.5% relative reduction in word error rate). Third, a model for calculating class-conditional probability estimates is proposed that generalizes GMM, NCA, and kernel density approaches. This model, called locally-adaptive neighborhood components analysis, LA-NCA, learns different low-dimensional projections for different parts of the space. 
The model exploits the fact that in different parts of the space different directions may be important for discrimination between the classes. This model is computationally intensive and prone to over-fitting, so methods for sub-selecting neighbors used for providing the class-conditional estimates are explored. The estimates provided by LA-NCA are shown to give significant gains in speech recognition performance (7-8% relative reduction in word error rate) as well as phonetic classification.
by Natasha Singh-Miller.
Ph.D.
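The nearest-neighbor estimates described in the abstract above can be illustrated with a small sketch. It computes soft nearest-neighbor class weights in a linearly projected space, in the general style of NCA (Goldberger et al.). It is not the thesis's implementation: the function name `nca_class_probabilities`, the fixed projection matrix `A` (which NCA would learn, but which is simply supplied here) and the toy data are all assumptions made for illustration.

```python
import numpy as np

def nca_class_probabilities(A, X_train, y_train, x_query, n_classes):
    """Soft nearest-neighbor class weights for x_query in the space projected by A.

    A is assumed to be given; real NCA learns it by maximizing
    leave-one-out neighbor-selection probabilities.
    """
    Z = X_train @ A.T                      # project the training vectors
    z = A @ x_query                        # project the query vector
    sq_dist = np.sum((Z - z) ** 2, axis=1)
    # Softmax over negative squared distances (shifted for numerical stability).
    w = np.exp(-(sq_dist - sq_dist.min()))
    w /= w.sum()
    # Sum the neighbor weights belonging to each class label.
    probs = np.zeros(n_classes)
    np.add.at(probs, y_train, w)
    return probs

# Toy usage: two well-separated classes in 2-D, identity projection.
X = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
y = np.array([0, 0, 1, 1])
p = nca_class_probabilities(np.eye(2), X, y, np.array([0.05, 0.0]), n_classes=2)
```

A lower-dimensional `A` (fewer rows than columns) would give the dimensionality-reduction use mentioned in the abstract; the identity matrix is used here only to keep the example easy to check by hand.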
Williams, A. Lynn. „Phonologic and Acoustic Analyses of Final Consonant Omission“. Digital Commons @ East Tennessee State University, 1998. https://dc.etsu.edu/etsu-works/2008.
Lee, Matthew E. „Acoustic Models for the Analysis and Synthesis of the Singing Voice“. Diss., Georgia Institute of Technology, 2005. http://hdl.handle.net/1853/6859.
Ng, So-sum. „Acoustic analysis of contour tones produced by Cantonese dysarthric speakers“. Click to view the E-thesis via HKUTO, 2001. http://sunzi.lib.hku.hk/hkuto/record/B36208024.
"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, May 4, 2001." Also available in print.
Srinivasan, Nandini. „Acoustic Analysis of English Vowels by Young Spanish-English Bilingual Language Learners“. Thesis, The George Washington University, 2018. http://pqdtopen.proquest.com/#viewpdf?dispub=10815722.
Several studies across various languages have shown that monolingual listeners perceive significant differences between the speech of monolinguals and bilinguals. However, these differences may not always affect the phoneme category as identified by the listener or the speaker; differences may often be found between tokens corresponding to unique phonological categories and, as such, be more easily detectable through acoustic analysis. We hypothesized that unshared English vowels produced by young Spanish-English bilinguals would have measurably different formant values and duration than the same vowels produced by young English monolinguals because of Spanish influence on English phonology. We did not find significant differences in formant values between the two groups, but we found that Spanish-English bilinguals produced certain vowels with longer duration than English monolinguals. Our findings add to the ever-growing body of literature on bilingual language acquisition and the perception of accentedness.
Odlozinski, Lisa M. „An acoustic analysis of speech rate control procedures in Parkinson's disease“. Thesis, National Library of Canada = Bibliothèque nationale du Canada, 1998. http://www.collectionscanada.ca/obj/s4/f2/dsk2/tape17/PQDD_0004/MQ30738.pdf.
Cao, Ying Alisa 1979. „Analysis of acoustic cues for identifying consonant /ð/ in continuous speech“. Thesis, Massachusetts Institute of Technology, 2002. http://hdl.handle.net/1721.1/87279.
Badenhorst, Jacob Andreas Cornelius. „Data sufficiency analysis for automatic speech recognition / by J.A.C. Badenhorst“. Thesis, North-West University, 2009. http://hdl.handle.net/10394/3994.
Thesis (M. Ing. (Computer and Electronical Engineering))--North-West University, Potchefstroom Campus, 2009.
Daly, Nancy Ann. „Acoustic-phonetic and linguistic analyses of spontaneous speech : implications for speech understanding“. Thesis, Massachusetts Institute of Technology, 1994. http://hdl.handle.net/1721.1/12009.
Includes bibliographical references (leaves 142-149).
by Nancy Ann Daly.
Ph.D.
Dempster, Gavin John. „A large-scale analysis of the acoustic-phonetic markers of speaker sex“. Thesis, University of Sheffield, 1996. http://etheses.whiterose.ac.uk/10188/.
Higa, Rodrigo Hitoshi. „Influence of different orthodontic upper retainers in speech: analysis by laypersons and acoustic analysis“. Universidade de São Paulo, 2018. http://www.teses.usp.br/teses/disponiveis/25/25144/tde-02102018-221945/.
Introduction: The aim of this study was to evaluate the influence of different upper retainers on speech through auditory-perceptual analysis by laypersons and acoustic analysis. Methods: Eighteen volunteers were selected to use four types of upper retainers: a conventional Wrap-Around plate (WAC), a Wrap-Around modified into a horseshoe shape (WAF), a Wrap-Around modified with an anterior opening (WAO), and a transparent thermoplastic retainer (CTT). Each was worn for 21 days, with 7-day intervals without use between them. Speech was assessed in recordings of speech samples made before the retainers were fitted (T0), immediately after each retainer was fitted (T1), and after 21 days of use (T2). The laypersons' auditory-perceptual analysis used a 100 mm visual analog scale, while the acoustic analysis consisted of evaluating the mean diadochokinetic rate (DDC) and the frequencies of formants F1 and F2 of the fricative consonants. One-way and two-way ANOVA tests were used. Results: In the laypersons' auditory-perceptual analysis, scores worsened from T0 to T1 for all retainers, but they were statistically lower only for WAC and CTT. At T2 the scores rose again, but for CTT the score was still statistically lower than at T0, while for WAO the difference between T0 and T2 was practically nil. There were no changes in the DDC values. For the formants, there was in general a difference from T0 to T1 and little difference from T0 to T2; in the comparison between appliances, WAC showed larger changes in the F1 formants of some consonants, WAO showed smaller values, and the other devices showed intermediate values. 
Conclusions: In both types of analysis (subjective and objective), speech changed after each retainer was fitted and improved after 21 days of use. Laypersons judged the speech changes to be greatest with CTT and smallest with WAO. In the acoustic analysis, the values were worst for WAC, while the changes were smallest for WAO.
Jackson, Philip J. B. „Characterisation of plosive, fricative and aspiration components in speech production“. Thesis, University of Southampton, 2000. https://eprints.soton.ac.uk/254111/.
Jones, Catherine Jacquelynn Julia. „Queclaratives in Xhosa : an acoustic and perceptual analysis“. Thesis, Stellenbosch : Stellenbosch University, 2001. http://hdl.handle.net/10019.1/52426.
ENGLISH ABSTRACT: Key words: acoustic speech analysis, speech synthesis, speech perception, copulative queclarative, linguistics, psycholinguistics, human language technology. This study investigates the notion of interrogativity in Xhosa as expressed in the form of Queclaratives. Queclaratives, or statements which are question-like in function, have been studied in many languages of the world. Unfortunately with regard to the Bantu languages, studies relating to interrogativity in general have largely been impressionistic in nature. This research comprised two aspects of analysis. These included an acoustic and a perceptual analysis of data. The reason for this approach is that, without this combination the results could have been considered to be suspect and lacking in authenticity. The acoustic analysis was conducted on 858 words in statement and queclarative pairs. Significant parameters were extracted and these were then statistically analyzed. The results revealed that duration on the penultimate vowel, pitch on the penultimate vowel and the overall raised pitch of queclaratives as opposed to statements were indeed the acoustically significant parameters differentiating statements from queclaratives. However as is well known, there is no one-to-one relationship between the acoustic signal and its perception and, therefore, it is imperative that such findings also be perceptually tested. The perceptual testing of these parameters was conducted in an attempt to elicit whether they were perceptually significant and also at what point in the utterance listeners could differentiate between queclaratives and statements. The next progression was the compilation of carefully designed perception tests on the acoustically significant parameters. Two experiments were compiled using stimuli that were manipulations of the original signal of one of the selected informant's utterances. 
These tests were administered on multimedia computers in the Language Laboratory at the University of Stellenbosch using 64 subjects for the first experiment and 63 for the second. The results of the perception tests showed that duration and pitch on the penultimate syllable are perceptually highly significant in differentiating statements from queclaratives. However the results also indicated very early recognition of the different forms with minimal speech segments from which the penultimate vowels were absent altogether. This then suggests that the perceptual judgements made earlier in the utterance may be either reinforced or overridden by the duration and pitch on the penultimate vowel. These results have assisted in the validation of some impressionistic claims made within the Bantu and other languages, while refuting others. However, as this corpus of data has included research on copulative queclaratives, it appeals for further research on this subject using any other linguistic markers. The results have also been evaluated in terms of their possible contribution to the related disciplines of psycholinguistics, linguistics and human language technologies. In so doing, the thesis makes an urgent appeal to researchers to pursue this experimental approach to language research. Another appeal is made for an awareness campaign as to the importance of this approach in harnessing the power of language for the development of language and society as a whole. The fertility of the South African society lies in its richness of multilingualism and the necessity for the improvement of the dissemination of information to all people of all languages and the improvement of communication between people in general, including those less fortunate in terms of literacy skills.
AFRIKAANS SUMMARY: Keywords: acoustic speech analysis, speech synthesis, speech perception, copulative queclarative, linguistics, psycholinguistics, language and speech technology. This project investigates the nature of question formation in Xhosa with regard to the queclarative form. Queclaratives, or statements that can also function as questions, have already been studied for many of the world's languages. In general, however, studies of question formation in the African languages have been largely impressionistic in nature. This research project consisted of two analysis components, namely an acoustic analysis of the data and a series of perceptual experiments. Without the combination of the two types of analysis, the results of the research would have been less credible. The acoustic analysis was carried out on 858 word pairs consisting of statements and queclaratives. The data were statistically analysed and the relevant parameters extracted. The results indicated that the duration and pitch of the penultimate vowel, as well as the register of the whole word, are important parameters in distinguishing statements from queclaratives. Since it is well known that there is no one-to-one relationship between the acoustic signal and its perception, it was necessary to carry out a perceptual experiment as well. The perceptual tests were designed to determine which acoustic parameters are also perceptually relevant and to find the earliest syllable at which listeners can already distinguish between the two forms. The next step was to compile stimuli for the perception tests that would indeed deliver the above results. Stimuli were created by manipulating the speech data of one speaker. The perception tests were then carried out on multimedia computers in the Language Laboratory of the University of Stellenbosch. 
The results of the perception tests showed that the duration and pitch of the penultimate syllable are also perceptually important in distinguishing between the different forms. It was also clear that subjects could distinguish the forms from each other with minimal amounts of information in which the penultimate and final syllables were entirely absent. This indicates that listeners make perceptual decisions very early in the word, but that these decisions can be either reinforced or reversed by the duration and pitch of the penultimate syllable. The results of the research supported certain impressionistic claims regarding African languages, while other claims were shown to be incorrect. One of the important findings was that the impressionistic view that statements have a falling intonation contour and questions a rising intonation contour toward the end of the utterance is an oversimplification. This work was done on single-word copulative queclaratives and lends itself to being extended to phrases and sentences in future research. Furthermore, the research results were related to allied disciplines such as psycholinguistics, linguistics, and language and speech technology. A plea was made for an awareness campaign to emphasise the importance of this type of research in harnessing the potential of language for the development of Southern African languages and communities. The richness of our society lies in its multilingualism, which presents particular challenges: improving the dissemination of information to all people of all languages, and improving communication between people in general and specifically for those with lower levels of literacy.
Keerio, Ayaz. „Acoustic analysis of Sindhi speech : a pre-curser for an ASR system“. Thesis, University of Sussex, 2011. http://sro.sussex.ac.uk/id/eprint/6325/.
Ofuka, Etsuko. „Acoustic and perceptual analyses of politeness in Japanese speech“. Thesis, University of Leeds, 1996. http://etheses.whiterose.ac.uk/1036/.
TAKEDA, Kazuya, Norihide KITAOKA and Makoto SAKAI. „Acoustic Feature Transformation Based on Discriminant Analysis Preserving Local Structure for Speech Recognition“. Institute of Electronics, Information and Communication Engineers, 2010. http://hdl.handle.net/2237/14969.
Bezuidenhout, Hannelie. „An electroglottographic and acoustic analysis of glottal activity during speech initiation in stuttering“. Pretoria : [s.n.], 2006. http://upetd.up.ac.za/thesis/available/etd-09252008-142958.
Tyson, Na'im R. „Exploration of Acoustic Features for Automatic Vowel Discrimination in Spontaneous Speech“. The Ohio State University, 2012. http://rave.ohiolink.edu/etdc/view?acc_num=osu1339695879.
Nissen, Shawn L. „An Acoustic Analysis of Voiceless Obstruents Produced by Adults and Typically Developing Children“. The Ohio State University, 2003. http://rave.ohiolink.edu/etdc/view?acc_num=osu1041225568.
Saikachi, Yoko. „Development, perceptual evaluation, and acoustic analysis of amplitude-based F0 control in Electrolarynx speech“. Thesis, Massachusetts Institute of Technology, 2009. http://hdl.handle.net/1721.1/54667.
"September 2009." Cataloged from PDF version of thesis.
Includes bibliographical references (p. 120-126).
An Electrolarynx (EL) is a battery-powered device that produces a sound that can be used to acoustically excite the vocal tract as a substitute for laryngeal voice production. ELs provide laryngectomy patients with the basic capability to communicate, but current EL devices produce a mechanical speech quality which has been largely attributed to the lack of natural fundamental frequency (F0) variation. In order to improve the quality of EL speech, the present study aimed to develop and evaluate an automatic F0 control scheme, in which F0 was modulated based on variations in the root-mean-squared (RMS) amplitude of the EL speech signal. Recordings of declarative sentences produced by two male subjects before and after total laryngectomy were used to develop procedures for calculating F0 contours for EL speech, and perceptual experiments and acoustic analyses were conducted to examine the impact of F0 modulation on the quality and prosodic function of the EL speech. The results of perceptual experiments showed that modulating the F0 of EL speech using a linear relationship between amplitude and frequency made it significantly more natural sounding than EL speech with constant F0, but also revealed some limitations in terms of communicating linguistic contrasts (distinction between question vs. statement and location of contrastive stress). Results are interpreted in relation to the acoustic characteristics of F0 modified EL speech and discussed in terms of their clinical implications and suggestion for improved algorithms of F0 control in EL speech.
by Yoko Saikachi.
Ph.D.
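The amplitude-based F0 control summarized in the abstract above (F0 modulated through a linear relationship with the RMS amplitude of the signal) can be sketched as follows. This is an illustration under stated assumptions, not the procedure from the thesis: the function names, the frame and hop sizes, the min-max scaling and the 80-120 Hz range are all hypothetical choices.

```python
import numpy as np

def frame_rms(signal, frame_len, hop):
    """Root-mean-squared amplitude of each analysis frame."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    return np.array([
        np.sqrt(np.mean(signal[i * hop : i * hop + frame_len] ** 2))
        for i in range(n_frames)
    ])

def amplitude_to_f0(rms, f0_min=80.0, f0_max=120.0):
    """Map frame RMS linearly onto an F0 range (hypothetical range in Hz)."""
    lo, hi = rms.min(), rms.max()
    if hi == lo:
        # Flat amplitude: fall back to a constant mid-range F0.
        return np.full_like(rms, (f0_min + f0_max) / 2.0)
    return f0_min + (rms - lo) / (hi - lo) * (f0_max - f0_min)

# Toy usage: a quiet half followed by a loud half.
signal = np.concatenate([0.1 * np.ones(100), 1.0 * np.ones(100)])
rms = frame_rms(signal, frame_len=50, hop=50)
f0 = amplitude_to_f0(rms)
```

In this toy case the quiet frames map to the bottom of the assumed range and the loud frames to the top, so louder stretches of speech receive a higher F0.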
Butt, Abdul Haleem. „Speech Assessment for the Classification of Hypokinetic Dysthria in Parkinson Disease“. Thesis, Högskolan Dalarna, Datateknik, 2012. http://urn.kb.se/resolve?urn=urn:nbn:se:du-10041.
Ng, Yuk-sim Cherry. „Perceptual and acoustic analysis of dysarthric dysphonia direct magnitude estimation versus interval scaling“. Click to view the E-thesis via HKUTO, 2002. http://sunzi.lib.hku.hk/hkuto/record/B36208425.
"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, May 10, 2002." Also available in print.
Li, Yee-key Nicole, and 李依祺. „Acoustic and perceptual analysis of modal and falsetto registers in females with dysphonia“. Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2003. http://hub.hku.hk/bib/B26653278.
Li, Qiang. „Acoustic Analysis of Intonation in Persons with Parkinson's Disease Receiving Transcranial Magnetic Stimulation and Intensive Voice Therapy“. Thesis, University of Louisiana at Lafayette, 2019. http://pqdtopen.proquest.com/#viewpdf?dispub=10843550.
Intonation is one of the prosodic features manifested acoustically in the fundamental frequency (F0). Intonation abnormality is common and prominent in the speech of persons with Parkinson's disease (PD). Intensive speech therapies such as Lee Silverman Voice Treatment (LSVT-LOUD®) have been demonstrated effective for increasing vocal intensity and F0 variability, but no prior studies have examined linguistic features of intonation before and after treatment in PD. Additionally, transcranial magnetic stimulation (TMS) has been demonstrated as an appropriate adjuvant to a primary treatment. It is reasonable to hypothesize that intonation abnormality will be improved after the combined modality treatment of LSVT-LOUD® and TMS. To examine this hypothesis, the current research investigated acoustically five intonational features including F0 declination, resetting, emphasis, terminal fall, and syntactic pre-junctural fall in twenty PD participants, receiving LSVT-LOUD® alone, or combined with TMS delivered to the left or right cerebral hemisphere. The primary experiment was designed and carried out by Shalini Narayana and colleagues in their project funded by the Michael J. Fox Foundation for Parkinson's Research. They collected and provided the recorded reading samples for the current study.
F0 changes in each of the five intonational features were measured before and after the combined modality treatment and at the two-month follow-up, then analyzed statistically. The results revealed that F0 declination, emphasis, and terminal fall changed significantly from pre- to post-treatment, and the changes in declination and terminal fall were maintained at the follow-up evaluations.
The observed changes in intonation were attributed to LSVT alone, which caused large changes of F0 magnitude. F0 resetting and syntactic pre-junctural fall did not change significantly following treatment, probably because these intonational features need very precise fine motor control of the intrinsic laryngeal muscles to make small-range, rapid F0 adjustments, which were not improved by LSVT in present PD participants. Difficulties with syntactic processing previously reported in PD may have contributed to the lack of improvement in resetting and pre-junctural fall, since these F0 features are used to mark syntactic boundaries within utterances. Consideration of incorporation of linguistic intonation to speech intervention for speakers with PD is suggested for future clinical research.
TAKEDA, Kazuya, Seiichi NAKAGAWA, Yuya HATTORI, Norihide KITAOKA and Makoto SAKAI. „Evaluation of Combinational Use of Discriminant Analysis-Based Acoustic Feature Transformation and Discriminative Training“. Institute of Electronics, Information and Communication Engineers, 2010. http://hdl.handle.net/2237/14968.
Bianchi, Michelle. „Effects of clear speech and linguistic experience on acoustic characteristics of vowel production“. [Tampa, Fla.] : University of South Florida, 2007. http://purl.fcla.edu/usf/dc/et/SFE0002084.
Talkar, Tanya. „Design of tool for analysis of speech development disorders using landmarks and other acoustic cues“. Thesis, Massachusetts Institute of Technology, 2017. http://hdl.handle.net/1721.1/113098.
This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
Cataloged from student-submitted PDF version of thesis.
Includes bibliographical references (page 71).
Non-word repetition tasks have been used to diagnose children with various developmental difficulties with phonology, but these productions have not been phonetically analyzed to reveal the nature of the modifications produced by children diagnosed with SLI, autism spectrum disorder or dyslexia compared to those produced by typically-developing children. In this thesis, we compared the modification of predicted acoustic cues to distinctive features of manner, place and voicing for just under 30 children (ages 5-12), for the CN-Rep word inventory, in an extension of the earlier analysis in Levy et al. 2014. Feature cues, including abrupt acoustic landmarks (Stevens 2002) and other acoustic feature cues, were hand-labeled and analysis of factors that may influence feature cue modifications included position in the word, position in the syllable, word length measured in syllables, lexical stress, and manner type. Results suggest specific patterns of modification in specific contexts for specific clinical populations. These findings set the foundation for understanding how phonetic variation in speech arises in both typical and clinical populations, and for using this knowledge to develop tools to aid in more accurate and insightful diagnosis as well as improved intervention methods.
by Tanya Talkar.
M. Eng.
Celaya, Marissa. „Speech Adaptation to Electropalatography in Children's Productions of /s/ and /ʃ/“. BYU ScholarsArchive, 2014. https://scholarsarchive.byu.edu/etd/4103.
Smith, Megan Marie. „The Sound of the Snow Queen: An Acoustic Analysis of Vowel Clarity in "Let it Go"“. Ohio University Honors Tutorial College / OhioLINK, 2016. http://rave.ohiolink.edu/etdc/view?acc_num=ouhonors1461277153.
Torgerson, Richard Christen. „A Comparison of Beijing and Taiwan Mandarin Tone Register: An Acoustic Analysis of Three Native Speech Styles“. Diss., CLICK HERE for online access, 2005. http://contentdm.lib.byu.edu/ETD/image/etd1003.pdf.
Chan, Carlos Chun Ming. „Speaker model adaptation in automatic speech recognition“. Thesis, Robert Gordon University, 1993. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.339307.
Schulz, Henrik. „Large vocabulary continuous speech recognition for the transcription of Catalan broadcast news and conversations : towards analysis and modelling of acoustic reduction in spontaneous speech“. Doctoral thesis, Universitat Politècnica de Catalunya, 2017. http://hdl.handle.net/10803/405985.
Transcribing spontaneous conversations still poses a challenge for current automatic speech recognition methods. This thesis describes the development of a large vocabulary continuous speech recognition system for the transcription of conversations and broadcast news in Catalan, and leads towards new approaches for the analysis and modelling of acoustic reduction in spontaneous conversations. It focuses initially on several conventional methods for acoustic analysis, acoustic and language modelling, and hypothesis search. The improvements over the original single-pass system are mainly due to domain and speaking style, with emphasis on language model interpolation, linear discrimination and projection of acoustic observations, speaker-adaptive training to improve the separation of phonetic classes, normalisation of acoustic observations, and acoustic model adaptation in a multi-pass system. The analysis of acoustic reduction initially emphasises the independent spectral and temporal properties of specific vowels and consonants, whose parameters show statistically significant differences between phoneme prototypes in spontaneous conversation and their canonical realisation in planned speech. The introduction of feature space analysis provides a general means of revealing these differences in the conventional acoustic observations used for automatic speech recognition. It shows statistically significant differences that are independent of context, but also differences between adjacent phonemes in syllable context, suggesting particular reduction patterns. In addition, the analysis challenges the often-suggested coherence between simultaneous reductions of spectral and temporal properties. 
The modelling of acoustic reduction first emphasises discriminant variables for each segment, variability class dependent models, and adaptation of the original acoustic model. As discriminant variables, it introduces phoneme rate as a means of analysing temporal properties, and the feature space reduction ratio as a means of analysing the reduction of spectral properties in the conventional feature space for large vocabulary continuous speech recognition. These variables are clustered and determine the classes for segment-level variability dependent models and their scoring during recognition and hypothesis search. Neither approach shows a significant improvement in performance. Furthermore, the modelling techniques move towards segment predictability dependent models, which introduce predictability as a discriminant variable for variability class dependent models, based on the fundamental coherence between predictability and acoustic reduction suggested by the principle of least effort and redundancy theory. It therefore emphasises the predictability of words and phonemes. This approach yields no significant improvement in performance. Planned speech is apparently at odds with the principle of least effort. Therefore, a prior segment-conditioned analysis of acoustic reduction can indicate a segment's average degree of reduction, while intra-segmental variation can indicate whether it exhibits enough relaxation in speaking style to adopt the principle of least effort. Segments with little intra-segmental variation can thus be modelled separately from those with large intra-segmental variation, while modelling the latter with word, syllable or phoneme predictability dependent models may offer a viable research perspective.
Kocjancic, Tanja. „Ultrasound and acoustic analysis of lingual movement in teenagers with childhood apraxia of speech, control adults and typically developing children“. Thesis, Queen Margaret University, 2010. https://eresearch.qmu.ac.uk/handle/20.500.12289/7448.
Yang, Lening. „Computer modelling of speech intelligibility in underground stations“. Thesis, London South Bank University, 1997. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.245130.
Freij, G. J. „Enhanced sequential adaptive linear prediction for speech encoding“. Thesis, University of Liverpool, 1985. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.356268.
Beet, S. W. „Digital processing of speech produced in hyperbaric helium“. Thesis, University of Liverpool, 1985. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.356244.
Crozier, Philip Mark. „Enhancement techniques for noise affected telephone quality speech“. Thesis, University of Liverpool, 1994. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.321115.
Rex, James Alexander. „Microphone signal processing for speech recognition in cars“. Thesis, University of Southampton, 2000. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.326728.
Thomas, T. J. „An articulatory model of speech production including turbulence“. Thesis, University of Cambridge, 1985. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.333125.
Narayanan, Arun. „Computational auditory scene analysis and robust automatic speech recognition“. The Ohio State University, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=osu1401460288.
Chen, Jitong. „On Generalization of Supervised Speech Separation“. The Ohio State University, 2017. http://rave.ohiolink.edu/etdc/view?acc_num=osu1492038295603502.
Wang, Yuxuan. „Supervised Speech Separation Using Deep Neural Networks“. The Ohio State University, 2015. http://rave.ohiolink.edu/etdc/view?acc_num=osu1426366690.
Ghaidan, Khaldoon A. „A study of the application of modern techniques to speech waveform analysis“. Thesis, Loughborough University, 1986. https://dspace.lboro.ac.uk/2134/28015.
Lee, Sang Jun. „Comparative analysis of speech intelligibility in church acoustics using computer modeling“. [Gainesville, Fla.] : University of Florida, 2003. http://purl.fcla.edu/fcla/etd/UFE0000866.
Raeesy, Zeynabalsadat. „Automatic analysis of magnetic resonance images of speech articulation“. Thesis, University of Oxford, 2013. http://ora.ox.ac.uk/objects/uuid:ffa6d290-6920-4204-8d65-e4f2f09278c5.
Perkins, Rosalie. „PHONETIC AND ACOUSTIC ANALYSES OF TWO NEW CASES OF FOREIGN ACCENT SYNDROME“. Master's thesis, University of Central Florida, 2007. http://digital.library.ucf.edu/cdm/ref/collection/ETD/id/4183.
M.A.
Department of Communication Sciences and Disorders
Health and Public Affairs
Comm Sciences & Disorders MA
Panico, Adriana Campos Balieiro. „Julgamento do comportamento vocal de jornalistas em diferentes estilos de notícias e seus correlatos acústicos“. Universidade de São Paulo, 2005. http://www.teses.usp.br/teses/disponiveis/59/59134/tde-03042006-164315/.
This study investigated the acoustic correlates of different delivery styles in TV newscasts (neutral, serious and relaxed) and how well listeners identify those styles. Experienced presenters of both genders, who appear regularly in network TV newscasts, recorded a text with the same semantic content three times, once in each delivery style. The recordings were stored on a CD in WAVE format for acoustic analysis of the frequency, intensity and duration parameters using the Dr. Speech 4.0 software. On a second CD, in audio format, the samples were randomized and judged by 30 subjects, whose task was to identify the styles. The acoustic parameters that differed significantly among the styles were average Fo, maximum Fo, Fo variation and speech time. The listeners were able to identify the different styles. The samples were then separated by style onto three audio CDs and presented to lay listeners using a paired-comparison method, to be judged with respect to each delivery style. These data were submitted to multidimensional scaling (MDS) to determine the dimensions underlying the different delivery styles. Two dimensions were determined for each style. In the relaxed style, the first dimension had no significantly correlated acoustic parameter. In the second dimension, without distinguishing by gender, the number of semitones was significant; for the female voices, minimum Fo, Fo variation and the number of semitones were significant; and for the male voices, no acoustic parameter was significantly correlated.
In the neutral style, two parameters were significantly correlated with the first dimension when gender was not distinguished: minimum Fo and the number of semitones. For the female voices, Fo variation was significantly correlated; for the male voices, maximum Io. In the second dimension, no parameter showed a significant correlation. In the serious style, the first dimension was significantly correlated with speech time, and only for the male voices. The second dimension showed the following significantly correlated parameters: without distinguishing by gender, average Fo, minimum Fo and minimum Io; for the female voices, minimum Io; and for the male voices, no acoustic parameter was significantly correlated. Based on these results, possibilities for intervention with individuals who use communication professionally are discussed.
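The abstract above reports "the number of semitones" without defining it. As a minimal sketch, assuming it refers to the span between a speaker's minimum and maximum Fo expressed on the musical semitone scale, it could be computed as follows. The function name `semitone_range` and the example values are hypothetical and do not come from the thesis or from Dr. Speech.

```python
import math


def semitone_range(fo_min_hz: float, fo_max_hz: float) -> float:
    """Return the interval between two Fo values in semitones.

    Assumption: "number of semitones" means the Fo span on the
    equal-tempered scale, where one octave (a doubling of frequency)
    equals 12 semitones.
    """
    if fo_min_hz <= 0 or fo_max_hz <= 0:
        raise ValueError("Fo values must be positive frequencies in Hz")
    return 12.0 * math.log2(fo_max_hz / fo_min_hz)


# Hypothetical example: a voice ranging from 100 Hz to 200 Hz spans one octave.
span = semitone_range(100.0, 200.0)
```

Expressing the range in semitones rather than in Hz makes female and male voices more directly comparable, because the scale is logarithmic and so depends only on the ratio of the two frequencies.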