Dissertations / Theses on the topic 'Pronunciation'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Pronunciation.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Erichsen, Stian. "Evaluating Vowel Pronunciation in Computer Assisted Pronunciation Training." Thesis, Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon, 2011. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-13377.
Full textVersvik, Eivind. "Computer Assisted Pronunciation Training : Evaluation of non-native vowel length pronunciation." Thesis, Norwegian University of Science and Technology, Department of Electronics and Telecommunications, 2009. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-9016.
Full textComputer Assisted Pronunciation Training systems have become popular tools to train on second languages. Many second language learners prefer to train on pronunciation in a stress free environment with no other listeners. There exists no such tool for training on pronunciation of the Norwegian language. Pronunciation exercises in training systems should be directed at important properties in the language which the second language learners are not familiar with. In Norwegian two acoustically similar words can be contrasted by the vowel length, these words are called vowel length words. The vowel length is not important in many other languages. This master thesis has examined how to make the part of a Computer Assisted Pronunciation Training system which can evaluate non-native vowel length pronunciations. To evaluate vowel length pronunciations a vowel length classifier was developed. The approach was to segment utterances using automatic methods (Dynamic Time Warping and Hidden Markov Models). The segmented utterances were used to extract several classification features. A linear classifier was used to discriminate between short and long vowel length pronunciations. The classifier was trained by the Fisher Linear Discriminant principle. A database of Norwegian words of minimal pairs with respect to vowel length was recorded. Recordings from native Norwegians were used for training the classifier. Recordings from non-natives (Chinese and Iranians) were used for testing, resulting in an error rate of 6.7%. Further, confidence measures were used to improve the error rate to 3.4% by discarding 8.3% of the utterances. It could be argued that more than half of the discarded utterances were correctly discarded because of errors in the pronunciation. A CAPT demo, which was developed in an former assignment, was improved to use classifiers trained with the described approach.
Liu, Yang. "The effectiveness of integrating commercial pronunciation software into an ESL pronunciation class." [Ames, Iowa : Iowa State University], 2008.
Find full textDavel, Marelie Hattingh. "Pronunciation modelling and bootstrapping." Thesis, Pretoria : [s.n.], 2005. http://upetd.up.ac.za/thesis/available/etd-10112005-150530.
Full textCenterman, Sofi, and Felix Krausz. "Common L2 Pronunciation Errors." Thesis, Malmö högskola, Lärarutbildningen (LUT), 2011. http://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-32834.
Full textEckstein, Grant Taylor. "A Correlation of Pronunciation Learning Strategies with Spontaneous English Pronunciation of Adult ESL Learners." BYU ScholarsArchive, 2007. https://scholarsarchive.byu.edu/etd/973.
Full textRobins, Seth L. "Examining the Effects of Pronunciation Strategy Usage on Pronunciation Gains by L2 Japanese Learners." BYU ScholarsArchive, 2010. https://scholarsarchive.byu.edu/etd/2840.
Full text(UPC), Universidad Peruana de Ciencias Aplicadas. "Pronunciation and phonetics - TR192 201801." Universidad Peruana de Ciencias Aplicadas (UPC), 2018. http://hdl.handle.net/10757/623641.
Full textAlsabaan, Majed Soliman K. "Pronunciation support for Arabic learners." Thesis, University of Manchester, 2015. https://www.research.manchester.ac.uk/portal/en/theses/pronunciation-support-for-arabic-learners(3db28816-90ed-4e8b-b64c-4bbd35f98be7).html.
Full textFlisberg, Julia. "English pronunciation in Swedish Upper Secondary School Students : A qualitative study of Swedish students’ pronunciation tendencies." Thesis, Stockholms universitet, Engelska institutionen, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-165479.
Full textPeterson, Susan S. "Pronunciation Learning Strategies and Learning Strategies Related to Pronunciation Ability in American University Students Studying Spanish." The Ohio State University, 1997. http://rave.ohiolink.edu/etdc/view?acc_num=osu1394793631.
Full textOviedo, Kerry Thomson. "The second language learner and pronunciation." reponame:Repositório Institucional da UFPR, 2010. http://hdl.handle.net/1884/22508.
Full textBadr, Ibrahim. "Pronunciation learning for automatic speech recognition." Thesis, Massachusetts Institute of Technology, 2011. http://hdl.handle.net/1721.1/66022.
Full textCataloged from PDF version of thesis.
Includes bibliographical references (p. 99-101).
In many ways, the lexicon remains the Achilles heel of modern automatic speech recognizers (ASRs). Unlike stochastic acoustic and language models that learn the values of their parameters from training data, the baseform pronunciations of words in an ASR vocabulary are typically specified manually, and do not change, unless they are edited by an expert. Our work presents a novel generative framework that uses speech data to learn stochastic lexicons, thereby taking a step towards alleviating the need for manual intervention and automnatically learning high-quality baseform pronunciations for words. We test our model on a variety of domains: an isolated-word telephone speech corpus, a weather query corpus and an academic lecture corpus. We show significant improvements of 25%, 15% and 2% over expert-pronunciation lexicons, respectively. We also show that further improvements can be made by combining our pronunciation learning framework with acoustic model training.
by Ibrahim Badr.
S.M.
Loots, Linsen. "Data-driven augmentation of pronunciation dictionaries." Thesis, Stellenbosch : University of Stellenbosch, 2010. http://hdl.handle.net/10019.1/4212.
Full textENGLISH ABSTRACT: This thesis investigates various data-driven techniques by which pronunciation dictionaries can be automatically augmented. First, well-established grapheme-to-phoneme (G2P) conversion techniques are evaluated for Standard South African English (SSAE), British English (RP) and American English (GenAm) by means of four appropriate dictionaries: SAEDICT, BEEP, CMUDICT and PRONLEX. Next, the decision tree algorithm is extended to allow the conversion of pronunciations between different accents by means of phoneme-to-phoneme (P2P) and grapheme-andphoneme- to-phoneme (GP2P) conversion. P2P conversion uses the phonemes of the source accent as input to the decision trees. GP2P conversion further incorporates the graphemes into the decision tree input. Both P2P and GP2P conversion are evaluated using the four dictionaries. It is found that, when the pronunciation is needed for a word not present in the target accent, it is substantially more accurate to modify an existing pronunciation from a different accent, than to derive it from the word’s spelling using G2P conversion. When converting between accents, GP2P conversion provides a significant further increase in performance above P2P. Finally, experiments are performed to determine how large a training dictionary is required in a target accent for G2P, P2P and GP2P conversion. It is found that GP2P conversion requires less training data than P2P and substantially less than G2P conversion. Furthermore, it is found that very little training data is needed for GP2P to perform at almost maximum accuracy. The bulk of the accuracy is achieved within the initial 500 words, and after 3000 words there is almost no further improvement. Some specific approaches to compiling the best training set are also considered. By means of an iterative greedy algorithm an optimal ranking of words to be included in the training set is discovered. Using this set is shown to lead to substantially better GP2P performance for the same training set size in comparison with alternative approaches such as the use of phonetically rich words or random selections. A mere 25 words of training data from this optimal set already achieve an accuracy within 1% of that of the full training dictionary.
AFRIKAANSE OPSOMMING: Hierdie tesis ondersoek verskeie data-gedrewe tegnieke waarmee uitspraakwoordeboeke outomaties aangevul kan word. Eerstens word gevestigde grafeem-na-foneem (G2P) omskakelingstegnieke ge¨evalueer vir Standaard Suid-Afrikaanse Engels (SSAE), Britse Engels (RP) en Amerikaanse Engels (GenAm) deur middel van vier geskikte woordeboeke: SAEDICT, BEEP, CMUDICT en PRONLEX. Voorts word die beslissingsboomalgoritme uitgebrei om die omskakeling van uitsprake tussen verskillende aksente moontlik te maak, deur middel van foneem-na-foneem (P2P) en grafeem-en-foneem-na-foneem (GP2P) omskakeling. P2P omskakeling gebruik die foneme van die bronaksent as inset vir die beslissingsbome. GP2P omskakeling inkorporeer verder die grafeme by die inset. Beide P2P en GP2P omskakeling word evalueer deur middel van die vier woordeboeke. Daar word bevind dat wanneer die uitspraak benodig word vir ’n woord wat nie in die teikenaksent teenwoordig is nie, dit bepaald meer akkuraat is om ’n bestaande uitspraak van ’n ander aksent aan te pas, as om dit af te lei vanuit die woord se spelling met G2P omskakeling. Wanneer daar tussen aksente omgeskakel word, gee GP2P omskakeling ’n verdere beduidende verbetering in akkuraatheid bo P2P. Laastens word eksperimente uitgevoer om die grootte te bepaal van die afrigtingswoordeboek wat benodig word in ’n teikenaksent vir G2P, P2P en GP2P omskakeling. Daar word bevind dat GP2P omskakeling minder afrigtingsdata as P2P en substansieel minder as G2P benodig. Verder word dit bevind dat baie min afrigtingsdata benodig word vir GP2P om teen bykans maksimum akkuraatheid te funksioneer. Die oorwig van die akkuraatheid word binne die eerste 500 woorde bereik, en n´a 3000 woorde is daar amper geen verdere verbetering nie. ’n Aantal spesifieke benaderings word ook oorweeg om die beste afrigtingstel saam te stel. Deur middel van ’n iteratiewe, gulsige algoritme word ’n optimale rangskikking van woorde bepaal vir insluiting by die afrigtingstel. Daar word getoon dat deur hierdie stel te gebruik, substansieel beter GP2P gedrag verkry word vir dieselfde grootte afrigtingstel in vergelyking met alternatiewe benaderings soos die gebruik van foneties-ryke woorde of lukrake seleksies. ’n Skamele 25 woorde uit hierdie optimale stel gee reeds ’n akkuraatheid binne 1% van di´e van die volle afrigtingswoordeboek.
Madzo, Daniela. "Teachers’ Attitudes Towards Teaching English Pronunciation." Thesis, Jönköping University, Högskolan för lärande och kommunikation, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:hj:diva-51748.
Full textGonzalez, Johnson Aracelis Maydee. "Dialectal Allophonic Variation in L2 Pronunciation." OpenSIUC, 2012. https://opensiuc.lib.siu.edu/theses/783.
Full textJande, Per-Anders. "Modelling Phone-Level Pronunciation in Discourse Context." Doctoral thesis, Stockholm : Department of Speech, Music and Hearing, Computer Science and Communication, Kungliga Tekniska högskolan (KTH), 2006. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-4202.
Full textAarinen, J. (Jenni). "Teaching and learning English pronunciation in Finland." Bachelor's thesis, University of Oulu, 2019. http://jultika.oulu.fi/Record/nbnfioulu-201905111724.
Full textAndersson, Sigrid. "Pronunciation Teaching in the Swedish EFL Classroom." Thesis, Malmö universitet, Fakulteten för lärande och samhälle (LS), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-34572.
Full textLiu, Yi. "Pronunciation modeling for spontaneous mandarin speech recognition /." View Abstract or Full-Text, 2002. http://library.ust.hk/cgi/db/thesis.pl?ELEC%202002%20LIU.
Full textIncludes bibliographical references (leaves 169-177). Also available in electronic version. Access restricted to campus users.
Véliz, Campos Mauricio Enrique. "Pronunciation learning strategy use, aptitude, and their relationship with pronunciation performance of pre-service English language teachers in Chile." Thesis, University of Exeter, 2015. http://hdl.handle.net/10871/17794.
Full textKivistö, de Souza Hanna. "Phonological awareness and pronunciation in a second language." Doctoral thesis, Universitat de Barcelona, 2015. http://hdl.handle.net/10803/393726.
Full textThe objective of this dissertation is to increase knowledge about L2 phonological awareness through three research agendas: to investigate the nature of L2 phonological awareness in adult language learners and its relation to some individual differences, to examine the relationship between L2 phonological awareness and L2 pronunciation, and to create novel language-specific instruments to measure L2 phonological awareness reliably. Research on phonological awareness has focused on L1 literacy acquisition, where it has been understood as the ability to manipulate speech segments. In SLA, phonological awareness has been examined in its explicit dimension. Nevertheless, due to the special nature of L2 speech acquisition, L2 learners are rarely able to elaborate explicitly on aspects of pronunciation. Consequently, the present study advocates that L2 phonological awareness mainly consists of proceduralized knowledge. L1 BraziIian Portuguese learners of English (n=71) were tested on their awareness about the L2 phonological system through three domain-specific (segmental, suprasegmental and phonotactic) tasks. Performance in the L2 phonological awareness tasks was related to the participants' L2 pronunciation (measured with a Foreign Accent Rating Task) and to individual differences in the amount of L2 experience, L2 use and L2 proficiency. Additionally, 19 L1 American English speakers performed the same phonological awareness tasks, enabling comparison between L1 and L2 phonological awareness. The results revealed that L2 learners manifested significantly lower degrees of phonological awareness than L1 speakers. Moreover, L2 phonological awareness explained 32.8% of the variance in L2 pronunciation. As for the individual differences, L2 proficiency explained unique variance in L2 phonological awareness, whereas the role of L2 experience and use remained unsettled. Apart from contributing to our understanding of the nature of L2 phonological awareness, the findings of the present study have important pedagogical implications. Knowing the gaps in a language learner's L2 phonological awareness enables the instructor to bring them to the learner's attention, which in turn could be positively reflected in improved L2 pronunciation. Finally, the instruments developed for the present study are expected to guide further studies on L2 phonological awareness.
Fritzsche, Kenneth H. "Visual feedback for a student learning language pronunciation." Monterey, Calif. : Springfield, Va. : Naval Postgraduate School ; Available from National Technical Information Service, 1997. http://handle.dtic.mil/100.2/ADA336924.
Full textQader, Raheel. "Pronunciation and disfluency modeling for expressive speech synthesis." Thesis, Rennes 1, 2017. http://www.theses.fr/2017REN1S076/document.
Full textIn numerous domains, the usage of synthetic speech is conditioned upon the ability of speech synthesis systems to generate natural and expressive speech. In this frame, we address the problem of expressivity in TTS by incorporating two phenomena with a high impact on speech: pronunciation variants and speech disfluencies. In the first part of this thesis, we present a new pronunciation variant generation method which works by adapting standard i.e., dictionary-based, pronunciations to a spontaneous style. Its strength and originality lie in exploiting a wide range of linguistic, articulatory and acoustic features and to use a probabilistic machine learning framework, namely conditional random fields (CRFs) and language models. Extensive experiments on the Buckeye corpus demonstrate the effectiveness of this approach through objective and subjective evaluations. Listening tests on synthetic speech show that adapted pronunciations are judged as more spontaneous than standard ones, as well as those realized by real speakers. Furthermore, we show that the method can be extended to other adaptation tasks, for instance, to solve the problem of inconsistency between phoneme sequences handled in TTS systems. The second part of this thesis explores a novel approach to automatic generation of speech disfluencies for TTS. Speech disfluencies are one of the most pervasive phenomena in spontaneous speech, therefore being able to automatically generate them is crucial to have more expressive synthetic speech. The proposed approach provides the advantage of generating several types of disfluencies: pauses, repetitions and revisions. To achieve this task, we formalize the problem as a theoretical process, where transformation functions are iteratively composed. We present a first implementation of the proposed process using CRFs and language models, before conducting objective and perceptual evaluations. These experiments lead to the conclusion that our proposition is effective to generate disfluencies, and highlights perspectives for future improvements
Baker, Amanda A. "Pronunciation Pedagogy: Second Language Teacher Cognition and Practice." Digital Archive @ GSU, 2011. http://digitalarchive.gsu.edu/alesl_diss/16.
Full textSoonklang, Tasanawan. "Extending pronunciation by analogy for speech synthesis applications." Thesis, University of Southampton, 2008. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.485539.
Full textLivescu, Karen 1975. "Feature-based pronunciation modeling for automatic speech recognition." Thesis, Massachusetts Institute of Technology, 2005. http://hdl.handle.net/1721.1/34488.
Full textIncludes bibliographical references (p. 131-140).
Spoken language, especially conversational speech, is characterized by great variability in word pronunciation, including many variants that differ grossly from dictionary prototypes. This is one factor in the poor performance of automatic speech recognizers on conversational speech. One approach to handling this variation consists of expanding the dictionary with phonetic substitution, insertion, and deletion rules. Common rule sets, however, typically leave many pronunciation variants unaccounted for and increase word confusability due to the coarse granularity of phone units. We present an alternative approach, in which many types of variation are explained by representing a pronunciation as multiple streams of linguistic features rather than a single stream of phones. Features may correspond to the positions of the speech articulators, such as the lips and tongue, or to acoustic or perceptual categories. By allowing for asynchrony between features and per-feature substitutions, many pronunciation changes that are difficult to account for with phone-based models become quite natural. Although it is well-known that many phenomena can be attributed to this "semi-independent evolution" of features, previous models of pronunciation variation have typically not taken advantage of this. In particular, we propose a class of feature-based pronunciation models represented as dynamic Bayesian networks (DBNs).
(cont.) The DBN framework allows us to naturally represent the factorization of the state space of feature combinations into feature-specific factors, as well as providing standard algorithms for inference and parameter learning. We investigate the behavior of such a model in isolation using manually transcribed words. Compared to a phone-based baseline, the feature-based model has both higher coverage of observed pronunciations and higher recognition rate for isolated words. We also discuss the ways in which such a model can be incorporated into various types of end-to-end speech recognizers and present several examples of implemented systems, for both acoustic speech recognition and lipreading tasks.
by Karen Livescu.
Ph.D.
Lee, Ann Ph D. Massachusetts Institute of Technology. "Language-independent methods for computer-assisted pronunciation training." Thesis, Massachusetts Institute of Technology, 2016. http://hdl.handle.net/1721.1/107338.
Full textCataloged from PDF version of thesis.
Includes bibliographical references (pages 137-145).
Computer-assisted pronunciation training (CAPT) systems help students practice speaking foreign languages by providing automatic pronunciation assessment and corrective feedback. Automatic speech recognition (ASR) technology is a natural component in CAPT systems. Since a nonnative speaker's native language (Li) background affects their pronunciation patterns in a target language (L2), typically not only native but also nonnative training data of specific Ls is needed to train a recognizer for CAPT systems. Given that there are around 7,000 languages in the world, the data collection process is costly and has scalability issues. In addition, expert knowledge on the target L2 is also often needed to design a large feature set describing the deviation of nonnative speech from native speech. In contrast to machines, it is relatively easy for native listeners to detect pronunciation errors without being exposed to nonnative speech or trained with linguistic knowledge beforehand. In this thesis, we are interested in this unsupervised capability and propose methods to overcome the language-dependent challenges. Inspired by the success of unsupervised acoustic pattern discovery, we propose to discover an individual learner's pronunciation error patterns in an unsupervised manner by analyzing the acoustic similarity between speech segments from the learner. Experimental results on nonnative English and nonnative Mandarin Chinese spoken by students from different Ls show that the proposed method is Li-independent and can be portable to different L2s. Moreover, the method is personalized such that it accommodates variations in pronunciation patterns across students. In addition, motivated by the success of deep learning models in unsupervised feature learning, we explore the use of convolutional neural networks (CNNs) for mispronunciation detection. A language-independent data augmentation method is developed to take advantage of native speech as training samples. Experimental results on nonnative Mandarin Chinese speech show the effectiveness of the model and the method. Moreover, both qualitative and quantitative analyses on the convolutional filters reveal that the CNN automatically learns a set of human-interpretable high-level features.
by Ann Lee.
Ph. D.
Bagshaw, Paul Christopher. "Automatic prosodic analysis for computer aided pronunciation teaching." Thesis, University of Edinburgh, 1994. http://hdl.handle.net/1842/10694.
Full textYates, Karen. "Teaching linguistic mimicry to improve second language pronunciation." Thesis, University of North Texas, 2003. https://digital.library.unt.edu/ark:/67531/metadc4164/.
Full textWen, Tao-Chih. "The Role of Motivation in Second Language Pronunciation." Thesis, University of North Texas, 2005. https://digital.library.unt.edu/ark:/67531/metadc4829/.
Full textBinturki, Turki A. "Analysis of pronunciation errors of Saudi ESL learners /." Available to subscribers only, 2008. http://proquest.umi.com/pqdweb?did=1594494161&sid=6&Fmt=2&clientId=1509&RQT=309&VName=PQD.
Full text"Department of Teaching English to Speakers of Other Languages." Includes bibliographical references (p. 74-80). Also available online.
Ma, Rui. "The Role of Pronunciation in Speaking Test Ratings." BYU ScholarsArchive, 2015. https://scholarsarchive.byu.edu/etd/4426.
Full textTegnered, Axel, and Jonas Rentner. "Teachers’ Views on Teaching English Pronunciation : A Phenomenographic Study of Upper-secondary Teachers’ Views and Reported Practices." Thesis, Linköpings universitet, Institutionen för kultur och samhälle, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-177896.
Full textБондаренко, Юлия Станиславовна, Алена Владимировна Сауляк, Юлія Станіславівна Бондаренко, Yuliia Stanislavivna Bondarenko, Alona Yevhenivna Sauliak, and Альона Євгенівна Сауляк. "Estuary as the latest pronunciation standard of the English language." Thesis, Нижний Новгород: Издательство Волго-Вятской академии государственной службы, 2010. http://essuir.sumdu.edu.ua/handle/123456789/2749.
Full textУ статті розглянуто вимовний варіант англійської мови Ест’юарі, подано опис фонемічної варіативності цієї вимовної норм та популярність цього вимовного стандарту. При цитуванні документа, використовуйте посилання http://essuir.sumdu.edu.ua/handle/123456789/2749
Umezawa, Kaoru. "Japanese pitch accent and the English speaking learner : a study of production, perception and teaching." Thesis, University College London (University of London), 2001. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.251751.
Full textÖstlund, Fredrik. "British vs American English : Pronunciation in the EFL Classroom." Thesis, Karlstad University, Division for Culture and Communication, 2006. http://urn.kb.se/resolve?urn=urn:nbn:se:kau:diva-31.
Full textToday English is a world language; it is spoken by millions both as first and second language almost all over the world. The varieties best known to Swedish pupils are the varieties British and American English. Another variety of English, which is spoken by both native and non-native speakers, is a mixture of British English and American English called Mid-Atlantic English. As long as the English language has been a part of the Swedish curriculum, the leading variety taught has been British English, but lately American English has influenced Swedish teenagers because of its prominent status in media. Since both British English and American English are used in Swedish schools, different attitudes can be perceived among pupils and teachers towards these two varieties. The aim of this paper is to determine if Swedish pupils are using British or American English or if they mix these two varieties. Attitudes and prejudice amongst pupils and their teachers towards these two varieties are looked into as well as whether the pupils speak the variety of English they claim they speak. The question of why the pupils speak the variety they do is also investigated. The results show that most pupils mix British and American English and that American English features predominate in the mix. According to this investigation, teachers and pupils find British English to be a bit “snobbish” while American English can sound a bit “cocky” to them. This investigation concludes that the two major influences on the pupils are their teachers and different kind of media.
Snow, Charles. "Improving continuous speech recognition with automatic multiple pronunciation support." Thesis, National Library of Canada = Bibliothèque nationale du Canada, 1998. http://www.collectionscanada.ca/obj/s4/f2/dsk1/tape10/PQDD_0020/NQ44592.pdf.
Full textSnow, Charles. "Improving continuous speech recognition with automatic multiple pronunciation support." Thesis, McGill University, 1997. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=35621.
Full textPeabody, Mitchell A. (Mitchell Aaron). "Methods for pronunciation assessment in computer aided language learning." Thesis, Massachusetts Institute of Technology, 2011. http://hdl.handle.net/1721.1/68491.
Full textCataloged from PDF version of thesis.
Includes bibliographical references (p. 149-176).
Learning a foreign language is a challenging endeavor that entails acquiring a wide range of new knowledge including words, grammar, gestures, sounds, etc. Mastering these skills all require extensive practice by the learner and opportunities may not always be available. Computer Aided Language Learning (CALL) systems provide non-threatening environments where foreign language skills can be practiced where ever and whenever a student desires. These systems often have several technologies to identify the different types of errors made by a student. This thesis focuses on the problem of identifying mispronunciations made by a foreign language student using a CALL system. We make several assumptions about the nature of the learning activity: it takes place using a dialogue system, it is a task- or game-oriented activity, the student should not be interrupted by the pronunciation feedback system, and that the goal of the feedback system is to identify severe mispronunciations with high reliability. Detecting mispronunciations requires a corpus of speech with human judgements of pronunciation quality. Typical approaches to collecting such a corpus use an expert phonetician to both phonetically transcribe and assign judgements of quality to each phone in a corpus. This is time consuming and expensive. It also places an extra burden on the transcriber. We describe a novel method for obtaining phone level judgements of pronunciation quality by utilizing non-expert, crowd-sourced, word level judgements of pronunciation. Foreign language learners typically exhibit high variation and pronunciation shapes distinct from native speakers that make analysis for mispronunciation difficult. We detail a simple, but effective method for transforming the vowel space of non-native speakers to make mispronunciation detection more robust and accurate. We show that this transformation not only enhances performance on a simple classification task, but also results in distributions that can be better exploited for mispronunciation detection. This transformation of the vowel is exploited to train a mispronunciation detector using a variety of features derived from acoustic model scores and vowel class distributions. We confirm that the transformation technique results in a more robust and accurate identification of mispronunciations than traditional acoustic models.
by Mitchell A. Peabody.
Ph.D.
Peng, Li. "Embodied pronunciation training : the benefits of visuospatial hand gestures." Doctoral thesis, Universitat Pompeu Fabra, 2021. http://hdl.handle.net/10803/672644.
Full textAl llarg de les últimes dècades s’ha demostrat que emprar gestos manuals que fan visibles els aspectes fonològics d’una llengua estrangera afavoreix l’aprenentatge de la prosòdia d’aquesta llengua. No obstant això, hi ha menys estudis sobre l’efectivitat d’aquests gestos en l’adquisició de nous sons. Aquesta tesi doctoral inclou tres estudis experimentals que tenen per objectiu avaluar els efectes dels entrenaments multimodals de la pronúncia que inclouen aquests moviments manuals com a gestos pedagògics per a l’entrenament dels trets fonètics . L’estudi 1 demostra que emprar gestos que codifiquen trets fonètics de duració dels sons (i.e., moviments horitzontals de les mans que il·lustren aquests contrastos de durada vocàlica) millora la producció de les vocals llargues del japonès per part d’aprenents novells. L’estudi 2 mostra que una realització adequada dels gestos que imiten els trets d’aspiració consonàntica facilita l’aprenentatge de les consonants oclusives aspirades del xinès per part de nous aprenents. L’estudi 3 demostra que un entrenament multimodal que integri gestos manuals que facin visibles els trets prosòdics del francès ajuda els aprenents d’aquesta llengua a reduir el seu accent i alhora augmentar la precisió en la pronúncia de les vocals arrodonides anteriors. En resum, els tres estudis mostren els beneficis de la pràctica multimodal de la pronúncia amb exercicis que incloguin gestos manuals que codifiquen informació fonològica a nivell segmental i suprasegmental. Els resultats ressalten la importància d’incorporar entrenaments multimodals de la pronúncia en l’aula de llengües estrangeres i donen suport a les prediccions del paradigma de la Cognició Corporeïtzada (Embodied Cognition) sobre l’aprenentatge fonològic de segones llengües.
A lo largo de las últimas décadas, se ha demostrado que el uso de gestos manuales que visualizan aspectos fonológicos de una segunda lengua facilita el aprendizaje de la prosodia de esta lengua. No obstante, hay menos estudios sobre la efectividad de esos gestos en la adquisición de nuevos sonidos. Esta tesis doctoral incluye tres estudios experimentales que tienen por objetivo evaluar los efectos de entrenamientos multimodales de la pronunciación que incluyen esos movimientos manuales como gestos pedagógicos de los rasgos fonéticos. El estudio 1 demuestra que el uso de gestos que codifican rasgos fonéticos de duración de los sonidos (i.e., movimientos horizontales de las manos que ilustran los contrastes de duración vocálica) mejora la producción de las vocales largas del japonés por parte de nuevos aprendices. El estudio 2 muestra que una realización adecuada de los gestos que imitan los rasgos de aspiración consonántica facilita el aprendizaje de las con-sonantes oclusivas aspiradas del chino por parte de estudiantes principiantes. El estudio 3 demuestra que un entrenamiento multimodal que incluye el uso de gestos manuales que codifican los rasgos prosódicos del francés ayuda a los estudiantes de esta lengua a reducir su acento y aumentar la precisión en la pronunciación de las vocales labializadas anteriores. En resumen, los tres estudios muestran los beneficios de la práctica multimodal de la pronunciación con ejercicios que incluyan gestos manuales que codifican información fonológica a nivel segmental y suprasegmental. Los resultados resaltan la importancia de incorporar entrenamientos multimodales en el aula de lenguas extranjeras y apoyan las predicciones del paradigma de la Cognición Corporeizada (Embodied Cognition) en el contexto del aprendizaje fonológico de segundas lenguas.
Tikkakoski, S. (Saara). "Communicative language teaching as English pronunciation teaching method:developing exercises." Bachelor's thesis, University of Oulu, 2016. http://urn.fi/URN:NBN:fi:oulu-201602031107.
Full textHARRIS, DAWN FAIRLEY. "THE EFFECTS OF TRAINING ON THE PRONUNCIATION OF MANDARIN." University of Cincinnati / OhioLINK, 2002. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1022165161.
Full textCruz, Neide de Fátima Cesar da. "Pronunciation intelligibility in spontaneous speech of brazilian learners englisg." Florianópolis, SC, 2004. http://repositorio.ufsc.br/xmlui/handle/123456789/86878.
Full textMade available in DSpace on 2012-10-21T11:37:25Z (GMT). No. of bitstreams: 0
The main aim of this study is to find out the extent to which features of
Tegnered, Axel, and Jonas Rentner. "The Importance of Pronunciation Instruction in the English as a Foreign Language Classroom." Thesis, Linköpings universitet, Institutionen för kultur och samhälle, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-167481.
Full textTaylor, Sean D. "A Musician's Guide to Latin Diction in Nineteenth and Twentieth Century Choral Repertoire." University of Cincinnati / OhioLINK, 2013. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1368026296.
Full textWesterberg, Ann-Britt. "Med fokus på uttalet : Elever lär tilsammans." Thesis, Karlstads universitet, Estetisk-filosofiska fakulteten, 2012. http://urn.kb.se/resolve?urn=urn:nbn:se:kau:diva-14234.
Full text鄭少玲 and Siu-ling Flona Cheng. "A study of variant readings of Chinese characters labeledas mispronounced in the Yueyin zhengdu zihui." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2008. http://hub.hku.hk/bib/B40733968.
Full text李素琴 and So-kam Lee. "A comparison of Cantonese Transcriptions in Guangzhouhua zhengyin zidian and Changyongzi guangzhouhua duyinbiao." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2008. http://hub.hku.hk/bib/B40736982.
Full text廖雲娟 and Wan-kuen Liu. "A comparison of the Cantonese transcriptions in Guangzhouhua zhengyin zidian and Yueyin zhengdu shouce." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2008. http://hub.hku.hk/bib/B41547093.
Full text