Dissertations / Theses on the topic 'Temporal Representation in speech'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Temporal Representation in speech.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Davies, David Richard Llewellyn, and dave davies@canberra edu au. "Representing Time in Automated Speech Recognition." The Australian National University. Research School of Information Sciences and Engineering, 2003. http://thesis.anu.edu.au./public/adt-ANU20040602.163031.
Full textLeach, Corinne. "MANIPULATING TEMPORAL COMPONENTS DURING SINGLE-WORD PROCESSING TO FACILITATE ACCESS TO STORED ORTHOGRAPHIC REPRESENTATIONS IN LETTER-BY-LETTER READERS." Master's thesis, Temple University Libraries, 2019. http://cdm16002.contentdm.oclc.org/cdm/ref/collection/p245801coll10/id/574233.
Full textM.A.
This study investigated the benefits of rapid presentation of written words as a treatment strategy to enhance reading speed and accuracy in two participants with acquired alexia who are letter-by-letter readers. Previous studies of pure alexia have shown that when words are rapidly presented, participants can accurately perform lexical decision and category judgment tasks, yet they are unable to read words aloud. These studies suggest that rapid presentation of words could be used as a treatment technique to promote whole-word reading. It was predicted that treatment utilizing rapid presentation (250/500 ms) will increase reading speed and accuracy of both trained and untrained words compared to the words trained in standard presentation (5000 ms). A single-subject ABACA/ACABA multiple baseline treatment design was used. Treatment was provided twice per week for four weeks for both rapid and standard presentation treatment. Each session comprised a spoken-to-written word decision task and semantic category judgment task. Stimuli included 80 trained words divided between the two treatments and 20 untrained controls. Weekly probes to assess reading accuracy were administered after every two treatment sessions. Based on effect sizes, results showed no consistent unambiguous benefit for rapid or standard presentation treatment. However, possible generalization to untrained words due to rapid presentation treatment was observed. Future research is warranted to investigate the effectiveness of rapid presentation treatment in letter-by-letter readers.
Temple University--Theses
Hernandez, Sierra Gabriel. "Métodos de representación y verificación del locutor con independencia del texto." Thesis, Avignon, 2014. http://www.theses.fr/2014AVIG0203/document.
Full textText-independent automatic speaker recognition is a recent method in biometric area. Its increasing interest is reflected both in the increasing participation in international competitions and in the performance progresses. Moreover, the accuracy of the methods is still limited by the quantity of speaker discriminant information contained in the representations of speech utterances. This thesis presents a study on speech representation for speaker recognition systems. It shows firstly two main weaknesses. First, it fails to take into account the temporal behavior of the voice, which is known to contain speaker discriminant information. Secondly, speech events rare in a large population of speakers although very present for a given speaker are hardly taken into account by these approaches, which is contradictory when the goal is to discriminate among speakers.In order to overpass these limitations, we propose in this thesis a new speech representation for speaker recognition. This method represents each acoustic vector in a a binary space which is intrinsically speaker discriminant. A similarity measure associated with a global representation (cumulative vectors) is also proposed. This new speech utterance representation is able to represent infrequent but discriminant events and to work on temporal information. It allows also to take advantage of existing « session » variability compensation approaches (« session » variability represents all the negative variability factors). In this area, we proposed also several improvements to the usual session compensation algorithms. An original solution to deal with the temporal information inside the binary speech representation was also proposed. Thanks to a linear fusion approach between the two sources of information, we demonstrated the complementary nature of the temporal information versus the classical time independent representations
El reconocimiento automático del locutor independiente del texto, es un método dereciente incorporación en los sistemas biométricos. El desarrollo y auge del mismo serefleja en las competencias internacionales, pero aun la eficacia de los métodos de reconocimientose encuentra afectada por la cantidad de información discriminatoria dellocutor que esta presente en las representaciones actuales de las expresiones de voz.En esta tesis se realizó un estudio donde se identificaron dos principales debilidadespresentes en las representaciones actuales del locutor. En primer lugar, no se tiene encuenta el comportamiento temporal de la voz, siendo este un rasgo discriminatorio dellocutor y en segundo lugar los eventos pocos frecuentes dentro de una población delocutores pero frecuentes en un locutor dado, apenas son tenidos en cuenta por estosenfoques, lo cual es contradictorio cuando el objetivo es discriminar los locutores. Motivadopor la solución de estos problemas, se confirmó la redundancia de informaciónexistente en las representaciones actuales y la necesidad de emplear nuevas representacionesde las expresiones de voz. Se propuso un nuevo enfoque con el desarrollo de unmétodo para la obtención de un modelo generador capaz de transformar la representación actual del espacio acústico a una representación en un espacio binario, dondese propuso una medida de similitud asociada con una representación global (vectoracumulativo) que contiene tanto los eventos frecuentes como los pocos frecuentes enuna expresión de voz. Para la compensación de la variabilidad de sesión se incorporóen la matriz de dispersión intra-clase, la información común de la población de locutores,lo que implicó la modificación de tres algoritmos de la literatura que mejoraronsu desempeño respecto a la eficacia en el reconocimiento del locutor, tanto utilizandoel nuevo enfoque propuesto como el enfoque actual de referencia. La información temporalexistente en las expresiones de voz fue capturada e incorporada en una nuevarepresentación, mejorando aun más la eficacia del enfoque propuesto. Finalmente sepropuso y evaluó una fusión lineal entre los dos enfoques que demostró la informacióncomplementaria existente entre ellos, obteniéndose los mejores resultados de eficaciaen el reconocimiento del locutor
Sun, Felix (Felix W. ). "Speech Representation Models for Speech Synthesis and Multimodal Speech Recognition." Thesis, Massachusetts Institute of Technology, 2016. http://hdl.handle.net/1721.1/106378.
Full textThis electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
Cataloged from student-submitted PDF version of thesis.
Includes bibliographical references (pages 59-63).
The field of speech recognition has seen steady advances over the last two decades, leading to the accurate, real-time recognition systems available on mobile phones today. In this thesis, I apply speech modeling techniques developed for recognition to two other speech problems: speech synthesis and multimodal speech recognition with images. In both problems, there is a need to learn a relationship between speech sounds and another source of information. For speech synthesis, I show that using a neural network acoustic model results in a synthesizer that is more tolerant of noisy training data than previous work. For multimodal recognition, I show how information from images can be effectively integrated into the recognition search framework, resulting in improved accuracy when image data is available.
by Felix Sun.
M. Eng.
Mansfield, Rachel. "Temporal Abstract Behavioral Representation Model." Honors in the Major Thesis, University of Central Florida, 2007. http://digital.library.ucf.edu/cdm/ref/collection/ETH/id/1181.
Full textBachelors
Engineering and Computer Science
Electrical Engineering
Howard, John Graham. "Temporal aspects of auditory-visual speech and non-speech perception." Thesis, University of Reading, 2001. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.553127.
Full textPayne, Nicole, and Saravanan Elangovan. "Musical Training Influences Temporal Processing of Speech and Non-Speech Contrasts." Digital Commons @ East Tennessee State University, 2012. https://dc.etsu.edu/etsu-works/1565.
Full textSchramm, Cheryl (Cheryl Joanne) Carleton University Dissertation Engineering Electrical. "A temporal representation for multimedia radiological reports." Ottawa, 1989.
Find full textIgualada, Pérez Alfonso. "Gesture-speech temporal integration in language development." Doctoral thesis, Universitat Pompeu Fabra, 2017. http://hdl.handle.net/10803/670094.
Full textWlodarczak, Marcin [Verfasser]. "Temporal entrainment in overlapping speech / Marcin Wlodarczak." Bielefeld : Universitätsbibliothek Bielefeld, 2014. http://d-nb.info/1047666359/34.
Full textWarren, P. "The temporal organisation and perception of speech." Thesis, University of Cambridge, 1985. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.355053.
Full textBrown, Stephanie Danielle. "Speech-in-Speech Recognition: Understanding the Effect of Different Talker Maskers." Case Western Reserve University School of Graduate Studies / OhioLINK, 2019. http://rave.ohiolink.edu/etdc/view?acc_num=case1556649651028033.
Full textMesgarani, Nima. "Discrimination of speech from non-speech based on multiscale spectro-temporal modulations." College Park, Md. : University of Maryland, 2005. http://hdl.handle.net/1903/3044.
Full textThesis research directed by: Dept. of Electrical and Computer Engineering. Title from t.p. of PDF. Includes bibliographical references. Published by UMI Dissertation Services, Ann Arbor, Mich. Also available in paper.
Theron, Karin. "Temporal aspects of speech production in bilingual speakers with neurogenic speech disorders." Diss., Pretoria : [s.n.], 2003. http://upetd.up.ac.za/thesis/available/etd-08072003-152242.
Full textPayne, N., Saravanan Elangovan, and Jacek Smurzynski. "Auditory Temporal Processing of Speech and Non-speech Contrasts in Specialized Listeners." Digital Commons @ East Tennessee State University, 2012. https://dc.etsu.edu/etsu-works/2216.
Full textHostetter, Michael. "Analogical representation in temporal, spatial, and mnemonic reasoning." Thesis, This resource online, 1990. http://scholar.lib.vt.edu/theses/available/etd-03242009-040545/.
Full textHamilton, A. C. J. "Parameters of the analytic vector representation of speech." Thesis, University of Canterbury. Electrical Engineering, 1985. http://hdl.handle.net/10092/5833.
Full textEide, Ellen Marie. "A linguistic feature representation of the speech waveform." Thesis, Massachusetts Institute of Technology, 1993. http://hdl.handle.net/1721.1/12510.
Full textIncludes bibliographical references (leaves 95-97).
by Ellen Marie Eide.
Ph.D.
Karnebäck, Stefan. "Spectro-temporal properties of the acoustic speech signal used for speech/music discrimination /." Stockholm : Department of Speech, Music and Hearing, Royal Institute of Technology, 2004. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-501.
Full textDomont, Xavier. "Hierarchical spectro-temporal features for robust speech recognition." Münster Verl.-Haus Monsenstein und Vannerdat, 2009. http://d-nb.info/1001282655/04.
Full textKleinschmidt, Michael. "Robust speech recognition based on spectro-temporal processing." [S.l. : s.n.], 2002. http://deposit.ddb.de/cgi-bin/dokserv?idn=965610276.
Full textShatzer, Hannah Elizabeth. "Visual and Temporal Influences on Multimodal Speech Integration." The Ohio State University, 2015. http://rave.ohiolink.edu/etdc/view?acc_num=osu1437403560.
Full textKirchhuebel, Christin. "The acoustic and temporal characteristics of deceptive speech." Thesis, University of York, 2013. http://etheses.whiterose.ac.uk/4790/.
Full textHaas, B., Jacek Smurzynski, and Marc A. Fagelson. "Temporal Processing in Patients with Tinnitus." Digital Commons @ East Tennessee State University, 2011. https://dc.etsu.edu/etsu-works/1642.
Full textParkinson, Jon. "Representation learning with a temporally coherent mixed-representation." Thesis, University of Manchester, 2017. https://www.research.manchester.ac.uk/portal/en/theses/representation-learning-with-a-temporally-coherent-mixedrepresentation(ba48bd9e-80ed-4d37-b743-cb149bc498ee).html.
Full textNilsson, Mattias. "Entropy and Speech." Doctoral thesis, Stockholm : Sound and Image Processing Laboratory, School of Electrical Engineering, Royal Institute of Technology, 2006. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-3990.
Full textViera, Gerardo. "Time in mind : the cognitive science of temporal representation." Thesis, University of British Columbia, 2016. http://hdl.handle.net/2429/59975.
Full textArts, Faculty of
Graduate
Shunmugam, Tamindran. "Adoption of a visual model for temporal database representation." Master's thesis, University of Cape Town, 2016. http://hdl.handle.net/11427/20875.
Full textQuick, Donya. "Applications and parameter analysis of temporal chaos game representation." Ann Arbor, Mich. : ProQuest, 2008. http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqdiss&rft_dat=xri:pqdiss:1454350.
Full textTitle from PDF title page (viewed Mar. 16, 2009). Source: Masters Abstracts International, Volume: 47-01, page: 0419. Adviser: Margaret H. Dunham. Includes supplementary digital materials in a .zip file available from ProQuest website. Includes bibliographical references.
Alexopoulos, Kyriakos. "Phase spectral representation for low bit rate speech coding." Thesis, Imperial College London, 2001. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.249314.
Full textStrange, John. "VOICE AUTHENTICATIONA STUDY OF POLYNOMIAL REPRESENTATION OF SPEECH SIGNALS." Master's thesis, University of Central Florida, 2005. http://digital.library.ucf.edu/cdm/ref/collection/ETD/id/4015.
Full textM.S.
Department of Mathematics
Arts and Sciences
Mathematics
Rusnak, John Joseph 1975. "Pitch period segmentation and spectral representation of speech signals." Thesis, Massachusetts Institute of Technology, 1999. http://hdl.handle.net/1721.1/80641.
Full textIncludes bibliographical references (leaves 87-88).
by John Joseph Rusnak, Junior.
M.Eng.
Lee, Chia-ying (Chia-ying Jackie). "Closed-loop auditory-based representation for robust speech recognition." Thesis, Massachusetts Institute of Technology, 2010. http://hdl.handle.net/1721.1/60176.
Full textIncludes bibliographical references (p. 93-96).
A closed-loop auditory based speech feature extraction algorithm is presented to address the problem of unseen noise for robust speech recognition. This closed-loop model is inspired by the possible role of the medial olivocochlear (MOC) efferent system of the human auditory periphery, which has been suggested in [6, 13, 42] to be important for human speech intelligibility in noisy environment. We propose that instead of using a fixed filter bank, the filters used in a feature extraction algorithm should be more flexible to adapt dynamically to different types of background noise. Therefore, in the closed-loop model, a feedback mechanism is designed to regulate the operating points of filters in the filter bank based on the background noise. The model is tested on a dataset created from TIDigits database. In this dataset, five kinds of noise are added to synthesize noisy speech. Compared with the standard MFCC extraction algorithm, the proposed closed-loop form of feature extraction algorithm provides 9.7%, 9.1% and 11.4% absolution word error rate reduction on average for three kinds of filter banks respectively.
by Chia-ying Lee.
S.M.
Stuttle, Matthew Nicholas. "A gaussian mixture model spectral representation for speech recognition." Thesis, University of Cambridge, 2003. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.620077.
Full textElangovan, Saravanan, Nicole Payne, Jacek Smurzynski, and Marc A. Fagelson. "Musical Training Influences Auditory Temporal Processing." Digital Commons @ East Tennessee State University, 2016. https://dc.etsu.edu/etsu-works/1551.
Full textElangovan, Saravanan, and Andrew Stuart. "Auditory Temporal Processing in the Perception of Voicing." Digital Commons @ East Tennessee State University, 2006. https://dc.etsu.edu/etsu-works/1559.
Full textMesgarani, Nima. "Representation of speech in the primary auditory cortex and its implications for robust speech processing." College Park, Md.: University of Maryland, 2008. http://hdl.handle.net/1903/8586.
Full textThesis research directed by: Dept. of Electrical and Computer Engineering. Title from t.p. of PDF. Includes bibliographical references. Published by UMI Dissertation Services, Ann Arbor, Mich. Also available in paper.
Moghimi, Amir Reza. "Array-based Spectro-temporal Masking For Automatic Speech Recognition." Research Showcase @ CMU, 2014. http://repository.cmu.edu/dissertations/334.
Full textTarr, Eric William. "Processing Perceptually Important Temporal and Spectral Characteristics of Speech." The Ohio State University, 2013. http://rave.ohiolink.edu/etdc/view?acc_num=osu1376913300.
Full textEl-Geresy, Baher. "Qualitative representation and reasoning for spatial and spatio-temporal systems." Thesis, University of South Wales, 2004. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.403330.
Full textLucke, Helmut. "On the representation of temporal data for connectionist word recognition." Thesis, University of Cambridge, 1991. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.239520.
Full textCiotti, Giovanni. "The representation of Sanskrit speech-sounds : philological and linguistic historiographies." Thesis, University of Cambridge, 2013. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.608079.
Full textWilli, Megan, and Megan Willi. "The Perceptual Significance of a Relative Acoustic Representation of Speech." Diss., The University of Arizona, 2017. http://hdl.handle.net/10150/624495.
Full textWang, Yao Electrical Engineering & Telecommunications Faculty of Engineering UNSW. "Single channel speech enhancement based on perceptual temporal masking model." Awarded by:University of New South Wales. Electrical Engineering & Telecommunications, 2007. http://handle.unsw.edu.au/1959.4/40454.
Full textMonteiro, Axel. "Spatial and temporal replication in visual and audiovisual speech recognition." Thesis, University of Nottingham, 2003. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.410421.
Full textDavis, Samantha N. "Assessing Temporal Compensation of Speech due to Delayed Auditory Feedback." Ohio University Honors Tutorial College / OhioLINK, 2017. http://rave.ohiolink.edu/etdc/view?acc_num=ouhonors1492779500766271.
Full textTodman, Christopher Derek. "The representation of time in data warehouses." Thesis, Open University, 1999. http://oro.open.ac.uk/58004/.
Full textDossoumon, Mafoya. "Class and Gender Representation in Nollywood Movies." Thesis, Southern Illinois University at Edwardsville, 2014. http://pqdtopen.proquest.com/#viewpdf?dispub=1549813.
Full textThis study examines class and gender representations in Nollywood films through textual analysis of a sample of films retrieved from the website of the largest Nollywood streaming service, irokoTV. The study investigates patterns in class and gender representations in terms of similarities in portrayals, instances of stereotypes, and value assumptions in terms of who has power by answering the following questions: (1) What class stereotypes are portrayed in Nollywood films? (2) What gender stereotypes are portrayed in Nollywood films? (3) What hegemonic ideas of power are portrayed in Nollywood films as a result of class and gender representations? The study uses an exposure approach to select a sample of convenience of the top 5 films most attended to by the audience on iROKOtv and relies on close reading and a distancing technique called the "commutation test" to discuss the meaning of class and gender representations in the films. Findings indicate that even when they appear to subvert dominant ideologies, the films still reinforce long established societal norms about the importance of wealth and female gender stereotypes such as submissiveness in domestic households. The tales are often aspirational but the films lack grand ideological narratives to make them relevant to social transformation. These findings support Stuart Hall's Theory of Ideology which allows for a subversive agenda in media texts while retaining the flexibility needed to critique connections between dominant ideologies and social practices and structures.
Hasselmo, M. E. "The representation and storage of visual information in the temporal lobe." Thesis, University of Oxford, 1987. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.379950.
Full textPrendergast, Garreth. "The representation of temporal dynamics : psychophysics, neural activity and natural sounds." Thesis, University of York, 2008. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.495906.
Full text