Dissertations / Theses on the topic 'Audio speaker'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Audio speaker.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Khan, Faheem. "Audio-visual speaker separation." Thesis, University of East Anglia, 2016. https://ueaeprints.uea.ac.uk/59679/.
Full textKwon, Patrick (Patrick Ryan) 1975. "Speaker spotting : automatic annotation of audio data with speaker identity." Thesis, Massachusetts Institute of Technology, 1998. http://hdl.handle.net/1721.1/47608.
Full textSeymour, R. "Audio-visual speech and speaker recognition." Thesis, Queen's University Belfast, 2008. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.492489.
Full textMalegaonkar, Amit. "Speaker-based indexation of conversational audio." Thesis, University of Hertfordshire, 2006. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.440175.
Full textD'Arca, Eleonora. "Speaker tracking in a joint audio-video network." Thesis, Heriot-Watt University, 2015. http://hdl.handle.net/10399/2972.
Full textLathe, Andrew. "Speaker Prototyping Design." Digital Commons @ East Tennessee State University, 2020. https://dc.etsu.edu/honors/584.
Full textMartí, Guerola Amparo. "Multichannel audio processing for speaker localization, separation and enhancement." Doctoral thesis, Universitat Politècnica de València, 2013. http://hdl.handle.net/10251/33101.
Full textMartí Guerola, A. (2013). Multichannel audio processing for speaker localization, separation and enhancement [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/33101
TESIS
Lucey, Simon. "Audio-visual speech processing." Thesis, Queensland University of Technology, 2002. https://eprints.qut.edu.au/36172/7/SimonLuceyPhDThesis.pdf.
Full textAbdelraheem, Mahmoud Fakhry Mahmoud. "Exploiting spatial and spectral information for audio source separation and speaker diarization." Doctoral thesis, University of Trento, 2016. http://eprints-phd.biblio.unitn.it/1876/1/PhD_Thesis.pdf.
Full textDean, David Brendan. "Synchronous HMMs for audio-visual speech processing." Thesis, Queensland University of Technology, 2008. https://eprints.qut.edu.au/17689/3/David_Dean_Thesis.pdf.
Full textDean, David Brendan. "Synchronous HMMs for audio-visual speech processing." Queensland University of Technology, 2008. http://eprints.qut.edu.au/17689/.
Full textAlmaadeed, Noor. "Evaluation and analysis of hybrid intelligent pattern recognition techniques for speaker identification." Thesis, Brunel University, 2014. http://bura.brunel.ac.uk/handle/2438/8760.
Full textRaghunathan, Anusha. "EVALUATION OF INTELLIGIBILITY AND SPEAKER SIMILARITY OF VOICE TRANSFORMATION." UKnowledge, 2011. http://uknowledge.uky.edu/gradschool_theses/101.
Full textKrishnan, Ravikiran. "Detecting Group Turns of Speaker Groups in Meeting Room Conversations Using Audio-Video Change Scale-Space." Scholar Commons, 2010. http://scholarcommons.usf.edu/etd/3644.
Full textSoldi, Giovanni. "Diarisation du locuteur en temps réel pour les objets intelligents." Electronic Thesis or Diss., Paris, ENST, 2016. http://www.theses.fr/2016ENST0061.
Full textOn-line speaker diarization aims to detect “who is speaking now" in a given audio stream. The majority of proposed on-line speaker diarization systems has focused on less challenging domains, such as broadcast news and plenary speeches, characterised by long speaker turns and low spontaneity. The first contribution of this thesis is the development of a completely unsupervised adaptive on-line diarization system for challenging and highly spontaneous meeting data. Due to the obtained high diarization error rates, a semi-supervised approach to on-line diarization, whereby speaker models are seeded with a modest amount of manually labelled data and adapted by an efficient incremental maximum a-posteriori adaptation (MAP) procedure, is proposed. Obtained error rates may be low enough to support practical applications. The second part of the thesis addresses instead the problem of phone normalisation when dealing with short-duration speaker modelling. First, Phone Adaptive Training (PAT), a recently proposed technique, is assessed and optimised at the speaker modelling level and in the context of automatic speaker verification (ASV) and then is further developed towards a completely unsupervised system using automatically generated acoustic class transcriptions, whose number is controlled by regression tree analysis. PAT delivers significant improvements in the performance of a state-of-the-art iVector ASV system even when accurate phonetic transcriptions are not available
Unnikrishnan, Harikrishnan. "AUDIO SCENE SEGEMENTATION USING A MICROPHONE ARRAY AND AUDITORY FEATURES." UKnowledge, 2010. http://uknowledge.uky.edu/gradschool_theses/622.
Full textMiller, William H. "Analog Implementation of DVM and Farrow Filter Based Beamforming Algorithms for Audio Frequencies." University of Akron / OhioLINK, 2018. http://rave.ohiolink.edu/etdc/view?acc_num=akron1531951902410037.
Full textLeis, John W. "Spectral coding methods for speech compression and speaker identification." Thesis, Queensland University of Technology, 1998. https://eprints.qut.edu.au/36062/7/36062_Digitised_Thesis.pdf.
Full textVajaria, Himanshu. "Diarization, localization and indexing of meeting archives." [Tampa, Fla] : University of South Florida, 2008. http://purl.fcla.edu/usf/dc/et/SFE0002581.
Full textZhang, Xianxian. "Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition." Diss., Connect to online resource, 2005. http://wwwlib.umi.com/cr/colorado/fullcit?p3190350.
Full textBrangers, Kirstin M. "Perceptual Ruler for Quantifying Speech Intelligibility in Cocktail Party Scenarios." UKnowledge, 2013. http://uknowledge.uky.edu/ece_etds/31.
Full textBarkmeier, Julie Marie. "Intelligibility of dysarthric speakers: audio-only and audio-visual presentations." Thesis, University of Iowa, 1988. https://ir.uiowa.edu/etd/5698.
Full textLarcher, Anthony. "Modèles acoustiques à structure temporelle renforcée pour la vérification du locuteur embarquée." Phd thesis, Université d'Avignon, 2009. http://tel.archives-ouvertes.fr/tel-00453645.
Full textKilic, V. "Audio-visual tracking of multiple moving speakers." Thesis, University of Surrey, 2016. http://epubs.surrey.ac.uk/809761/.
Full textSturtzer, Eric. "Modélisation en vue de l'intégration d'un système audio de micro puissance comprenant un haut-parleur MEMS et son amplificateur." Phd thesis, INSA de Lyon, 2013. http://tel.archives-ouvertes.fr/tel-00940463.
Full textCollins, Christopher Michael. "Development of a Virtual Acoustic Showroom for Simulating Listening Environments and Audio Speakers." Thesis, Virginia Tech, 2004. http://hdl.handle.net/10919/9965.
Full textMaster of Science
Syncox, David. "The effects of audio-taped feedback on ESL graduate student writing." Thesis, McGill University, 2003. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=19391.
Full textEiderbo, Ian. "How does binaural audio mixed for headphones translate to loudspeaker setups in terms of listener preferences?" Thesis, Luleå tekniska universitet, Institutionen för ekonomi, teknik, konst och samhälle, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-85732.
Full textScaini, Davide. "Wavelet-based spatial audio framework : from ambisonics to wavelets: a novel approach to spatial audio." Doctoral thesis, Universitat Pompeu Fabra, 2019. http://hdl.handle.net/10803/668214.
Full textAmbisonics és una teoria completa d’àudio espacial construïda a partir dels harmònics esfèrics. Alguns dels inconvenients d'Ambisonics de baix ordre, com ara una localització pobra i una àrea petita d’escolta òptima, estan directament relacionats amb les propietats dels harmònics esfèrics. En aquesta tesi presentem un nou formalisme d’àudio espacial basat en Ambisonics substituint però els harmònics esfèrics per les ondetes esfèriques. Desenvolupem una cadena d’àudio completa, des de la codificació fins a la descodificació, a través de l'ús de ondetes discretes construïdes en una malla de multirresolució. Mostrem com es pot generar la família de ondetes i les matrius de descodificació a altaveus mitjançant una optimització numèrica. Presentem un algorisme de descodificació que pot generar matrius de descodificació a conjunts irregulars d'altaveus tant per a Ambisonics com per al nou format basat en ondetes. Finalment, comparem aquest nou formalisme d’àudio amb Ambisonics.
Li, Ying. "Audio-visual training effect on L2 perception and production of English /0/-/s/ and /d/-/z/ by Mandarin speakers." Thesis, University of Newcastle upon Tyne, 2015. http://hdl.handle.net/10443/3052.
Full textBern, Charlotte, and Linda Liljeström. "“Request to speak, button” : Accessibility for visually impaired VoiceOver users on social live audio chat platforms." Thesis, Linnéuniversitetet, Institutionen för informatik (IK), 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:lnu:diva-105457.
Full textZhang, Xiangmei. "Authentic materials in English as a Second Language conversation instruction." CSUSB ScholarWorks, 2004. https://scholarworks.lib.csusb.edu/etd-project/2526.
Full textHussin, Nora Anniesha Binte. "Interaction from an activity theoretical perspective: comparing learner discourse of language face-to-face, inchat and in audio conferencing in second language learning." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2009. http://hub.hku.hk/bib/B41758146.
Full textAoyama, Kazumasa. "Using A Diglot Reader to Teach Kanji: The Effects of Audio and Romaji on the Acquisition of Kanji Vocabulary." Diss., CLICK HERE for online access, 2005. http://contentdm.lib.byu.edu/ETD/image/etd888.pdf.
Full textBodenstein, Eckhard W. "Lernervoraussetzungen von Deutschstudenten an der Universitat Zululand : eine Untersuchung auf der Grundlage von Bildtexten." Thesis, Stellenbosch : Stellenbosch University, 1998. http://hdl.handle.net/10019.1/50985.
Full textENGLISH ABSTRACT: During my work as a lecturer in "German as a foreign language" at the University of Zululand I have experienced that African students often understand German texts in a different way than I, coming from a European background, would have expected. According to the research on text reception, differences in understanding texts are the result of different reader characteristics of which the socio-cultural background forms an important component. This thesis examines the socio-cultural background of Zulu students and aims to show how it influences their understanding of German texts. The necessary data is obtained by way of a comparative empirical investigation which is enhanced by personal observations made while teaching German to African learners. The investigation is based on a German advertisement. The control groups consist of South African students at the Universities of Natal/Durban and Stellenbosch as well as students in Germany at the University of Kassel. The investigation is concluded by a discussion of the implications that the socio-cultural background of Zulu students can have on the teaching of "German as a foreign language" and on intercultural communication.
AFRIKAANSE OPSOMMING: Gedurende my werks,aamheidas dosent in die vak "Duits as vreemde taal" aan die Universiteit van Zululand het ek ondervind dat Swart studente dikwels Duitse tekste heeltemal anders verstaan as wat ek, as iemand met Europese agtergrond, sou verwag het. Navorsing oar teks-resepsie skryf resepsieverskille toe aan verskillende lesereienskappe waarvan die sosio-kulturele agtergrond 'n belangrike komponent vorm. Hierdie tesis ondersoek die sosio-kulturele agtergrond van Zoeloe-studente en probeer aantoon hoe dit die resepsie van Duitse tekste be'invloed. Die nodige inligting hiervoor word verkry deur middel van 'n vergelykende empiriese ondersoek. Dit word aangevul deur persoonlike waarnemings wat ek gedurende die onderrig van Duits aan Swart studente gemaak het. Die ondersoek is gebaseer op 'n Duitse advertensie. Die kontrolegroepe bestaan uit studente aan die universiteite in Natal/Durban en Stellenbosch in Suid- Afrika en in Duitsland aan die Universiteit van Kassel. In die slotgedeelte word die implikasies uitgewys wat die sosio-kulturele agtergrond van Zoeloe-studente op die onderrig van "Duits as vreemde taal" as oak op interkulturele kommunikasie kan he.
Murray, Garold Linwood. "Bodies in cyberspace : language learning in a simulated environment." Thesis, National Library of Canada = Bibliothèque nationale du Canada, 1998. http://www.collectionscanada.ca/obj/s4/f2/dsk2/ftp02/NQ27209.pdf.
Full textThompson, Scott Alan. "A Comparison of the Effects of Different Video Imagery Upon Adult ESL Students' Comprehension of a Video Narrative." PDXScholar, 1994. https://pdxscholar.library.pdx.edu/open_access_etds/4845.
Full textZappen-Thomson, Marianne 1956. "Liedertexte im fremdkulturellen Literaturunterricht : eine textwissenschaftliche und -didaktische Untersuchung." Thesis, Stellenbosch : Stellenbosch University, 1985. http://hdl.handle.net/10019.1/64968.
Full textTyson, Marian. "The effect of media on the listening comprehension scores of intermediate ESL students." PDXScholar, 1989. https://pdxscholar.library.pdx.edu/open_access_etds/3961.
Full textSundberg, Daniel. "HANDSFREE-ENHET FÖR MOBIL TRYGGHETSTELEFON." Thesis, Örebro University, Örebro University, Department of Technology, 2009. http://urn.kb.se/resolve?urn=urn:nbn:se:oru:diva-7411.
Full textCnior Mobile AB i Lindesberg utvecklar en mobil trygghetstelefon för äldre. Detta examensarbete går ut på att utforma en handsfree-enhet för denna. Handsfree-enheten ska integreras i larmknappen, som bärs av användaren runt handleden, och har kontakt med telefonen via blåtandsradio. I examensarbetet ingår att välja ut lämplig högtalare och mikrofon, hitta lösningar för smuts- och vattentålighet samt att lösa problem med ekon och bakgrundsstörningar.
En högtalare hittades som uppfyllde kraven för smuts- och vattentålighet samtidigt som den hade utmärkt frekvensgång för återgivning av tydligt tal. Vattenavrinning från högtalaren löstes genom att ett sinussvep sänds ut från högtalaren varje gång ett samtal ska kopplas upp. På så sätt pressar ljudtrycket ut vattnet från handledsknappens kavitet. Olika utformningar av ljudhålen i handledsknappens skal provades. Den bästa lösningen för vattenavrinningen var att använda sju stycken runda hål med 1,3 mm i diameter. En ljudtrycksmätning säkerställde att ljudtrycket inte blev lidande av denna utformning av ljudhålen.
Ekosläckning och bakgrundsstörningsundertryckning sköts av GSM-modulen i trygghetsmobilen. I ekosläckningens manual finns beskrivet hur ekosläckningens 24 parametrar kan justeras för att passa olika applikationer. Endast en mindre ändring av de rekommenderade parametervärdena behövdes för att ekosläckning och bakgrundsstörningsundertryckning skulle fungera tillfredställande.
Eftersom mikrofonernas datablad visade på så snarlika egenskaper överlämnades mikrofonvalet till företaget, då det kan vara klokt att låta priset avgöra.
Kůst, Martin. "Konstrukční návrh moderního těla reproduktoru s využitím nových technologií." Master's thesis, Vysoké učení technické v Brně. Fakulta strojního inženýrství, 2020. http://www.nusl.cz/ntk/nusl-432589.
Full textAnguiano, Arcelia. "Visual literacy in kindergarten: How can visual literacy be used as a tool to promote student learning in the kindergarten classroom?" CSUSB ScholarWorks, 2004. https://scholarworks.lib.csusb.edu/etd-project/2559.
Full textShintani, Emi. "Teaching film to enhance brain compatible-learning in English-as-a-foreign language instruction." CSUSB ScholarWorks, 2003. https://scholarworks.lib.csusb.edu/etd-project/2403.
Full textLin, Yi-Chun, and 林怡君. "Performance Improvement of Speaker Recognition for Clipped Audio Signals." Thesis, 2012. http://ndltd.ncl.edu.tw/handle/ysqxnq.
Full text國立臺北科技大學
電腦與通訊研究所
100
This thesis investigates the problem of speaker verification under the condition that the recorded speech signals are clipped due to the saturation of quantization. The clipping of audio signals is not only unpleasant for human listening but also detrimental for speaker verification systems. Although there are a number of restoration techniques for improving the auditory quality of the clipped speech signals, it is found that the speaker characteristics of the restored clipped speech signals can be significantly changed; hence, the restoration techniques are of little help for speaker verification . To solve this problem, this study proposes improving the speaker verification by pruning the clipped signals instead of restoring them. However, to avoid that the length of a testing speech signal may be shorten severely after the pruning, we develop methods for detecting and discarding the speech frames that contain harmful clipped signals while keeping the speech frames that contain acceptable clipped signals. Our experiments conducted using the NIST2001 SRE database show that the proposed methods can reduce around 10% of the equal error rate of the speaker verification .
Chen, Wayne Long, and 陳偉恩. "The impact of smart speaker to the audio industry." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/779da4.
Full text國立政治大學
國際經營管理英語碩士學位學程(IMBA)
107
The smart speaker technology was introduced to the market in recent years. The technology has changed not only the behavior of consumers but also impacted the entire electronic world. This thesis will specifically discuss about how smart speaker impacts the audio industry. From the introduction of the current audio industry to the rise of voice assistant technology, a thorough history background is covered in order to give a holistic view of the industry. This thesis will also introduce the players including technology providers, audio brands, and system manufacturers. The relationship between these players, problems they face, and the strategy they take to grow their businesses with each other will be the main analysis of this thesis. As the evolution of the smart voice technology continues, smart speakers become the trend of the future. From the market side, the thesis covers the current and future demand on the smart speakers. How to fulfill such demand from supplier’s side and what can each player continues to bring to the table are also discussed. At the end, strategic recommendations are provided to all players in the smart speaker supply chain. The goal is to adapt this abrupt change and able to continue the growth with this technology evolution.
郭志梃. "DSP implementation of an audio/video system using panel speaker array." Thesis, 2002. http://ndltd.ncl.edu.tw/handle/75484798920950449088.
Full text國立交通大學
機械工程系
90
Applying the technology of the array signal processing to make the sound radiate omnidirectionally is the main purpose of this paper. Hence the method of designing array coefficients to form omnidirectional pattern was employed in this paper. Further, the efficiency of omnidirectional response was greatly improved by using the method of optimization. The optimization is to find out a set of array coefficients, which has optimal efficiency at desired flatness of sound pattern. Owing to the nonlinear relation between array coefficients and spectral flatness function, a method of optimization called genetic algorithm was employed because of its effective searching global maximum value in nonlinear space. Further, a special case called modified optimal omnidirectional case occurs in low frequency. To provide more efficiency at low frequency is the main purpose in this case.ays.
Tsao, Yan-cheng, and 曹晏誠. "A Speech Indexing System Using the Audio Segmentation and Speaker Clustering Schemes." Thesis, 2005. http://ndltd.ncl.edu.tw/handle/03049870471314472707.
Full textChen, Chun-chi, and 陳俊吉. "A Design of Multi-session Text-independent Digital Camcorder Audio-Video Database for Speaker Recognition." Thesis, 2008. http://ndltd.ncl.edu.tw/handle/vqrmxu.
Full text國立中山大學
電機工程學系研究所
96
In this thesis, an audio-video database for speaker recognition is constructed using a digital camcorder. Motion pictures of fifteen hundred speakers are recorded in three different sessions in the database. For each speaker, 20 still images per session are also derived from the video data. It is hoped that this database can provide an appropriate training and testing mechanism for person identification using both voice and face features.
Wang, Long-Cheng, and 王龍政. "A Design of Multi-Session, Text Independent, TV-Recorded Audio-Video Database for Speaker Recognition." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/55168776720675963268.
Full text國立中山大學
電機工程學系研究所
94
A four-session text independent, TV-recorded audio-video database for speaker recognition is collected in this thesis. The speaker data is used to verify the applicability of a design methodology based on Mel-frequency cepstrum coefficients and Gaussian mixture model. Both single-session and multi-session problems are discussed in the thesis. Experimental results indicate that 90% correct rate can be achieved for a single-session 3000-speaker corpus while only 67% correct rate can be obtained for a two-session 800-speaker dataset. The performance of a multi-session speaker recognition system is greatly reduced due to the variability incurred in the recording environment, speakers’ recording mood and other unknown factors. How to increase the system performance under multi-session conditions becomes a challenging task in the future. And the establishment of such a multi-session large-scale speaker database does indeed play an indispensable role in this task.
Garud, Meera. "Cricket Inspired micro Speakers." Thesis, 2019. https://etd.iisc.ac.in/handle/2005/4585.
Full text