Rozprawy doktorskie na temat „Audio speaker”
Utwórz poprawne odniesienie w stylach APA, MLA, Chicago, Harvard i wielu innych
Sprawdź 50 najlepszych rozpraw doktorskich naukowych na temat „Audio speaker”.
Przycisk „Dodaj do bibliografii” jest dostępny obok każdej pracy w bibliografii. Użyj go – a my automatycznie utworzymy odniesienie bibliograficzne do wybranej pracy w stylu cytowania, którego potrzebujesz: APA, MLA, Harvard, Chicago, Vancouver itp.
Możesz również pobrać pełny tekst publikacji naukowej w formacie „.pdf” i przeczytać adnotację do pracy online, jeśli odpowiednie parametry są dostępne w metadanych.
Przeglądaj rozprawy doktorskie z różnych dziedzin i twórz odpowiednie bibliografie.
Khan, Faheem. "Audio-visual speaker separation". Thesis, University of East Anglia, 2016. https://ueaeprints.uea.ac.uk/59679/.
Pełny tekst źródłaKwon, Patrick (Patrick Ryan) 1975. "Speaker spotting : automatic annotation of audio data with speaker identity". Thesis, Massachusetts Institute of Technology, 1998. http://hdl.handle.net/1721.1/47608.
Pełny tekst źródłaSeymour, R. "Audio-visual speech and speaker recognition". Thesis, Queen's University Belfast, 2008. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.492489.
Pełny tekst źródłaMalegaonkar, Amit. "Speaker-based indexation of conversational audio". Thesis, University of Hertfordshire, 2006. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.440175.
Pełny tekst źródłaD'Arca, Eleonora. "Speaker tracking in a joint audio-video network". Thesis, Heriot-Watt University, 2015. http://hdl.handle.net/10399/2972.
Pełny tekst źródłaLathe, Andrew. "Speaker Prototyping Design". Digital Commons @ East Tennessee State University, 2020. https://dc.etsu.edu/honors/584.
Pełny tekst źródłaMartí, Guerola Amparo. "Multichannel audio processing for speaker localization, separation and enhancement". Doctoral thesis, Universitat Politècnica de València, 2013. http://hdl.handle.net/10251/33101.
Pełny tekst źródłaMartí Guerola, A. (2013). Multichannel audio processing for speaker localization, separation and enhancement [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/33101
TESIS
Lucey, Simon. "Audio-visual speech processing". Thesis, Queensland University of Technology, 2002. https://eprints.qut.edu.au/36172/7/SimonLuceyPhDThesis.pdf.
Pełny tekst źródłaAbdelraheem, Mahmoud Fakhry Mahmoud. "Exploiting spatial and spectral information for audio source separation and speaker diarization". Doctoral thesis, University of Trento, 2016. http://eprints-phd.biblio.unitn.it/1876/1/PhD_Thesis.pdf.
Pełny tekst źródłaDean, David Brendan. "Synchronous HMMs for audio-visual speech processing". Thesis, Queensland University of Technology, 2008. https://eprints.qut.edu.au/17689/3/David_Dean_Thesis.pdf.
Pełny tekst źródłaDean, David Brendan. "Synchronous HMMs for audio-visual speech processing". Queensland University of Technology, 2008. http://eprints.qut.edu.au/17689/.
Pełny tekst źródłaAlmaadeed, Noor. "Evaluation and analysis of hybrid intelligent pattern recognition techniques for speaker identification". Thesis, Brunel University, 2014. http://bura.brunel.ac.uk/handle/2438/8760.
Pełny tekst źródłaRaghunathan, Anusha. "EVALUATION OF INTELLIGIBILITY AND SPEAKER SIMILARITY OF VOICE TRANSFORMATION". UKnowledge, 2011. http://uknowledge.uky.edu/gradschool_theses/101.
Pełny tekst źródłaKrishnan, Ravikiran. "Detecting Group Turns of Speaker Groups in Meeting Room Conversations Using Audio-Video Change Scale-Space". Scholar Commons, 2010. http://scholarcommons.usf.edu/etd/3644.
Pełny tekst źródłaSoldi, Giovanni. "Diarisation du locuteur en temps réel pour les objets intelligents". Electronic Thesis or Diss., Paris, ENST, 2016. http://www.theses.fr/2016ENST0061.
Pełny tekst źródłaOn-line speaker diarization aims to detect “who is speaking now" in a given audio stream. The majority of proposed on-line speaker diarization systems has focused on less challenging domains, such as broadcast news and plenary speeches, characterised by long speaker turns and low spontaneity. The first contribution of this thesis is the development of a completely unsupervised adaptive on-line diarization system for challenging and highly spontaneous meeting data. Due to the obtained high diarization error rates, a semi-supervised approach to on-line diarization, whereby speaker models are seeded with a modest amount of manually labelled data and adapted by an efficient incremental maximum a-posteriori adaptation (MAP) procedure, is proposed. Obtained error rates may be low enough to support practical applications. The second part of the thesis addresses instead the problem of phone normalisation when dealing with short-duration speaker modelling. First, Phone Adaptive Training (PAT), a recently proposed technique, is assessed and optimised at the speaker modelling level and in the context of automatic speaker verification (ASV) and then is further developed towards a completely unsupervised system using automatically generated acoustic class transcriptions, whose number is controlled by regression tree analysis. PAT delivers significant improvements in the performance of a state-of-the-art iVector ASV system even when accurate phonetic transcriptions are not available
Unnikrishnan, Harikrishnan. "AUDIO SCENE SEGEMENTATION USING A MICROPHONE ARRAY AND AUDITORY FEATURES". UKnowledge, 2010. http://uknowledge.uky.edu/gradschool_theses/622.
Pełny tekst źródłaMiller, William H. "Analog Implementation of DVM and Farrow Filter Based Beamforming Algorithms for Audio Frequencies". University of Akron / OhioLINK, 2018. http://rave.ohiolink.edu/etdc/view?acc_num=akron1531951902410037.
Pełny tekst źródłaLeis, John W. "Spectral coding methods for speech compression and speaker identification". Thesis, Queensland University of Technology, 1998. https://eprints.qut.edu.au/36062/7/36062_Digitised_Thesis.pdf.
Pełny tekst źródłaVajaria, Himanshu. "Diarization, localization and indexing of meeting archives". [Tampa, Fla] : University of South Florida, 2008. http://purl.fcla.edu/usf/dc/et/SFE0002581.
Pełny tekst źródłaZhang, Xianxian. "Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition". Diss., Connect to online resource, 2005. http://wwwlib.umi.com/cr/colorado/fullcit?p3190350.
Pełny tekst źródłaBrangers, Kirstin M. "Perceptual Ruler for Quantifying Speech Intelligibility in Cocktail Party Scenarios". UKnowledge, 2013. http://uknowledge.uky.edu/ece_etds/31.
Pełny tekst źródłaBarkmeier, Julie Marie. "Intelligibility of dysarthric speakers: audio-only and audio-visual presentations". Thesis, University of Iowa, 1988. https://ir.uiowa.edu/etd/5698.
Pełny tekst źródłaLarcher, Anthony. "Modèles acoustiques à structure temporelle renforcée pour la vérification du locuteur embarquée". Phd thesis, Université d'Avignon, 2009. http://tel.archives-ouvertes.fr/tel-00453645.
Pełny tekst źródłaKilic, V. "Audio-visual tracking of multiple moving speakers". Thesis, University of Surrey, 2016. http://epubs.surrey.ac.uk/809761/.
Pełny tekst źródłaSturtzer, Eric. "Modélisation en vue de l'intégration d'un système audio de micro puissance comprenant un haut-parleur MEMS et son amplificateur". Phd thesis, INSA de Lyon, 2013. http://tel.archives-ouvertes.fr/tel-00940463.
Pełny tekst źródłaCollins, Christopher Michael. "Development of a Virtual Acoustic Showroom for Simulating Listening Environments and Audio Speakers". Thesis, Virginia Tech, 2004. http://hdl.handle.net/10919/9965.
Pełny tekst źródłaMaster of Science
Syncox, David. "The effects of audio-taped feedback on ESL graduate student writing". Thesis, McGill University, 2003. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=19391.
Pełny tekst źródłaEiderbo, Ian. "How does binaural audio mixed for headphones translate to loudspeaker setups in terms of listener preferences?" Thesis, Luleå tekniska universitet, Institutionen för ekonomi, teknik, konst och samhälle, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-85732.
Pełny tekst źródłaScaini, Davide. "Wavelet-based spatial audio framework : from ambisonics to wavelets: a novel approach to spatial audio". Doctoral thesis, Universitat Pompeu Fabra, 2019. http://hdl.handle.net/10803/668214.
Pełny tekst źródłaAmbisonics és una teoria completa d’àudio espacial construïda a partir dels harmònics esfèrics. Alguns dels inconvenients d'Ambisonics de baix ordre, com ara una localització pobra i una àrea petita d’escolta òptima, estan directament relacionats amb les propietats dels harmònics esfèrics. En aquesta tesi presentem un nou formalisme d’àudio espacial basat en Ambisonics substituint però els harmònics esfèrics per les ondetes esfèriques. Desenvolupem una cadena d’àudio completa, des de la codificació fins a la descodificació, a través de l'ús de ondetes discretes construïdes en una malla de multirresolució. Mostrem com es pot generar la família de ondetes i les matrius de descodificació a altaveus mitjançant una optimització numèrica. Presentem un algorisme de descodificació que pot generar matrius de descodificació a conjunts irregulars d'altaveus tant per a Ambisonics com per al nou format basat en ondetes. Finalment, comparem aquest nou formalisme d’àudio amb Ambisonics.
Li, Ying. "Audio-visual training effect on L2 perception and production of English /0/-/s/ and /d/-/z/ by Mandarin speakers". Thesis, University of Newcastle upon Tyne, 2015. http://hdl.handle.net/10443/3052.
Pełny tekst źródłaBern, Charlotte, i Linda Liljeström. "“Request to speak, button” : Accessibility for visually impaired VoiceOver users on social live audio chat platforms". Thesis, Linnéuniversitetet, Institutionen för informatik (IK), 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:lnu:diva-105457.
Pełny tekst źródłaZhang, Xiangmei. "Authentic materials in English as a Second Language conversation instruction". CSUSB ScholarWorks, 2004. https://scholarworks.lib.csusb.edu/etd-project/2526.
Pełny tekst źródłaHussin, Nora Anniesha Binte. "Interaction from an activity theoretical perspective: comparing learner discourse of language face-to-face, inchat and in audio conferencing in second language learning". Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2009. http://hub.hku.hk/bib/B41758146.
Pełny tekst źródłaAoyama, Kazumasa. "Using A Diglot Reader to Teach Kanji: The Effects of Audio and Romaji on the Acquisition of Kanji Vocabulary". Diss., CLICK HERE for online access, 2005. http://contentdm.lib.byu.edu/ETD/image/etd888.pdf.
Pełny tekst źródłaBodenstein, Eckhard W. "Lernervoraussetzungen von Deutschstudenten an der Universitat Zululand : eine Untersuchung auf der Grundlage von Bildtexten". Thesis, Stellenbosch : Stellenbosch University, 1998. http://hdl.handle.net/10019.1/50985.
Pełny tekst źródłaENGLISH ABSTRACT: During my work as a lecturer in "German as a foreign language" at the University of Zululand I have experienced that African students often understand German texts in a different way than I, coming from a European background, would have expected. According to the research on text reception, differences in understanding texts are the result of different reader characteristics of which the socio-cultural background forms an important component. This thesis examines the socio-cultural background of Zulu students and aims to show how it influences their understanding of German texts. The necessary data is obtained by way of a comparative empirical investigation which is enhanced by personal observations made while teaching German to African learners. The investigation is based on a German advertisement. The control groups consist of South African students at the Universities of Natal/Durban and Stellenbosch as well as students in Germany at the University of Kassel. The investigation is concluded by a discussion of the implications that the socio-cultural background of Zulu students can have on the teaching of "German as a foreign language" and on intercultural communication.
AFRIKAANSE OPSOMMING: Gedurende my werks,aamheidas dosent in die vak "Duits as vreemde taal" aan die Universiteit van Zululand het ek ondervind dat Swart studente dikwels Duitse tekste heeltemal anders verstaan as wat ek, as iemand met Europese agtergrond, sou verwag het. Navorsing oar teks-resepsie skryf resepsieverskille toe aan verskillende lesereienskappe waarvan die sosio-kulturele agtergrond 'n belangrike komponent vorm. Hierdie tesis ondersoek die sosio-kulturele agtergrond van Zoeloe-studente en probeer aantoon hoe dit die resepsie van Duitse tekste be'invloed. Die nodige inligting hiervoor word verkry deur middel van 'n vergelykende empiriese ondersoek. Dit word aangevul deur persoonlike waarnemings wat ek gedurende die onderrig van Duits aan Swart studente gemaak het. Die ondersoek is gebaseer op 'n Duitse advertensie. Die kontrolegroepe bestaan uit studente aan die universiteite in Natal/Durban en Stellenbosch in Suid- Afrika en in Duitsland aan die Universiteit van Kassel. In die slotgedeelte word die implikasies uitgewys wat die sosio-kulturele agtergrond van Zoeloe-studente op die onderrig van "Duits as vreemde taal" as oak op interkulturele kommunikasie kan he.
Murray, Garold Linwood. "Bodies in cyberspace : language learning in a simulated environment". Thesis, National Library of Canada = Bibliothèque nationale du Canada, 1998. http://www.collectionscanada.ca/obj/s4/f2/dsk2/ftp02/NQ27209.pdf.
Pełny tekst źródłaThompson, Scott Alan. "A Comparison of the Effects of Different Video Imagery Upon Adult ESL Students' Comprehension of a Video Narrative". PDXScholar, 1994. https://pdxscholar.library.pdx.edu/open_access_etds/4845.
Pełny tekst źródłaZappen-Thomson, Marianne 1956. "Liedertexte im fremdkulturellen Literaturunterricht : eine textwissenschaftliche und -didaktische Untersuchung". Thesis, Stellenbosch : Stellenbosch University, 1985. http://hdl.handle.net/10019.1/64968.
Pełny tekst źródłaTyson, Marian. "The effect of media on the listening comprehension scores of intermediate ESL students". PDXScholar, 1989. https://pdxscholar.library.pdx.edu/open_access_etds/3961.
Pełny tekst źródłaSundberg, Daniel. "HANDSFREE-ENHET FÖR MOBIL TRYGGHETSTELEFON". Thesis, Örebro University, Örebro University, Department of Technology, 2009. http://urn.kb.se/resolve?urn=urn:nbn:se:oru:diva-7411.
Pełny tekst źródłaCnior Mobile AB i Lindesberg utvecklar en mobil trygghetstelefon för äldre. Detta examensarbete går ut på att utforma en handsfree-enhet för denna. Handsfree-enheten ska integreras i larmknappen, som bärs av användaren runt handleden, och har kontakt med telefonen via blåtandsradio. I examensarbetet ingår att välja ut lämplig högtalare och mikrofon, hitta lösningar för smuts- och vattentålighet samt att lösa problem med ekon och bakgrundsstörningar.
En högtalare hittades som uppfyllde kraven för smuts- och vattentålighet samtidigt som den hade utmärkt frekvensgång för återgivning av tydligt tal. Vattenavrinning från högtalaren löstes genom att ett sinussvep sänds ut från högtalaren varje gång ett samtal ska kopplas upp. På så sätt pressar ljudtrycket ut vattnet från handledsknappens kavitet. Olika utformningar av ljudhålen i handledsknappens skal provades. Den bästa lösningen för vattenavrinningen var att använda sju stycken runda hål med 1,3 mm i diameter. En ljudtrycksmätning säkerställde att ljudtrycket inte blev lidande av denna utformning av ljudhålen.
Ekosläckning och bakgrundsstörningsundertryckning sköts av GSM-modulen i trygghetsmobilen. I ekosläckningens manual finns beskrivet hur ekosläckningens 24 parametrar kan justeras för att passa olika applikationer. Endast en mindre ändring av de rekommenderade parametervärdena behövdes för att ekosläckning och bakgrundsstörningsundertryckning skulle fungera tillfredställande.
Eftersom mikrofonernas datablad visade på så snarlika egenskaper överlämnades mikrofonvalet till företaget, då det kan vara klokt att låta priset avgöra.
Kůst, Martin. "Konstrukční návrh moderního těla reproduktoru s využitím nových technologií". Master's thesis, Vysoké učení technické v Brně. Fakulta strojního inženýrství, 2020. http://www.nusl.cz/ntk/nusl-432589.
Pełny tekst źródłaAnguiano, Arcelia. "Visual literacy in kindergarten: How can visual literacy be used as a tool to promote student learning in the kindergarten classroom?" CSUSB ScholarWorks, 2004. https://scholarworks.lib.csusb.edu/etd-project/2559.
Pełny tekst źródłaShintani, Emi. "Teaching film to enhance brain compatible-learning in English-as-a-foreign language instruction". CSUSB ScholarWorks, 2003. https://scholarworks.lib.csusb.edu/etd-project/2403.
Pełny tekst źródłaLin, Yi-Chun, i 林怡君. "Performance Improvement of Speaker Recognition for Clipped Audio Signals". Thesis, 2012. http://ndltd.ncl.edu.tw/handle/ysqxnq.
Pełny tekst źródła國立臺北科技大學
電腦與通訊研究所
100
This thesis investigates the problem of speaker verification under the condition that the recorded speech signals are clipped due to the saturation of quantization. The clipping of audio signals is not only unpleasant for human listening but also detrimental for speaker verification systems. Although there are a number of restoration techniques for improving the auditory quality of the clipped speech signals, it is found that the speaker characteristics of the restored clipped speech signals can be significantly changed; hence, the restoration techniques are of little help for speaker verification . To solve this problem, this study proposes improving the speaker verification by pruning the clipped signals instead of restoring them. However, to avoid that the length of a testing speech signal may be shorten severely after the pruning, we develop methods for detecting and discarding the speech frames that contain harmful clipped signals while keeping the speech frames that contain acceptable clipped signals. Our experiments conducted using the NIST2001 SRE database show that the proposed methods can reduce around 10% of the equal error rate of the speaker verification .
Chen, Wayne Long, i 陳偉恩. "The impact of smart speaker to the audio industry". Thesis, 2019. http://ndltd.ncl.edu.tw/handle/779da4.
Pełny tekst źródła國立政治大學
國際經營管理英語碩士學位學程(IMBA)
107
The smart speaker technology was introduced to the market in recent years. The technology has changed not only the behavior of consumers but also impacted the entire electronic world. This thesis will specifically discuss about how smart speaker impacts the audio industry. From the introduction of the current audio industry to the rise of voice assistant technology, a thorough history background is covered in order to give a holistic view of the industry. This thesis will also introduce the players including technology providers, audio brands, and system manufacturers. The relationship between these players, problems they face, and the strategy they take to grow their businesses with each other will be the main analysis of this thesis. As the evolution of the smart voice technology continues, smart speakers become the trend of the future. From the market side, the thesis covers the current and future demand on the smart speakers. How to fulfill such demand from supplier’s side and what can each player continues to bring to the table are also discussed. At the end, strategic recommendations are provided to all players in the smart speaker supply chain. The goal is to adapt this abrupt change and able to continue the growth with this technology evolution.
郭志梃. "DSP implementation of an audio/video system using panel speaker array". Thesis, 2002. http://ndltd.ncl.edu.tw/handle/75484798920950449088.
Pełny tekst źródła國立交通大學
機械工程系
90
Applying the technology of the array signal processing to make the sound radiate omnidirectionally is the main purpose of this paper. Hence the method of designing array coefficients to form omnidirectional pattern was employed in this paper. Further, the efficiency of omnidirectional response was greatly improved by using the method of optimization. The optimization is to find out a set of array coefficients, which has optimal efficiency at desired flatness of sound pattern. Owing to the nonlinear relation between array coefficients and spectral flatness function, a method of optimization called genetic algorithm was employed because of its effective searching global maximum value in nonlinear space. Further, a special case called modified optimal omnidirectional case occurs in low frequency. To provide more efficiency at low frequency is the main purpose in this case.ays.
Tsao, Yan-cheng, i 曹晏誠. "A Speech Indexing System Using the Audio Segmentation and Speaker Clustering Schemes". Thesis, 2005. http://ndltd.ncl.edu.tw/handle/03049870471314472707.
Pełny tekst źródłaChen, Chun-chi, i 陳俊吉. "A Design of Multi-session Text-independent Digital Camcorder Audio-Video Database for Speaker Recognition". Thesis, 2008. http://ndltd.ncl.edu.tw/handle/vqrmxu.
Pełny tekst źródła國立中山大學
電機工程學系研究所
96
In this thesis, an audio-video database for speaker recognition is constructed using a digital camcorder. Motion pictures of fifteen hundred speakers are recorded in three different sessions in the database. For each speaker, 20 still images per session are also derived from the video data. It is hoped that this database can provide an appropriate training and testing mechanism for person identification using both voice and face features.
Wang, Long-Cheng, i 王龍政. "A Design of Multi-Session, Text Independent, TV-Recorded Audio-Video Database for Speaker Recognition". Thesis, 2006. http://ndltd.ncl.edu.tw/handle/55168776720675963268.
Pełny tekst źródła國立中山大學
電機工程學系研究所
94
A four-session text independent, TV-recorded audio-video database for speaker recognition is collected in this thesis. The speaker data is used to verify the applicability of a design methodology based on Mel-frequency cepstrum coefficients and Gaussian mixture model. Both single-session and multi-session problems are discussed in the thesis. Experimental results indicate that 90% correct rate can be achieved for a single-session 3000-speaker corpus while only 67% correct rate can be obtained for a two-session 800-speaker dataset. The performance of a multi-session speaker recognition system is greatly reduced due to the variability incurred in the recording environment, speakers’ recording mood and other unknown factors. How to increase the system performance under multi-session conditions becomes a challenging task in the future. And the establishment of such a multi-session large-scale speaker database does indeed play an indispensable role in this task.
Garud, Meera. "Cricket Inspired micro Speakers". Thesis, 2019. https://etd.iisc.ac.in/handle/2005/4585.
Pełny tekst źródła