Dissertations / Theses on the topic 'Binaural localization'

To see the other types of publications on this topic, follow the link: Binaural localization.

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 33 dissertations / theses for your research on the topic 'Binaural localization.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.

1

Reid, Greg L. "Active binaural sound localization techniques, experiments and comparisons." Thesis, National Library of Canada = Bibliothèque nationale du Canada, 1999. http://www.collectionscanada.ca/obj/s4/f2/dsk2/ftp01/MQ39225.pdf.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Wang, Qiang 1968. "Underwater object localization using a biomimetic binaural sonar." Thesis, Massachusetts Institute of Technology, 1999. http://hdl.handle.net/1721.1/80359.

Full text
Abstract:
Thesis (S.M. in Oceanographic Engineering)--Joint Program in Applied Ocean Science and Engineering (Massachusetts Institute of Technology, Dept. of Ocean Engineering; and the Woods Hole Oceanographic Institution), 1999.
Includes bibliographical references (leaves 85-89).
by Qiang Wang.
S.M.in Oceanographic Engineering
APA, Harvard, Vancouver, ISO, and other styles
3

Jansson, Conny. "Servostyrning med binaural ljudlokalisering." Thesis, Linköpings universitet, Institutionen för systemteknik, 2015. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-117605.

Full text
Abstract:
People are usually directed towards each other in conversations, to make it easier to hear what is being said. Algorithms for voice and speech recognition works in a similar way, regarding the microphone direction towards the sound source. In this thesis in electronics has therefore a servo control with binaural sound localization been implemented on a microcontroller connected to two microphones. When people perceive sound, the brain can estimate the sound source direction by comparing the time taken by the sound reaching one ear to the other [1]. The difference in time is called the interaural time difference, and can be calculated using various techniques. An exploratory comparison between the techniques cross-correlation and cross-spectrum analysis was carried out before implementation. Advantages and disadvantages of each technique were evaluated at the same time. The result is a functioning servo control, that uses a cross correlation algorithm to calculate the interaural time difference, and controls a servo motor towards the sound source with a P-regulated error reduction method. The project was implemented on the microcontroller ATmega328P from Atmel without using floating point calculations. The thesis was carried out on behalf of the company Jetspark Robotics.
APA, Harvard, Vancouver, ISO, and other styles
4

Keyrouz, Fakheredine. "Efficient binaural sound localization for humanoid robots and telepresence applications." kostenfrei, 2008. http://mediatum2.ub.tum.de/doc/648977/648977.pdf.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Benichoux, Victor. "Timing cues for azimuthal sound source localization." Phd thesis, Université René Descartes - Paris V, 2013. http://tel.archives-ouvertes.fr/tel-00931645.

Full text
Abstract:
Azimuth sound localization in many animals relies on the processing of differences in time-of-arrival of the low-frequency sounds at both ears: the interaural time differences (ITD). It was observed in some species that this cue depends on the spectrum of the signal emitted by the source. Yet, this variation is often discarded, as humans and animals are assumed to be insensitive to it. The purpose of this thesis is to assess this dependency using acoustical techniques, and explore the consequences of this additional complexity on the neurophysiology and psychophysics of sound localization. In the vicinity of rigid spheres, a sound field is diffracted, leading to frequency-dependent wave propagation regimes. Therefore, when the head is modeled as a rigid sphere, the ITD for a given position is a frequency-dependent quantity. I show that this is indeed reflected on human ITDs by studying acoustical recordings for a large number of human and animal subjects. Furthermore, I explain the effect of this variation at two scales. Locally in frequency the ITD introduces different envelope and fine structure delays in the signals reaching the ears. Second the ITD for low-frequency sounds is generally bigger than for high frequency sounds coming from the same position. In a second part, I introduce and discuss the current views on the binaural ITD-sensitive system in mammals. I expose that the heterogenous responses of such cells are well predicted when it is assumed that they are tuned to frequency-dependent ITDs. Furthermore, I discuss how those cells can be made to be tuned to a particular position in space irregardless of the frequency content of the stimulus. Overall, I argue that current data in mammals is consistent with the hypothesis that cells are tuned to a single position in space. Finally, I explore the impact of the frequency-dependence of ITD on human behavior, using psychoacoustical techniques. Subjects are asked to match the lateral position of sounds presented with different frequency content. Those results suggest that humans perceive sounds with different frequency contents at the same position provided that they have different ITDs, as predicted from acoustical data. The extent to which this occurs is well predicted by a spherical model of the head. Combining approaches from different fields, I show that the binaural system is remarkably adapted to the cues available in its environment. This processing strategy used by animals can be of great inspiration to the design of robotic systems.
APA, Harvard, Vancouver, ISO, and other styles
6

Woodruff, John F. "Integrating Monaural and Binaural Cues for Sound Localization and Segregation in Reverberant Environments." The Ohio State University, 2012. http://rave.ohiolink.edu/etdc/view?acc_num=osu1332425718.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Kyweriga, Michael. "The Synaptic Mechanisms Underlying Binaural Interactions in Rat Auditory Cortex." Thesis, University of Oregon, 2014. http://hdl.handle.net/1794/18442.

Full text
Abstract:
The interaural level difference (ILD) is a sound localization cue first computed in the lateral superior olive (LSO) by comparing the loudness of sounds between the two ears. In the auditory cortex, one class of neurons is excited by contralateral but not ipsilateral monaural sounds. These "EO" neurons prefer ILDs where contralateral sounds are louder than ipsilateral sounds. Another class, the "PB" neurons, are unresponsive to monaural sounds but respond predominantly to binaural ILDs, when both ears receive simultaneous sounds of roughly equal loudness (0 ILD). Behavioral studies show that ILD sensitivity is invariant to increasing sound levels. However, in the LSO, ILD response functions shift towards the excitatory ear as sound level increases, indicating level-dependence. Thus, changes in firing rate can indicate either a change in sound location or sound level, or both. This suggests a transformation in level-sensitivity between the LSO and the perception of sound sources, yet the location of this transformation remains unknown. I performed recordings in the auditory cortex of the rat to test whether neurons were invariant to overall sound level. I found that with increasing sound levels, ILD responses were level-dependent, suggesting that level invariance of ILD sensitivity is not present in the rat auditory cortex. In general, neurons follow one of two processing strategies. The tuning of cortical cells typically follows the "inheritance strategy", such that the spiking output of the cell matches that of the excitatory synaptic input. However, cortical tuning can be modified by inhibition in the "local processing strategy". In this case, neurons are prevented from spiking at non-preferred stimuli by inhibition that overwhelms excitation. The tuning strategy of cortical neurons to ILD remains unknown. I performed whole-cell recordings in the anesthetized rat and compared the spiking output with synaptic inputs to ILDs within the same neurons. I found that the PB neurons showed evidence of the local processing strategy, which is a novel role for cortical inhibition, whereas the EO neurons utilized the inheritance strategy. This result suggests that an auditory cortical circuit computes sensitivity for midline ILDs. This dissertation includes previously published/unpublished co-authored material.
APA, Harvard, Vancouver, ISO, and other styles
8

Kim, Ui-Hyun. "Improvement of Sound Source Localization for a Binaural Robot of Spherical Head with Pinnae." 京都大学 (Kyoto University), 2013. http://hdl.handle.net/2433/180475.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Schölling, Björn. "Binaural signal processing for source localization and noise reduction with applications to mobile robotics." Münster Verl.-Haus Monsenstein und Vannerdat, 2009. http://d-nb.info/994281242/04.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

Goeckel, Tom [Verfasser], Gerhard [Akademischer Betreuer] Lakemeyer, and Hermann [Akademischer Betreuer] Wagner. "Efficient Binaural Sound Localization in Noisy and Reverberant Environments / Tom Goeckel ; Gerhard Lakemeyer, Hermann Wagner." Aachen : Universitätsbibliothek der RWTH Aachen, 2015. http://d-nb.info/1130402738/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
11

Lerborn, Liam. "Binaural ljudlokalisering av vertikala ljudkällor : En undersökning om spatial ljudlokalisering i ett förstapersonsspel." Thesis, Högskolan i Skövde, Institutionen för informationsteknologi, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-13714.

Full text
Abstract:
Detta kandidatarbete handlar om hur implementation av binauralt ljud kan användas för att öka precisionen för spelares ljudlokalisering i datorspel. I detta arbete skapades en artefakt som testade spelares förmåga att lokalisera ljudkällor på olika vertikala nivåer. Resultatet analyserades och problematiserades med kvantitativa- och kvalitativa undersökningsmetoder. Arbetet kopplas till tidigare forskning inom binauralt ljud och hur det används inom digitala medier. Studien tar även upp ämnen som involverar den mänskliga hörseln och ljudimplementation i datorspel.
APA, Harvard, Vancouver, ISO, and other styles
12

Widman, Ludvig. "Binaural versus Stereo Audio in Navigation in a 3D Game: Differences in Perception and Localization of Sound." Thesis, Luleå tekniska universitet, Institutionen för ekonomi, teknik, konst och samhälle, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-85512.

Full text
Abstract:
Recent advancements in audio technology for computer games has made possible for implementations with binaural audio. Compared to regular stereo sound, binaural audio offers possibilities for a player to experience spatial sound, including sounds along the vertical plane, using their own headphones. A computer game prototype called “Crystal Gatherer” was created for this study to explore the possibilities of binaural audio imple- mentation regarding localization and perception of objects that make sound in a 3D game. The game featured two similar game levels, with the difference that one used binaural sound, and the other stereo sound. The levels consisted of a dark space that the player could navigate freely with the objective to find objects that make sound, called “crystals”, as fast as they could. An experiment was conducted with 14 test sub- jects that played the game, qualitative and quantitative data was collected, including the time the players took to complete the game levels, respectively, and answers about how they experienced the levels. A majority of test subjects reported that they per- ceived a difference between the levels. No significant difference was found between the levels in terms of efficacy of finding the objects that made sound. Some test subjects stated that they found localization was better in the binaural level of the game, others found the stereo level to be better in this respect. The study shows that there can exist possibilities for binaural audio to change the perception of audio in computer games.
APA, Harvard, Vancouver, ISO, and other styles
13

Bustamante, Gabriel. "Mouvement actif pour la localisation binaurale de sources sonores en robotique." Thesis, Toulouse 3, 2017. http://www.theses.fr/2017TOU30211/document.

Full text
Abstract:
Ce travail s'inscrit dans le contexte de la localisation de source sonore depuis un capteur binaural (constitué de deux microphones placés sur un élément diffusant) doté de mobilité. Un schéma de localisation "active" en trois phases est considéré : (a) estimation de primitives spatiales par une analyse court-terme du flux audio ; (b) localisation audio-motrice par assimilation de ces données et combinaison avec les ordres moteurs du capteur au sein d'un schéma d'estimation stochastique ; (c) commande en boucle fermée du mouvement du capteur de façon à améliorer la qualité de la localisation. Les recherches portent sur la définition de stratégies de "mouvement actif" constituant la phase (c). Le problème est formulé comme la maximisation d'un critère d'information défini à partir des lois de filtrage de la position relative capteur-source sur un horizon temporel glissant dans le futur (plus exactement de son espérance sur les observations qui seront assimilées sur cet horizon conditionnellement aux observations passées). Cet horizon peut être constitué du prochain instant ou des N prochains instants, ce qui donne lieu à une stratégie "one-step-ahead" ou "N-step-ahead", respectivement. Une approximation de ce critère par utilisation de la transformée "unscented" et le calcul automatique du gradient de celle-ci par exploitation des nombres duaux, permettent la détermination de la commande (en boucle fermée sur l'audio donc) à appliquer au capteur. Les résultats ont été validés par des simulations réalistes, et, pour certains, par des expérimentations sur un ensemble tête-torse anthropomorphe doté de perception binaurale et de mobilité
This work takes place within the field of sound source localization from a binaural sensor (consisting of two microphones placed on a diffusing element) endowed with mobility. An "active" three-phase localization scheme is considered: (a) estimation of spatial primitives by a short-term analysis of the audio stream; (B) audio-motor localizatio! n by assimilation of these data and combination with the! motor commands of the sensor within a stochastic estimation scheme; (C) closed-loop control of the movement of the sensor in order to improve the quality of the location. The research focuses on the definition of "active motion" strategies constituting phase (c). The problem is formulated as the maximization over a receding horizon of an information criterion defined from the filtering pdfs of the relative sensor-to-source position (more exactly of the maximization of its expectation on the N observations that will be assimilated on this horizon conditionally to the past observations). This horizon can consist of the next time instant or the next N time instants, what gives rise to a "one-step-ahead" or "N-step-ahead" strategy, respectively. An approximation of this criterion by using the unscented transform and the automatic calculation of its gradient by using the dual numbers allow the determination of the control (therefore, in closed loop on the audio) to be applied to the sensor. The results were validated by realistic simulations and, for some of them, by experiments on an anthropomorphic head-and-torso simulator endowed with binaural perception and mobility
APA, Harvard, Vancouver, ISO, and other styles
14

Colbert, Debborah. "Manatee Sound Localization: Performance Abilities, Interaural Level Cues, and Usage of Auditory Evoked Potential Techniques to Determine Sound Conduction Pathways." [Tampa, Fla] : University of South Florida, 2008. http://purl.fcla.edu/usf/dc/et/SFE0002489.

Full text
APA, Harvard, Vancouver, ISO, and other styles
15

Zohourian, Mehdi [Verfasser], Rainer [Gutachter] Martin, and Peter [Gutachter] Vary. "Contributions to binaural speaker localization and separation for dynamic acoustic scenarios / Mehdi Zohourian ; Gutachter: Rainer Martin, Peter Vary ; Fakultät für Elektrotechnik und Informationstechnik." Bochum : Ruhr-Universität Bochum, 2019. http://d-nb.info/1195221258/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
16

Novotný, Ota. "Psychoakustická měření binaurálních vlastností lidského sluchu." Master's thesis, Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií, 2010. http://www.nusl.cz/ntk/nusl-218742.

Full text
Abstract:
This diploma thesis deals with a binaural hearing issues (it means hearing by both of ears), a human hearing ability to locate position of a sound source at three-dimensional space and parameters that affect this ability. In the second part, it focuses on psychoacoustic experiment, its main features and errors that can occur and which affect a results credibility. Method of pair comparisons is described more closely here. The last part of this thesis describes a technical solution of experiment in Java environment. The application should have a graphical interface and should be able to register a new user and perform a psychoacoustical experiment. The process of experiment is following. The aplication selects a random position of defined virtual sound source on the defined range and it plays this sound into headphones on button click. The users task is to set the application controls, representing a virtual sound source position, that way, where the user hear the sound come from. On another button click the application plays the same sound, but this sound comes from application controls set position (set by user). User compares this pair of sounds and modifies the position of second sound source until these two positions are same. The application stores these results for later processing on another button click. Principles of generating testing sound sources (sine wave, narrowband noise and sound file with wav extension) and their 3D positioning by measured head model impulse responses correlation are described thereinafter. An ability of human hearing system to locate a virtual sound source in dependence on sound parameters is discussed in conclusion.
APA, Harvard, Vancouver, ISO, and other styles
17

Cumming, John Freeman IV. "The Ability of Hamsters (Mesocricetus auratus) to Use the Binaural Phase Cue to Localize Sound." University of Toledo / OhioLINK, 2019. http://rave.ohiolink.edu/etdc/view?acc_num=toledo156475592795121.

Full text
APA, Harvard, Vancouver, ISO, and other styles
18

Mouba, Ndjila Joan. "Manipulations spatiales de sons spectraux." Thesis, Bordeaux 1, 2009. http://www.theses.fr/2009BOR13869/document.

Full text
Abstract:
Dans les applications d'écoute active, il est primordial de pouvoir interagir avec les sources individuelles présentes dans le mix, par exemple en changeant leur position spatiale. Dans cette thèse, nous avons proposé des techniques binaurales pour la localisation et la spatialisation, basées sur les différences interaurales en amplitude et en temps d'arrivée. Les techniques sont développées dans le plan temps-fréquence. Elles permettent de localiser et de projeter toute source dans l'espace environnant un auditeur. aussi nous avons mis au point des techniques de séparation binaurale de source basées sur le Maximum de vraisemblance et de masques spatiaux probabilistes. Enfin nous avons étendu les techniques binaurales à des techniques multi-diffusion utilisant un ensemble de haut-parleurs. Les techniques proposées sont éprouvées et comparées à des techniques de référence de la littérature. Pour des performances similaires aux techniques existantes, nos propositions ont un avantage significatif en terme de complexité qui les rendent appropriées aux applications temps-réel
In active listening applications, it is important to be able to interact with individual sources present in the mix, for example by changing their spatial position. In this thesis, we proposed techniques for binaural localization and spatialization, based on interaural differences in amplitude and in time of arrival. The techniques are developed in the time-frequency plane. They can locate and project sources in the space surrounding a listener. We also developed binaural source separation methods based on the Maximum Likelihood and on spatial probabilistic masks. Finally, we extended binaural spatialization techniques to multi-diffusion techniques which use a set of speakers for diffusion. The proposed techniques are tested and compared to referenced, well-known techniques. For similar performance with the existing ones, our proposed techniques highlight complexity advantages and are suitable for real-time applications
APA, Harvard, Vancouver, ISO, and other styles
19

Dabak, Anand Ganesh. "Binaural localization using interaural cues." Thesis, 1990. http://hdl.handle.net/1911/13424.

Full text
Abstract:
Major nuclei of the superior olivary complex--the lateral superior olive (LSO) and the medial superior olive (MSO) are presumed to play a major role in the localization of sound signals using interaural level and interaural phase differences between the signals arriving at the two ears. The present work develops a novel approach--function based modeling--for assessing the role of these nuclei in binaural localization. The interaural level difference is shown to be the sufficient statistic at high frequencies when only level cues are available. This level difference is processed optimally when the inputs are excitatory from one ear and inhibitory from the other ear. Response characteristics of LSO single units are remarkably similar to the optimal processor's, strongly supporting the notion that LSO units are intimately involved in high-frequency binaural hearing. For low frequencies the optimal processor makes use of the interaural phase difference cue by correlating the inputs to the two ears thus requiring that the two inputs be excitatory. Hence, high and low frequency localization systems are shown to differ greatly, suggesting separate pathways for each.
APA, Harvard, Vancouver, ISO, and other styles
20

Reid, Gregory Lawrence. "Active binaural sound localization techniques, experiments and comparisons." 1999. http://wwwlib.umi.com/cr/yorku/fullcit?pMQ39225.

Full text
Abstract:
Thesis (M. Sc.)--York University, 1999. Graduate Programme in Computer Science.
Typescript. Includes bibliographical references (leaves 65-66). Also available on the Internet. MODE OF ACCESS via web browser by entering the following URL: http://wwwlib.umi.com/cr/yorku/fullcit?pMQ39225.
APA, Harvard, Vancouver, ISO, and other styles
21

Pan, Yu-Kai, and 潘郁凱. "Humanoid Binaural Auditory Localization of Single Sound Source." Thesis, 2017. http://ndltd.ncl.edu.tw/handle/jrt25k.

Full text
Abstract:
碩士
國立臺灣大學
電信工程學研究所
105
In this thesis, we use the binaural-microphone technique to process the sound source localization, and apply the Head-related Transfer Functions (HRTFs) to generate sound signals for the left and right ears in an anechoic environment. The auditory sound source localization system is bio-inspired structures and mechanisms such as the tonotopic organization and the biological nervous system operating principle. The Auditory Peripheral Module in the IPEM Toolbox of the MATLAB library is then applied to simulate the cochlear and convert the sound signal into the neural pulse rate. After the cochlear model, Nengo, a software for simulating neural systems, is applied to simulate Medial Superior Olive (MSO) and Lateral Superior Olive (LSO) for computing ITD and ILD, respectively. The Inferior Colliculus (IC) is finally added to estimate the location of the sound source by integrating the outputs of MSO and LSO. Results of numerical experiments show 82 % accuracy under anechoic environment, but the system becomes not so accurate if echoes with sufficient delay are also received. Nevertheless, we believe that our system can achieve better accuracy, if the echo-processing in IC can be simulated, too. With such revisions, we believe that our system can be more adapted to realistic auditory environments.
APA, Harvard, Vancouver, ISO, and other styles
22

Dingle, Rachel Neville. "A three-channel model of human binaural sound localization." 2012. http://hdl.handle.net/10222/14550.

Full text
Abstract:
The most accepted model of mammalian binaural sound localization postulates two neural/perceptual channels with hemifield tuning and overlapping medial borders; the extent to which the two channels are co-activated by the source is the neural "code" for the source's azimuthal location. This model does not take into account physiological data on the existence of a population of cells with spatial receptive fields centered on the azimuthal midline. The following work sought to test the hypothesis that the mammalian binaural sound localization apparatus includes a third, midline-tuned channel. Ten experiments used a selective adaptation paradigm in human listeners to probe for the existence of a midline channel. Psychometric functions were obtained for lateral position based on ITD or ILD both before and after adaptation with high-frequency (2800 and 4200 Hz) or low-frequency (260 and 570 Hz) tones. Listeners experienced highly lateralized adaptor stimuli with different frequencies at each ear (asymmetrical adaptation), highly lateralized adaptor stimuli of the same frequency at each ear (symmetrical adaptation), and single frequency adaptation at the midline (central adaptation). At both high and low frequencies in the domains of both interaural time difference (ITD) and interaural level difference (ILD), location judgements after asymmetrical adaptation shifted away from the fatigued side. These shifts occurred across each adapted hemifield and extended slightly over the midline, as is consistent with the two-channel model. The two-channel model would predict no effect of symmetrical or central adaptation because fatiguing both lateral channels equally would not change their relative activation by a given source. In practice, the result of symmetrical adaptation was a shift in location judgements towards the midline as would be expected if adaptation of the lateral channels resulted in a greater relative contribution of a third, midline channel. Likewise, central adaptation tended to result in shifts in perceived location towards the sides. The evidence for the midline channel was strong for high and low frequencies localized by ILD, and was present for low frequencies, but not for high frequencies, localized by ITD.
APA, Harvard, Vancouver, ISO, and other styles
23

Liu, Wei-Han, and 劉維瀚. "Binaural room distribution pattern for nonstationary sound source localization." Thesis, 2007. http://ndltd.ncl.edu.tw/handle/87458499407437078703.

Full text
Abstract:
博士
國立交通大學
電機與控制工程系所
96
Nature sound sources are usually nonstationary and the real environment contains complex reverberations. Therefore, nonstationary sound source localization in a reverberant environment is an important research topic. This dissertation discusses the relationships between the nonstationarity of sound sources and the distribution patterns of interaural phase differences (IPDs) and interaural level differences (ILDs) based on short-term frequency analysis. The level fluctuation of nonstationary sound sources is modeled by the exponent of polynomials from the concept of moving pole model. According to this model, the sufficient condition for utilizing the distribution patterns of IPDs and ILDs to localize a nonstationary sound source is suggested and the phenomena of multiple peaks in the distribution pattern can be explained. Simulation is performed to verify the proposed analysis. Furthermore, a Gaussian-mixture binaural room distribution model (GMBRDM) is proposed to model distribution patterns of IPDs and ILDs for nonstationary sound source localization. The effectiveness and performance of the proposed GMBRDM are demonstrated by experimental results. The proposed nonstationary sound source localization algorithm is adopted for robot localization application. A novel and robust robot location and orientation detection method based on sound field features is proposed. Unlike conventional methods, the proposed method does not explicitly utilize the information of direct sound propagation path from sound source to microphones, nor attempt to suppress the reverberation and noise signals. Instead, the proposed method utilizes the sound field features obtained when the robot is at different location and orientation in an indoor environment. The experimental results show that the proposed method using only two microphones can detect robot’s location and orientation under both line-of-sight and non-line-of-sight cases and can be applied to both near-field and far-field conditions. Since this method can provide global location and orientation detection, it is suitable to fuse with other localization methods to provide initial conditions for reduction of the search effort, or to provide the compensation for localizing certain locations that cannot be detected using other localization methods.
APA, Harvard, Vancouver, ISO, and other styles
24

Chen, Chung-Yuan, and 陳重源. "Indoor Navigation for a Robot with Humanoid Binaural Auditory Localization." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/8n6pw4.

Full text
Abstract:
碩士
國立臺灣大學
電信工程學研究所
107
Here we propose a novel robot system which can navigate in indoor environments with spatial cues from the binaural sound. The system receives audio signals with a set of binaural microphones that has a pair of 3-D printed ear mockups for realistically acoustic effect. Afterwards, a series of neural models imitating human''s auditory pathway process the sound signal into neural signals of various stages, e.g., cochlea, superior olivary complex, inferior colliculus, and finally primary auditory cortex, extracting the ITD and ILD information of the sound. Meanwhile, the neural models for analog-front-end signal processing are simulated with various biologically plausible, realistic module or neural modeling tools, the IPEM toolbox and the Nengo simulator. The primary auditory cortex which is responsible for inferring the sound source azimuth is modeled with a supervised-learning deep neural network, with Keras and Tensorflow, to mimic the plasticity of the brain auditory cortex. Finally, a navigation planner generates goals based on the proposed intelligence levels and guide the robot base to explore the sound source. In the hardware aspect, the system is implemented on a Turtlebot3 base as the mobile robot and a GPU-accelerated PC for neural network simulations. The design, implementation details, and testing results are revealed, analyzed and discussed.
APA, Harvard, Vancouver, ISO, and other styles
25

Keyrouz, Fakheredine [Verfasser]. "Efficient binaural sound localization for humanoid robots and telepresence applications / Fakheredine Keyrouz." 2008. http://d-nb.info/989471241/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
26

Mullin, Amy Ruth. "Horizontal localization and hearing in noise ability in adults with sensorineural hearing loss using hearing aids with binaural processing." Thesis, 2010. http://hdl.handle.net/2152/ETD-UT-2010-05-729.

Full text
Abstract:
The purpose of the study was to determine whether hearing aids with binaural processing improve performance during a localization and a hearing in noise task. The study included 16 participants, ages 29 – 67, with bilateral, essentially symmetrical, sensorineural hearing loss who had no prior hearing aid experience. Participants were fit with Oticon Epoq XW hearing aids bilaterally and completed the localization and the hearing in noise task with three listening conditions: (1) without hearing aids (NO), (2) with hearing aids that were not linked (BIL), and (3) with hearing aids that were linked (BIN). For the localization task, 1.5 second pink noise bursts at 75 dB SPL were used as the stimulus. A 180° 11-speaker array was set up to the right or left side of the participants. A twelfth speaker on the contralateral side of the array introduced constant background pink noise at 65 dB SPL. Results revealed that participants performed the best with the NO condition, followed by BIL, then BIN. There was a significant difference between NO and BIL and NO and BIN. For the hearing in noise (HIN) task, sentences from the Hearing in Noise Test (HINT) were used as target stimuli. Continuous discourse by one male and two female talkers were used as maskers. There were four masker conditions for this task: (1) signal at 0°, masker at 90° (S0-N90), (2) signal at 0°, masker at 180° (S0-N180), (3) signal at 0°, masker at 270° (S0-N270), and (4) signal at 0°, maskers at 90°, 180°, and 270° (S0-N90, 180, 270). Results revealed that there was no significant difference between listening conditions when all masker conditions were considered. When the one-masker conditions were included, there was a significant difference between the NO and BIL and the NO and BIN conditions with the best performance for BIL, followed by BIN, then NO. Results also revealed a significant difference between masker conditions with the best performance for S0-N270, next best for S0-N90, followed by S0-N180, then S0-N90, 180, 270.
text
APA, Harvard, Vancouver, ISO, and other styles
27

Calmes, Laurent [Verfasser]. "Biologically inspired binaural sound source localization and tracking for mobile robots / vorgelegt von Laurent Calmes." 2009. http://d-nb.info/1000889858/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
28

Talagala, Dumidu Sanjaya. "Array signal processing algorithms for localization and equalization in complex acoustic channels." Phd thesis, 2013. http://hdl.handle.net/1885/11756.

Full text
Abstract:
The reproduction of realistic soundscapes in consumer electronic applications has been a driving force behind the development of spatial audio signal processing techniques. In order to accurately reproduce or decompose a particular spatial sound field, being able to exploit or estimate the effects of the acoustic environment becomes essential. This requires both an understanding of the source of the complexity in the acoustic channel (the acoustic path between a source and a receiver) and the ability to characterize its spatial attributes. In this thesis, we explore how to exploit or overcome the effects of the acoustic channel for sound source localization and sound field reproduction. The behaviour of a typical acoustic channel can be visualized as a transformation of its free field behaviour, due to scattering and reflections off the measurement apparatus and the surfaces in a room. These spatial effects can be modelled using the solutions to the acoustic wave equation, yet the physical nature of these scatterers typically results in complex behaviour with frequency. The first half of this thesis explores how to exploit this diversity in the frequency-domain for sound source localization, a concept that has not been considered previously. We first extract down-converted subband signals from the broadband audio signal, and collate these signals, such that the spatial diversity is retained. A signal model is then developed to exploit the channel's spatial information using a signal subspace approach. We show that this concept can be applied to multi-sensor arrays on complex-shaped rigid bodies as well as the special case of binaural localization. In both c! ases, an improvement in the closely spaced source resolution is demonstrated over traditional techniques, through simulations and experiments using a KEMAR manikin. The binaural analysis further indicates that the human localization performance in certain spatial regions is limited by the lack of spatial diversity, as suggested in perceptual experiments in the literature. Finally, the possibility of exploiting known inter-subband correlated sources (e.g., speech) for localization in under-determined systems is demonstrated. The second half of this thesis considers reverberation control, where reverberation is modelled as a superposition of sound fields created by a number of spatially distributed sources. We consider the mode/wave-domain description of the sound field, and propose modelling the reverberant modes as linear transformations of the desired sound field modes. This is a novel concept, as we consider each mode transformation to be independent of other modes. This model is then extended to sound field control, and used to derive the compensation signals required at the loudspeakers to equalize the reverberation. We show that estimating the reverberant channel and controlling the sound field now becomes a single adaptive filtering problem in the mode-domain, where the modes can be adapted independently. The performance of the proposed method is compared with existing adaptive and non-adaptive sound field control techniques through simulations. Finally, it is shown that an order of magnitude reduction in the computational complexity can be achieved, while maintaining comparable performance to existing adaptive control techniques.
APA, Harvard, Vancouver, ISO, and other styles
29

Tóth, Peter. "Modelování binaurálního slyšení." Doctoral thesis, 2020. http://www.nusl.cz/ntk/nusl-435793.

Full text
Abstract:
The central theme of this thesis is a description of information processing in the sound localization circuit of the auditory pathway. The focus is on principal neurons of the medial superior olive (MSO), the first major convergence point for binaural information. Selected properties and relations of MSO neurons are derived and expressed through models. In the thesis we present three modeling studies. The first one clarifies a relation- ship between biophysical parameters of the MSO neuron and its ability to detect coincidental spikes from the left and the right ear. The second study describes the statistical behavior of spike trains on the input and output of the MSO neuron. In the third work, we studied how interaural coherence could guide localization of sound sources in complex listening situations with multiple sound sources in reverberant environments. The main results are analytical and numerical models describing the aforemen- tioned relations and behaviors. Secondary results include that inhibitory input to the MSO neuron narrows and shifts the time range of coincidence detection, that ergodic assumption from statistical physics and circular statistics are beneficial in the description of spike trains in the auditory pathway, and that interaural level difference of parts of the signal with...
APA, Harvard, Vancouver, ISO, and other styles
30

Tóth, Peter. "Zpracování zvuku v emulátoru kochleárního implantátu." Master's thesis, 2011. http://www.nusl.cz/ntk/nusl-314007.

Full text
Abstract:
The time accuracy of the auditory neuronal pathway in its sound localization branch is high, compared to other sensory systems. The time differences in the sound arrival between the left and right ear are distinguished by the neural circuit in this branch. The accuracy achieved here is in the order of tens of microseconds. This phenomenon has not yet been definitively clarified. In this master thesis, a model of a neuron central to this neural circuit is presented. This neuron is called binaural (neuron of the two ears) and is located in the medial superior olive (MSO) neural nucleus. The properties of the MSO neuron are described. Specifically, the neuron acts as a coincidence detector, and this is necessary for the circuit functioning. Main result of the thesis is the theory explaining how the function of the coincidence detector can be described based on the interaction of the post-synaptic potentials on the spike-response model neuron. Generality and implications for the auditory pathway are then discussed.
APA, Harvard, Vancouver, ISO, and other styles
31

Aaronson, Neil L. "Speech-on-speech masking in a front-back dimension and analysis of binaural parameters in rooms using MLS methods." Diss., 2008.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
32

Li, Na 1980 Oct 2. "Binaural mechanism revealed with in vivo whole cell patch clamp recordings in the inferior colliculus." Thesis, 2010. http://hdl.handle.net/2152/ETD-UT-2010-12-2065.

Full text
Abstract:
Many cells in the inferior colliculus (IC) are excited by contralateral and inhibited by ipsilateral stimulation and are thought to be important for sound localization. These excitatory-inhibitory (EI) cells comprise a diverse group, even though they exhibit a common binaural response property. Previous extracellular studies proposed specific excitatory and/or inhibitory events that should be evoked by each ear and thereby generate each of the EI discharge properties. The proposals were inferences based on the well established response features of neurons in lower nuclei, the projections of those nuclei, their excitatory or inhibitory neurochemistry, and the changes in response features that occurred when inhibition was blocked. Here we recorded the inputs, the postsynaptic potentials, discharges evoked by monaural and binaural signals in EI cells with in vivo whole cell recordings from the inferior colliculus (IC) of awake bats. We also computed the excitatory and inhibitory synaptic conductances from the recorded sound evoked responses. First, we showed that a minority of EI cells either inherited their binaural property from a lower binaural nucleus or the EI property was created in the IC via inhibitory projections from the ipsilateral ear, features consistent with those observed in extracellular studies. Second, we showed that in a majority of EI cells ipsilateral signals evoked subthreshold EPSPs that behaved paradoxically in that EPSP amplitudes increased with intensity, even though binaural signals with the same ipsilateral intensities generated progressively greater spike suppressions. These ipsilateral EPSPs were unexpected since they could not have been detected with extracellular recordings. These additional responses suggested that the circuitry underlying EI cells was more complex than previously suggested. We also proposed the functional significance of ipsilaterally evoked EPSPs in responding to moving sound sources or multiple sounds. Third, by computing synaptic conductances, we showed the circuitry of the EI cells was even more complicated than those suggested by PSPs, and we also evaluated how the binaural property was produced by the contralateral and ipsilateral synaptic events.
text
APA, Harvard, Vancouver, ISO, and other styles
33

"The Value of Two Ears for Sound Source Localization and Speech Understanding in Complex Listening Environments: Two Cochlear Implants vs. Two Partially Hearing Ears and One Cochlear Implant." Doctoral diss., 2013. http://hdl.handle.net/2286/R.I.17809.

Full text
Abstract:
abstract: Two groups of cochlear implant (CI) listeners were tested for sound source localization and for speech recognition in complex listening environments. One group (n=11) wore bilateral CIs and, potentially, had access to interaural level difference (ILD) cues, but not interaural timing difference (ITD) cues. The second group (n=12) wore a single CI and had low-frequency, acoustic hearing in both the ear contralateral to the CI and in the implanted ear. These `hearing preservation' listeners, potentially, had access to ITD cues but not to ILD cues. At issue in this dissertation was the value of the two types of information about sound sources, ITDs and ILDs, for localization and for speech perception when speech and noise sources were separated in space. For Experiment 1, normal hearing (NH) listeners and the two groups of CI listeners were tested for sound source localization using a 13 loudspeaker array. For the NH listeners, the mean RMS error for localization was 7 degrees, for the bilateral CI listeners, 20 degrees, and for the hearing preservation listeners, 23 degrees. The scores for the two CI groups did not differ significantly. Thus, both CI groups showed equivalent, but poorer than normal, localization. This outcome using the filtered noise bands for the normal hearing listeners, suggests ILD and ITD cues can support equivalent levels of localization. For Experiment 2, the two groups of CI listeners were tested for speech recognition in noise when the noise sources and targets were spatially separated in a simulated `restaurant' environment and in two versions of a `cocktail party' environment. At issue was whether either CI group would show benefits from binaural hearing, i.e., better performance when the noise and targets were separated in space. Neither of the CI groups showed spatial release from masking. However, both groups showed a significant binaural advantage (a combination of squelch and summation), which also maintained separation of the target and noise, indicating the presence of some binaural processing or `unmasking' of speech in noise. Finally, localization ability in Experiment 1 was not correlated with binaural advantage in Experiment 2.
Dissertation/Thesis
Ph.D. Speech and Hearing Science 2013
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography