Dissertations / Theses on the topic 'Stereo audio'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 27 dissertations / theses for your research on the topic 'Stereo audio.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Craig, Shelley. "Stereo audio for television : practical problems in audio post-production techniques." Thesis, McGill University, 1987. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=63957.
Full textBrown, Tim. "PIC controlled two-band stereo audio equalizer." Click here to view, 2009. http://digitalcommons.calpoly.edu/eesp/14/.
Full textProject advisor: Dennis Derickson. Title from PDF title page; viewed on Feb. 4, 2010. Includes bibliographical references. Also available on microfiche.
Konečný, Jiří. "Návrh stereo audio koncového zesilovače spínané třídy." Master's thesis, Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií, 2013. http://www.nusl.cz/ntk/nusl-220270.
Full textCapobianco, Julien. "Codage audio stéréo avancé." Thesis, Paris 6, 2015. http://www.theses.fr/2015PA066712/document.
Full textDuring the last ten years, technics for joint coding exploiting relations and redundancies between channels have been developped in order to further reduce the amount of information needed to represent multichannel audio signals.In this document, we focus on the coding of stereo audio signals where prior informations on the nature of sources in presence, their number or the manner they are spatialized is unknown. Such signals are actually the most representative in commercial records of music industry and in multimedia entertainment in general. To address the coding problematic of these signals, we study parametric and signal approaches, where both of them are often mixed.In this context, three types of approaches are used. The spatial parametric approach reduce the number of audio channels of the signal to encode and recreate the original number of channels from reduced channels and spatial parameters extracted from original channels. The signal approach keep the original number of channels, but encode mono signals, built from the combination of the original ones and containing less redundancies. Finally, the hybrid approach introduced in the MPEG USAC standard keep the two channels of a stereo signal, but one is a mono downmix and the other is a residual signal, resulting from a prediction on the downmix, where prediction parameters are encoded as side information.In this document, we first analyse the characteristics of a stereo audio signal coming from a commercial recording and the associated production techniques. This study lead us to consider the relations between the emitter parametric models, elaborated from our analysis of commercial recording production techniques, and the receiver models which are the basis of spatial parametric coding. In the light of these considerations, we present and study the three approaches mentioned earlier. For the parametric approach, we show that transparency cannot be achieved for most of the stereo audio signals, we have a reflection on parametric representations and we propose techniques to improve the audio quality and further reduce the bitrate of their parameters. These improvements are obtained by applying a better segmentation on the signal, based on the significant transient, by exploiting perceptive characteristics of some spatial cues and by adapting the estimation of spatial cues. As the hybrid approach has been recently standardized in MPEG USAC, we propose a full review of it, then we develop a new coding technique to optimize the allocation of the residual bands when the residual is not used on the whole bandwidth of the signal to encode. In the conclusion, we discuss about the future of the general spatial audio coding and we show the importance of developping new technics of segmentation and classification for audio signals to further adapt the coding to the content of the signal
Fan, Yun-Hui. "A stereo audio coder with a nearly constant signal-to-noise ratio." Diss., Georgia Institute of Technology, 2001. http://hdl.handle.net/1853/14788.
Full textWidman, Ludvig. "Binaural versus Stereo Audio in Navigation in a 3D Game: Differences in Perception and Localization of Sound." Thesis, Luleå tekniska universitet, Institutionen för ekonomi, teknik, konst och samhälle, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-85512.
Full textLapierre, Jimmy. "Approches paramétriques pour le codage audio multicanal." Mémoire, Université de Sherbrooke, 2007. http://savoirs.usherbrooke.ca/handle/11143/1355.
Full textMathews, Abraham. "Smart Home Based Li-Fi System : Stereo Audio & Image Streaming by Visible light." Thesis, Mittuniversitetet, Avdelningen för elektronikkonstruktion, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-32835.
Full textLi, Beinan. "Optical audio reproduction for stereo phonograph records by using white-light interferometry and image processing." Thesis, McGill University, 2011. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=103586.
Full textCette thèse présente une nouvelle approche de reproduction optique d'enregistrements phonographiques stéréo. L'enregistrement phonographique s'est imposé, vers la fin du XIXème siècle, comme la technologie d'enregistrement de référence partout dans le monde. Il existe donc une pléthore de cylindres et autres disques où ont été gravés discours, morceaux de musique, et autres artefacts culturel sonores. La préservation de ces enregistrements sonores phonographiques est donc une préoccupation mondiale. Le présent travail de recherche propose une approche alternative de numérisation des enregistrements phonographiques stéréo en vue de leur éventuelle préservation. En effet, à partir de l'acquisition optique du profil (en trois dimensions) de la surface d'enregistrement du disque, les signaux audio peuvent être reconstruits grâce à nos algorithmes d'analyse d'images. Cette thèse examine les étapes de la reproduction optique audio stéréo à partir d'enregistrements phonographiques sur disques stéréo en utilisant l'interférométrie en lumière blanche. Ces étapes comportent: l'acquisition du profil de la surface d'enregistrement d'un disque 3D en utilisant un microscope commercial interférométrique en lumière blanche ; l'extraction des ondulations du sillon, qui encode l'information audio stéréo en utilisant nos algorithmes de traitement d'images ; et finalement, la reproduction du signal audio stéréo depuis les ondulations du sillon par des techniques de traitement du signal. Le processus complet est évalué sur un enregistrement stéréo test comprenant des signaux sinusoïdaux et un enregistrement musical. La qualité de l'audio reproduit par voie optique est évaluée de façon quantitative et comparée avec celle de l'audio numérisé de manière « traditionnelle », à l'aide d'une platine. Cette thèse s'articule en trois parties. La première comporte une introduction des principes nécessaires à la reproduction d'enregistrements phonographiques stéréo par voie optique. Plus précisément, les principes de la technologie d'enregistrement phonographique sont passés en revue ; l'état de l'art des efforts de reproduction optique des enregistrements phonographiques sur disques et cylindres est présenté ; et enfin, les techniques optiques pertinentes incluant l'interférométrie en lumière blanche sont décrites. La deuxième partie livre une présentation détaillée du processus de reproduction optique que nous avons développé. Dans la troisième partie, l'évaluation quantitative de la qualité de la restitution du signal audio obtenue par notre procédé est aussi décrite. La thèse se conclue sur un bilan des défis et des directions possibles dans le futur développement de notre approche de reproduction des signaux audio par voie optique.
Bergqvist, Emil. "Auditory displays : A study in effectiveness between binaural and stereo audio to support interface navigation." Thesis, Högskolan i Skövde, Institutionen för informationsteknologi, 2014. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-10072.
Full textJansson, Tomas. "Stereo coding for the ITU-T G.719 codec." Thesis, Uppsala universitet, Signaler och System, 2011. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-153636.
Full textHoang, Thi Minh Nguyet. "New techniques of scalable speech/audio coding for conversational applications : model-based bitplane coding and stereo extension of ITU-T G. 722." Rennes 1, 2011. http://www.theses.fr/2011REN1E003.
Full textBosch, Vicente Juan José. "From heuristics-based to data-driven audio melody extraction." Doctoral thesis, Universitat Pompeu Fabra, 2017. http://hdl.handle.net/10803/404678.
Full textLa identificación de la melodía en una grabación musical es una tarea relativamente fácil para seres humanos, pero muy difícil para sistemas computacionales. Esta tarea se conoce como "extracción de melodía", más formalmente definida como la estimación automática de la secuencia de alturas correspondientes a la melodía de una grabación de música polifónica. Esta tesis investiga los beneficios de utilizar conocimiento derivado automáticamente de datos para extracción de melodía, combinando procesado digital de la señal y métodos de aprendizaje automático. Ampliamos el alcance de la investigación en este campo, al trabajar con un conjunto de datos variado y múltiples definiciones de melodía. En primer lugar presentamos un extenso análisis comparativo del estado de la cuestión y realizamos una evaluación en un contexto de música sinfónica. A continuación, proponemos métodos de extracción de melodía basados en modelos de fuente-filtro y la caracterización de contornos tonales, y los evaluamos en varios géneros musicales. Finalmente, investigamos la caracterización de contornos con información de timbre, tonalidad y posición espacial, y proponemos un método para la estimación de múltiples líneas melódicas. La combinación de enfoques supervisados y no supervisados lleva a mejoras en la extracción de melodía y muestra un camino prometedor para futuras investigaciones y aplicaciones.
Van, Dyne Steven R. "Case Studies in Classical Location Recording Using Improvised Techniques." Ohio University Honors Tutorial College / OhioLINK, 2015. http://rave.ohiolink.edu/etdc/view?acc_num=ouhonors1429807114.
Full textWennerberg, Daniel. "Auditory immersion and the believability of a first-person perspective in computer games : Do players have a preference between mono and stereo foley, and is one perceived as more believable?" Thesis, Luleå tekniska universitet, Medier, ljudteknik och teater, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-73985.
Full textLindmark, Isak. "Mediacentral för skogsmaskiner : Ny konstruktion för Komatsu Forest AB." Thesis, Umeå universitet, Institutionen för tillämpad fysik och elektronik, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-136376.
Full textKomatsu Forest AB behöver en multimedialösning för att uppfylla specifika behov exempelvis simultant användande av två mobiltelefoner och två mikrofoner. Enheten ska klara av att hantera ljud från flertalet ljudkällor samt två mikrofoner och aktiv bullerreducering. Systemet ska styras digitalt från till exempel en PC. Målet är att presentera ett koncept som erbjuder en lösning för dessa problem.Utformningen av enheten har gjorts genom att utvärdera användarens bruksområden tillsammans med den vision som legat till grund för projektet. Detta har genom samtal med anställda på företaget medfört en sammanställning över förarens användnings-områden. Utifrån denna sammanställning av användningsområdena har en teknisk skiss för enhetens uppbyggnad möjliggjorts. Efter att funktionen för skissen har verifierats med simuleringar har ett prototypkort för enhetens grundstomme konstrueras, programmerats samt verifierat konstruktionen.Simuleringar och mätningar verifierar att konstruktionen fungerar som önskat. Resultatet visar att enheten skulle uppfylla de uppsatta önskemålen på ett bra och användarvänligt sätt. Då flera lösningar har tagits fram under projektets gång diskuteras för och nackdelar med dessa samt motiveras varför den valda lösningen är bäst anpassad.
Cheng, Fan-Yu, and 鄭凡寓. "Spatial Localization Evaluation System for Parametric Stereo Audio." Thesis, 2008. http://ndltd.ncl.edu.tw/handle/2z7s7a.
Full text國立臺北科技大學
資訊工程系研究所
96
With the growing use of portable devices, the need for wireless connectivity on portable devices is also increasing. Due to the limitation of bandwidth and storage on portable devices, many low bit-rate audio compression methods have been proposed. The amendment of ISO standard MPEG-4 part 3 Advanced Audio Coding (AAC) published in 2001, includes two modules, Spectral Band Replication (SBR) and Parametric Stereo (PS), for enhancing the audio quality at low bit-rates. But according to some listening test reports, stereo music compressed with PS module may blur the spatial sound localization. Therefore, we want a tool to determine whether a piece of music may suffer this problem. In this thesis, we implement a spatial localization quality evaluation system. By giving original audio and compressed audio, the system is able to evaluate the spatial accuracy of the compressed audio. We can give an objective score of spatial quality for different compression methods with software.
Tong, Run-yu, and 童閏煜. "Low Complexity Decoding in Parametric Stereo Audio Coding Scheme." Thesis, 2010. http://ndltd.ncl.edu.tw/handle/81814540453729650323.
Full text國立中央大學
通訊工程研究所
98
The Parametric Stereo (PS) audio coding is an audio coding object of High Efficiency Advanced Audio Coding version 2 (HE-AAC v2) which was standardized by ISO/MPEG in 2004. Traditional audio codec, e.g. MP3 or AAC, utilize “Psycoaustic Model” and “Masking Effect” to achieve high compression efficiency. However, they mainly process the signal with single channel. Different from traditional audio codec, the PS audio coding incorporates the characteristics of two channels, to extract spatial parameters and to down-mixes stereo signals into a mono signal. The PS can save almost half data size which provides great help in storage and transmission. Nevertheless, the complexity of PS decoder is nearly twice larger than that of PS encoder, which causes a serious problem in implementing PS on portable devices. Therefore, this thesis proposes a modified PS coding scheme to reduce the complexity of decoder. The encoder extracts and transmits the additional residual parameters from the residual signal and the mono signal. On the contrary, the decoder reconstructs the residual signal by the mono signal and the transmitted residual parameters. In addition, we detect the existence of transient signal and measure the artifact of reconstructed residual signal. Finally, “Energy compesated algorithm” is proposed to reduce the artifact produced by the transient signal. The proposed scheme can improve the Objective Difference Grade (ODG) of audio quality measurement “EAQUAL” with 0.6 score. Combining with audio coder AAC, the modified PS coding scheme still maintains a good performance at low coding bitrates.
Tseng, Hui-Yu, and 曾惠虞. "Extracting and Modifying the Spatial Information in Stereo Audio." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/27468502775850926782.
Full text大同大學
資訊工程學系(所)
94
In this thesis, the method to extract the spatial information and single representing source of original sound field in stereo, and then synthesis them as demanded are proposed. The objective is to synthesize appropriate sound field corresponding to vary listening condition. The discussed situation is focused on multi-sources playing the same melody by the same music instrument aligned in line. Since each source plays the same melody, the same music scale would be played on the sector in time. Human perception is insensitive to the phase of audio. So we might assume that the magnitudes of spectrogram of each source is similar even their waveforms are different. Therefore, the signal received by microphone could be treated as the summation of one spectrogram with shifts in time and attenuation. It is similar to an image corrupted by a motion blur function. Thus, the concept of image-restoration may be applied to extract the spatial information and single representing source by which the property of time-frequency components of each original source could be represented. The sound field similar to original sound field can be synthesized using the extracted single representing source and the obtained spatial information. Also the spatial information can be modified to synthesize the different sound field for different playback conditions in pleasure. The simulation is performed to confirm the method in this thesis. And the result shows that the concept of image distortion/restoration process with sound spectrogram could be applied to the spatial information extraction and sound field resynthesis. There will be certain compression effects with applying the concept of decomposing and re-synthesizing in this thesis with multi-channel processing in the future.
Tsai, Hsian-Ming, and 蔡憲銘. "A Study of Stereo Audio Coding Using Wavelet Transform Based Technique." Thesis, 1999. http://ndltd.ncl.edu.tw/handle/14125722647705614500.
Full text國立清華大學
電機工程學系
87
During the last decades, storage and transmission of high quality digital audio are becoming more and more important in, for example, digital audio broadcasting (DAB) and high definition television (HDTV). However, the required storage of high quality digital audio is usually massive. As for the compact disc signals (sampling rate of 44.1 kHz and 16 bits/sample), the bitrate requirement is 44100 * 16 =705.6 kb/s per channel and 1.41 Mb/s for stereo audio, which is too high for most applications. Thus we must develop techniques to reduce the bitrate requirement. In this thesis, the perceptual audio coder is reviewed and a coder based on wavelet packet transform is proposed. The distinct points about the proposed coder include: (1) it uses the wavelet packet transform rather than conventional Fourier transform or discrete cosine transform to exploit the capability of wavelet in, for example, treating nonstationary signals. (2) It tries to remove the redundancies between left and right channels in stereo audio to further increase the compression rate. Several wavelets are simulated and compared for audio compression in our experiments. From the experiments, the symmetric wavelet has the best effect for the coder. Different types of audio signals are also experimented in this study. An FFT based perceptual audio coder is also implemented for performance comparison.
Lee, ChiaHsing, and 李佳興. "Bit Allocation for Stereo Audio Coding in MPEG - 1 Layer III." Thesis, 2001. http://ndltd.ncl.edu.tw/handle/36923743185503113347.
Full text國立交通大學
資訊工程系
89
Two methods for allocating bits to each channel in 3 stereo coding mode, and parameter design of single loop bit allocation is presented in this thesis. Two methods are proposed based on the criterion that optimize the bit allocation result of each channel. One using combine partial bit allocation process to keep quality of two channels equal. Another estimating bit requirement of each channel by energy ratio of two channels. In stereo mode of MPEG - 1 Layer III, with parameter design of single loop bit allocation, better quality is achieved. In other two coding mode of joint stereo, with modifying of masking ability, 0.5 ~ 2db quality improvement is achieved.
Lai, Wei-Chou, and 賴韋州. "Stereo Audio Steganography by Inserting Low-frequency and Octave Equivalent Pure Tones." Thesis, 2013. http://ndltd.ncl.edu.tw/handle/09641543960914960452.
Full text國立臺灣大學
電機工程學研究所
101
Audio steganography is a technique that hiding messages into audio such that no one except the sender and intended recipient suspects the existence of the messages. In this paper we proposed a new method for stereo audio steganography, which employs some characteristics of human auditory system (HAS). Messages are embedded by inserting low-frequency and octave-equivalent pure tones into different channels. By comparing the frequency domain data of left channel and right channel, the hidden messages are detected. The experiment results demonstrate that comparing with the host audio, the quality of the message-hided audio generated by our method are nearly not decreased, thus malicious attackers will not perceive the hidden messages.
Liu, Jen-Chang, and 劉震昌. "The Stereo Audio Coding in the Framework of MPEG1 Layer I, II." Thesis, 1996. http://ndltd.ncl.edu.tw/handle/16573177512286166460.
Full text國立交通大學
資訊工程學系
84
The purpose of stereo audio signal coding is to reduce the required bit rate, while maintaining the signal quality after decoding. The ISO MPEG1 is the most widely used audio compression standard in many commercial applications. Among the vast commercial products, MPEG1 layers I and II coding processes are most widely adopted. MPEG1 layer II can achieve a transparent audio quality above 2x128 kbits/s by independent coding of the left and the right channels. With the use of joint stereo coding technique, such as intensity stereo coding in MPEG1, the decoded audio quality can be improved for the bit rate lower than 2x128 kbits/s. In this thesis, we analyze the data redundancy of stereo audio signals. The Karhunen-Loeve (KL) transform and inter-channel prediction methods are applied to exploit and analyze the data redundancy in the framework of MPEG1 layers I and II. On the KL transform, we propose two modified intensity stereo coding algorithms for MPEG1 layers I and II by KL transform to further improve the decoded stereo audio quality at bit rate below 2x128 kbits/s. Subjective and objective measurements show that the two algorithms have better stereo audio quality than the original MPEG1 method. On the inter-channel prediction, we consider the coding gains along with various parameters such as prediction order, prediction delay, time varying property, the required side information, etc.. The experiment results suggest the applying of inter- channel prediction in the low frequency bands, and transmission of the prediction coefficients once for longer frames to avoid the side information overhead.
Shih, Geng-Yu, and 施畊宇. "Implementation of a 3D Audio Module with Up/Downmix and CCS for Two-channel Stereo Loudspeakers." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/45036398129590730744.
Full text國立交通大學
機械工程系所
94
This dissertation focuses on the 3D audio reproduction for two-channel stereo loudspeakers. To create spatial impression during audio reproduction, the head-related transfer function (HRTF) and the crosstalk cancellation system (CCS) are key elements in many audio spatializers. Two deconvolution approaches, the frequency-domain method and the time-domain method, are employed to design the required inverse filters. Different approaches to design audio spatializers with the HRTF, CCS, and their combination are compared. Issues in the implementation phase such as regularization, complex smoothing and structures of inverse filters are also addressed. In particular, two modified CCS approaches are proposed. In addition, upmix and downmix processing plays an important role in many audio applications, where the number of channels of either the audio content or the reproducing loudspeakers are limited. Various upmix and downmix methods are presented in this work. A comprehensive study is undertaken, by comparing a variety of the proposed approaches, to find guidelines and strategies for 3D audio technique to meet the ever increasing needs of multichannel stereophonic systems. All proposed methods were examined both objectively and subjectively, and had been proven to be effective in 3D audio reproduction.
Liao, Jian-Sheng, and 廖建昇. "16-Bit Dual-Channel Digital Pulse Width Modulator for Hi-Fi Stereo Class-D Audio Amplifier with 8x Oversampling." Thesis, 2013. http://ndltd.ncl.edu.tw/handle/80483783251883395024.
Full text國立臺灣科技大學
電子工程系
101
The digital pulse width modulation (DPWM) has been widely applied to power management IC, motor speed controller, LED driver, and Class-D amplifier. This thesis presents a dual-channel, high-resolution digital pulse width modulation (DPWM) applicable to Class-D audio amplifiers based on the specifications, operating frequency and resolution for CD audio. The DPWM incorporates ring oscillator along with counter for coarse duty adjustment, phase slection for medium duty adjustment and phase interpolation for fine duty tuning. The major advantages of the proposed structure are low cost, high resolution and monotonicity. The proposed DPWM chip is fabricated in a TSMC 0.18μm 1P6M standard CMOS process with a core size of merely 0.059 mm2. It is measured to function well within 263KHz ~ 418KHz operation frequency range. The resolution is 16-bit and the equivalent timing resolution is 43.25ps at 1.8V supply voltage. The power consumption is 39.6mW at 352.8 KHz and the integral nonlinearity is measured to be as small as -0.83~+0.84 LSB. The adjustable duty cycle ranges from 0 to 100%.
Stirnat, Claudia, and Tim Ziemer. "Spaciousness in Music: The Tonmeister’s Intention and the Listener’s Perception." 2019. https://slub.qucosa.de/id/qucosa%3A70634.
Full textMoore, Melanie 1989. "National Beef Quality Audit-2011: In-Plant Survey of Targeted Carcass Characteristics Related to Quality, Quantity, Value, and Marketing of Fed Steers and Heifers." Thesis, 2012. http://hdl.handle.net/1969.1/148397.
Full text