Accedi

Bibliografie tematiche / Near-End listening enhancement

Letteratura scientifica selezionata sul tema "Near-End listening enhancement"

Autore: Grafiati

Pubblicato: 1 giugno 2024

Cita una fonte nei formati APA, MLA, Chicago, Harvard e in molti altri stili

Scegli il tipo di fonte:

Consulta la lista di attuali articoli, libri, tesi, atti di convegni e altre fonti scientifiche attinenti al tema "Near-End listening enhancement".

Accanto a ogni fonte nell'elenco di riferimenti c'è un pulsante "Aggiungi alla bibliografia". Premilo e genereremo automaticamente la citazione bibliografica dell'opera scelta nello stile citazionale di cui hai bisogno: APA, MLA, Harvard, Chicago, Vancouver ecc.

Puoi anche scaricare il testo completo della pubblicazione scientifica nel formato .pdf e leggere online l'abstract (il sommario) dell'opera se è presente nei metadati.

Indice

Articoli di riviste
Tesi
Capitoli di libri
Atti di convegni

Articoli di riviste sul tema "Near-End listening enhancement":

1

Taal, Cees H., Jesper Jensen e Arne Leijon. "On Optimal Linear Filtering of Speech for Near-End Listening Enhancement". IEEE Signal Processing Letters 20, n. 3 (marzo 2013): 225–28. http://dx.doi.org/10.1109/lsp.2013.2240297.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

2

Rennies, J., A. Pusch, H. Schepker e S. Doclo. "Evaluation of a near-end listening enhancement algorithm by combined speech intelligibility and listening effort measurements". Journal of the Acoustical Society of America 144, n. 4 (ottobre 2018): EL315—EL321. http://dx.doi.org/10.1121/1.5064956.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

3

Rennies, Jan, Henning Schepker, David Huelsmeier, Jakob H. Drefs e Simon Doclo. "Evaluating near-end listening enhancement in noise for normal-hearing and hearing-impaired listeners". Journal of the Acoustical Society of America 141, n. 5 (maggio 2017): 4023. http://dx.doi.org/10.1121/1.4989261.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

4

Li, Gang, Ruimin Hu, Xiaochen Wang e Rui Zhang. "A near-end listening enhancement system by RNN-based noise cancellation and speech modification". Multimedia Tools and Applications 78, n. 11 (5 dicembre 2018): 15483–505. http://dx.doi.org/10.1007/s11042-018-6947-8.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

5

Rennies, Jan, Jakob Drefs, David Hülsmeier, Henning Schepker e Simon Doclo. "Extension and evaluation of a near-end listening enhancement algorithm for listeners with normal and impaired hearing". Journal of the Acoustical Society of America 141, n. 4 (aprile 2017): 2526–37. http://dx.doi.org/10.1121/1.4979591.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

6

Fallah, Ali, e Steven van de Par. "A Speech Preprocessing Method Based on Perceptually Optimized Envelope Processing to Increase Intelligibility in Reverberant Environments". Applied Sciences 11, n. 22 (15 novembre 2021): 10788. http://dx.doi.org/10.3390/app112210788.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

Abstract (sommario):

Speech intelligibility in public places can be degraded by the environmental noise and reverberation. In this study, a new near-end listening enhancement (NELE) approach is proposed in which using a time varying filter jointly enhances the onsets and reduces the overlap masking. For optimization, some look-ahead in clean speech and prior knowledge of room impulse response (RIR) are required. In this method, by optimizing a defined cost function, the Spectro-Temporal Envelope of reverb speech is optimized to be as close as possible to that of clean speech. In this cost function, onsets of speech are optimized with increased weight. This approach is different from overlap-masking ratio (OMR) and speech enhancement (OE) approaches (Grosse, van de Par, 2017, J. Audio Eng. Soc., Vol. 65 (1/2), pp. 31–41) that only consider previous frames in each time slot for determining the time variant filtering. The SRT measurements show that the new optimization framework enhances the speech intelligibility up to 2 dB more that OE.

7

Fuglsig, Andreas Jonas, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof e Jan Østergaard. "Minimum Processing Near-end Listening Enhancement". IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, 1–13. http://dx.doi.org/10.1109/taslp.2023.3282094.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

8

Celkan, Prof Dr Gul. "From the Editor". New Trends and Issues Proceedings on Humanities and Social Sciences 2, n. 3 (7 dicembre 2016). http://dx.doi.org/10.18844/prosoc.v2i3.1244.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

Abstract (sommario):

EditorialIt is the great honor for us to edit proceedings of â€œ2nd Global Conference onÂ Contemporary Issues in Educationâ€¨(GLOBE-EDU 2015)â€ August 27-28,Â The University of Chicago Chicago, USA. This privileged scientific event has contributed to the field of educational research for two years.Â As the guest editors of this issue, we are glad to see variety of articles focusing on arts education, Adult Education, Competitive Skills, Continuing Education, Higher Education, Vocational Education, Transferring Disciplines, Business Education, Educational Administration, Human Resource Development, Academic Advising and Counselling, Education Policy and Leadership, Industrial Cooperation, Life-long Learning Experiences, Workplace Learning and Collaborative Learning, Work Employability, Educational Institution Government Partnership, Patent Registration and Technology Transfer, University Spin-Off Companies, Course Management, Accreditation and Quality Assurance, Academic Experiences and Best Practice Contributions, Copy-right, Digital Libraries and Repositories, Digital Rights Management, Evaluation and Assessment, E-content Management and Development, Open Content, e-Portfolios, Grading Methods, Knowledge Management, Quality processes at National and International level, Security and Data Protection, Student Selection Criteria in Interdisciplinary Studies, User-Generated Content, Curriculum, Research and Development, Acoustics in Education Environment, APD/Listening, Counsellor Education, Courses, Tutorials and Labs, Curriculum Design, ESL/TESL, Special Area Education, Early Childhood Education, Elementary Education, Geographical Education, Health Education, Home Education, Mathematics Education, Rural Education, Science Education, Secondary Education, Second life Educators, Social Studies Education, Special Education, Learning / Teaching Methodologies and Assessment, Simulated Communities and Online Mentoring, e-Testing and new Test Theories, Supervising and Managing Student Projects, Pedagogy Enhancement with e-Learning, Educating the Educators, Immersive Learning, Blended Learning, Computer-Aided Assessment, Metrics and Performance Measurement, Assessment Software Tools, Assessment Methods in Blended Learning Environments, Global Issues In Education and Research, Education, Research and Globalization, Barriers to Learning (ethnicity, age, psychosocial factors, â€¦), Women and Minorities in Science and Technology, Indigenous and Diversity Issues, Government Policy issues, Organizational, Legal and Financial Aspects, Digital Divide, Increasing Affordability and Access to the Internet, Ethical issues in Education, Intellectual Property Rights and Plagiarism, Pedagogy, Teacher Education, Cross-disciplinary areas of Education, Educational Psychology, Education practice trends and issues, Indigenous Education, Kinesiology and Leisure Science, K12, Life-long Learning Education, Mathematics Education, Physical Education (PE), Reading Education, Religion and Education Studies Research Management, Research Methodologies,Academic Research Projects, Joint-research programmes, Research on Technology in Education, Research Centres, Links between Education and Research, New Challenges in Education, ECTS experiences, The Bologna Process and its implementation, Joint-Degree Programmes, Erasmus and Exchange experiences in universities, Students and Teaching staff Exchange programmes, Ubiquitous Learning, Accessibility to Disabled Users, Animation, 3D, and Web 3D Applications, Context Dependent Learning, Distance Education, E-Learning, E-Manufacturing, Educational Technology, Educational Games and Software, Human Computer Interaction, ICT Education, Internet technologies, Learning Management Systems (LMS), Mobile Applications and Learning (M-learning), Multi-Virtual Environment, Standards and Interoperability, Technology Enhanced Learning, Technology Support for Pervasive Learning, Ubiquitous Computing, Videos for Learning and Educational Multimedia, Virtual and Augmented Reality, Virtual Learning Environments (VLE), Web 2.0, Social Networking, Blogs and Wikis, Wireless Applications, Research In Progress, On going research from undergraduates, graduates/postgraduates and professionals Projects, Collaborative Research, Integration of cross-cultural studies in curriculum, Research Assessment Exercise (RAE), New Trends And Experiences, Other Areas of EducationÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Furthermore, the conference is getting more international each year, which is an indicator that it is getting worldwide known and recognized. Scholars from all over the world contributed to the conference. Special thanks are to all the reviewers, the members of the international editorial board, the publisher, and those involved in technical processes. We would like to thank all who contributed to in every process to make this issue actualized. A total of 21 full papers or abstracts were submitted for this conference and each paper has been peer reviewed by the reviewers specialized in the related field. At the end of the review process, a total of 7 high quality research papers were selected and accepted for publication.Â I hope that you will enjoy reading the papers.Best Regards Â Guest EditorsProf. Dr. Gul Celkan, Middle Georgia State College, USAÂ Editorial AssistantMsc. Zeynep GenÃ§, Near East University, North Cyprus

9

Loess, Nicholas. "Augmentation and Improvisation". M/C Journal 16, n. 6 (7 novembre 2013). http://dx.doi.org/10.5204/mcj.739.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

Abstract (sommario):

Preamble: Medium/Format/Marker Medium/Format/Marker (M/F/M) was a visual-aural improvisational performance involving myself, and musicians Joe Sorbara, and Ben Grossman. It was formed through my work as a PhD candidate at the Improvisation, Community, and Social Practice research initiative at the University of Guelph. This performance was conceived as an attempted intervention against the propensity to reify the “new.” It also sought to address the proliferation of the screen and question how the increased presence of screens in everyday life has augmented the way in which an audience is conceived and positioned. This conception is in direct conversation with my thesis, which is a practice-based research project exploring what the experimental combination of intermediality, improvisation, and the cinema might offer towards developing a reflexive approach to "new" media, screen culture, and expanded cinemas. One of the ways I chose to explore this area involved developing an interface that allowed an audio-visual ensemble to improvise with a film's audio-visual projection. I experimented with different VJ programs. These programs often utilize digital filters and effects to alter images through real-time mixing and layering, much like a DJ does with sound. I found a program developed by Chicago-based artist Ontologist called Ontoplayer, which he developed out of his practice as an improvisational video artist. The program works through a dual-channel interface where two separate digital files could be augmented, with their projected tempo capable of being determined by musicians through a MIDI interface. I conceptualized the performance around the possibility of networking myself with two other musicians via this interface. I approached percussionist Joe Sorbara and multi-instrumentalist Ben Grossman with the idea to use Ontoplayer as a means to improvise with Chris Marker's La Jetée (1962, 28 mins). The film itself would be projected simultaneously in four different formats: 16mm celluloid, VHS, Blu-ray, and Standard Definition video (the format the ensemble improvised with) projected onto four separate screens. From left to right, the first screen contained the projected version of La Jetée that we improvised with, next to it was its Blu-ray format, next to that, a degraded VHS copy of the film, and next to that, the 16mm print. The performance materialized through performing a number of improvisatory experiments. A last minute experiment conceived a few hours before the performance involved placing contact microphones overtop of the motor on a Bell & Howell 16mm projector. The projector was tested in the days leading up to the performance and it ran as smoothly as could be expected. It had a nice cacophonous hum that Ben Grossman intended to improvise with using some contact mics attached directly over the projector’s motor, a $5 iPad app, and his hurdy-gurdy. Fifteen minutes before the performance began, the three of us huddled to discuss how long we'd like to go. We had met briefly the day before to discuss the technical setup of the performance but not its execution and length. I hadn't considered duration. Joe broke the silence by asking if we'd be "finding beginnings and endings." I didn't know what that entailed, but nodded. We started. I turned on the projector and it immediately started to cough and chew on the 40 year old 16mm print I found online. My first impulse was to intervene, to try to save it. The film continued and I sat frozen for a moment. Joe started playing and Ben, expecting me to send him the audio track from La Jetée, prompted me to do so. I let the projector go and began. Joe had a digital kick-drum and two contact mics on his drum kit hooked into a MIDI hub, while Ben's hurdy-gurdy had a contact mic inside it, wired into the hub. The hub hooked into my laptop and allowed for an intermedial conversation to emerge between the three of us. While the 16mm, VHS, and Blu-Ray formats proceeded relatively unimpeded alongside each other on their respective screens, the fourth screen was where this conversation took place. I digitally reordered different image sequences from La Jetée. The fact that it’s a film (almost) comprised entirely of still images made this reordering intriguing in that I was able control the speed of progressing from each image to the next. The movement from image to image was structured between Ben and Joe’s improvisations and the kind of effects and filters I had initialized. Ontoplayer has a number of effects and filters that push the base image into more abstract territories (e.g.: geometric shapes, over pixelation) I was uninterested in exploring. I utilized effects that to some degree still kept the representational content of the image intact. The degree to which these effects took hold of the image were determined by whether or not Ben and Joe decided to use the part of their instrument that would trigger them. The decision to linger on an image, colour it differently, or skip ahead in the film’s real-time projection destabilized my sense of where I was in the film. It became an event in the sense that each movement, both visual and aural was happening with an indeterminate duration. La Jetée opens with the narrator proclaiming: “this is the story of a man marked by an image from his childhood.” The story itself is situated around a man in a post-apocalyptic world, haunted by the persistent memory of a woman he saw as a child while standing on the jetty at Orly Airport in Paris. The man was a soldier, now captured, and imprisoned in an underground camp. The prison guards have been conducting experiments on the prisoners, attempting to use the prisoner’s memories as a mechanism to send them backwards and forwards in time. The narrator explains, “with the surface of the planet irradiated … The human race was doomed. Space was off limits. The only link with survival passed through time … The purpose of the experiments was to throw emissaries into time to call the past and future to the aid of the present.” La Jetée is visually structured as a photomontage, with voice-over narration, diegetic and non-diegetic sound existing as component parts to the whole film. I decided to separate these components for the sake of isolating them before the performance as instruments of the film to be improvisationally deployed through the intermedial connection between Ben, Joe, and myself. The resulting projections that emerged from our interface became a kind of improvised "grooving" to La Jetée that restricted the impulse to discriminately place sound beneath and behind the image. I selected images from different points in the film that felt "timely" given the changing dynamic between the three of us. I remember lingering on an image of the woman's face, her hand against her mouth, her hair being blown back by the wind. I looked and listened for the moment when the film would catch and then catch fire. It never came. We let the reel run to the end and continued on improvising until we found an ending. But the sound of that film catching but never breaking, the intention and tension of the film being near death the entire time made everything we did more precious, teetering on the brink of failure. We could never have predicted that, and it gave us something I continue to ponder and be thankful for. Celluloid junkies in the room commented on how precipitous the whole thing was, given how rare it is to encounter the sound of celluloid film travelling through a projector inside a cinematic space. An audiophile mused over how there wasn’t any document, his mind adequately blown by how “funky” the projector sounded. With there being no document of the performance, I'm left with my own memories. In mining the aftermath of this performance, I hope to find an addendum that considers how improvisation might negotiate with augmentation in ways that speak to Walter Benjamin's assertion that the "camera, the film, on the one hand, extends our comprehension of the necessities which rule our lives; on the other hand, it manages to assure us of an immense and unexpected field of action” (Benjamin 236-7).Images to be Determined I got a job working in a photo lab eight years ago, right around the time digital cameras started becoming not only affordable, but technologically-comparable alternatives to film cameras. The photo printer in the lab was setup to scan and digitize celluloid filmstrips to allow for digital “touchups” by the technician. It was also hooked into touchscreen media stations that accepted a variety of memory card formats so that customers could “touchup” their own images. Celluloid film meant that as long as their format was chemical, touching up their images remained the task of the technician. Against the urging of the lab’s manager, I resisted altering other people’s images. It felt like a violation, despite the fact that almost every customer was unaware of this process. They assumed a degree of responsibility for a chemically-exposed image. I still got blamed for a lot of bad photography, but an image chemically under or overexposed was irreparable. Digital cameras changed all of that. I still preferred an evenly exposed celluloid print to a digital, but the allure was the ability for these images to be augmented. Augmentation is synonymous with "enhancement," "prosthesis," "addition," "amplification," "enrichment," "expansion,” and "extension" (to name a few). For the purpose of this essay, I am situating augmentation as an agential act engaging with a static form to purposefully alter its aesthetic and political relation to a reality. To what extent can we say that the digital image is itself, an augmentation? If Instagram is any indication, the digital image's existence is bound by its perpetual augmentation. A digital image is only as good as its capacity to be worked on. The ubiquity of digitally applying lomographic filters to digital images, as a defining step in their distributive chain, is indicative of the discursive impact remediating the old into the new has on digital forms. These digitally-coded filters used to augment “clear” digital images are comprised of exaggerated imperfections that existed to varying degrees, as unforeseen side effects of working with comparatively more unstable celluloid textures. The filtered images themselves are digital distortions of a digital original. The filters augment this original through obscuring one or a number of components. Some filters might exaggerate the green values or sharpen a particular quadrant within the frame that might coincide with the look of a particular film stock from the past. The discourse of “film” and “vintage” photography has become a synonymous component of the digital aesthetic, discursively warming up what is often considered to be a cold, and disembodied medium. Augmentation works to re-establish a congruous relationship between the filmic and the digital, attempting to reconcile the aesthetic distance between granularity and pixelation. This is ironic because this process is encapsulated through digitally encoding and applying these filters for the sake of obscuring clarity. Thus, the object is both hailed as clear and clearly manipulable. Another example a bit closer to the cinema is the development of digital video cameras offering RAW, or minimally compressed file formats for the sole purpose of augmenting the initial recording in post-production workflows in an attempt to minimize degradation in the image. The colour values and dynamic range of these images are muted, or flattened so that the human can control their elevation after the fact. To some degree the initial image, in itself, is an augmentation of its filmic relative. From early experiments with video synthesizers to the present digital coding of film effects, digital images have tantalized video artists and filmmakers with possibility shrouded in instantaneity and malleability. A key problem with this structure remains the unbridled proliferation and expansion of the digital image, set free for the sake of newness. How might improvisation work towards establishing an ethics of augmentation? An ethics of this kind must disrupt the popular notion of the digital image existing beyond analogical constraints. The belief that “if you can imagine it, you can do it” obfuscates the reality that to work with images, whatever their texture, is a negotiation with constraint. Part of M/F/M’s fruition emerged from a conversation I'd had with Canadian Animator Pierre Hébert last summer. Now obvious, but for Hébert, the first obstacle he needed to overcome as an improviser was developing an instrument that he could gig with. Through the act of designing an instrument I immediately became aware of what wasn't possible, and so the work leading up to the performance involved attempting to expand the possibilities of that instrument. How might I conceive of my own treatment of images simultaneously treated by Joe and Ben as a kind of cinematic extended technique we collaboratively bring into being? Constraint necessitates the need for extension, for finding new ways to sound and appear. Constraint is also consistently conceived as shackling progress. In scientific methodologies it is often arbitrarily imposed to steer an experiment into a desired direction. This sort of experimental methodology is in the business of presupposing outcomes, which I feel is often the case with what ultimately becomes the essay of end result in Humanities research. Constraint is an important imposition in improvisation only if the parties involved are willing to find new ways to move in consort with it. The act of improvisation is thus an engagement with the spatio-temporal constraints of performance, politics, memory, texture, and difference. My conception of the cinema is that of an instrument, whose past is what I work with to better understand its future. Critic Gene Youngblood, in his landmark book, Expanded Cinema, theorized a new conception of the cinema as a global planetary phenomenon suffused inside a space of intermedia, where immersive, interactive, and interconnected realms necessitated the need to critically conceptualise the cinema in cosmic terms. At around the time of Youngblood's writing, another practitioner of the cosmic way, improviser and composer Sun Ra was staking a similar claim for music's ability to uplift the species cosmically. Ra's popular line “If we came from nowhere here, why can’t we go somewhere there?” (Heble 125), articulated the problematic racial politics in post-WWII America, that fixed African-American identity into a static domain with little room to move upward. The "somewhere there" to Ra was a non-space, created from "a desire to opt out of the very codes of representation and intelligibility, the very frameworks of interpretation and assumption which have legitimated the workings of dominant culture" (Heble 125). Though Youngblood's and Ra's intellectual and creative impulses formed from differing political circumstances, the work and thinking of these two figures remain significant articulations of the need to work from and towards the cosmic. In 2003, Youngblood published a follow-up essay in a reprint of Expanded Cinema entitled Cinema and the Code. In it, he defines cinema as a “phenomenology of the moving image.” Rather than conceiving of it through any of its particular media, Youngblood advocates for a segregated conception of the cinema: Just as we separate music from its instruments. Cinema is the art of organizing a stream of audiovisual events in time. It is an event-stream, like music. There are at least four media through which we can practice cinema – film, video, holography, and structured digital code—just as there are many instruments through which we can practice music. (Youngblood cited in Marchessault and Lord 7) Music and cinema are thus conceived as the exterior consequences of creative and co-creative instrumental experimentation. For Ra and Youngblood, the planetary stakes of this project are infused with the need to manufacture and occupy an imaginative space (if only for a moment) outside of the known. This is not to say that the action itself is transcendental. But rather this outside is the planetary. For the past year I've been making a documentary with Joe Sorbara on the free improv scene in Toronto. Listening to musicians talk about improvisation in expansive terms, as this ethereal and ephemeral experience, that exists on the brink of failure, that is as much an act of memory as renewal, reverberated with my own feelings surrounding the cinema. Improvisation, to philosopher Gary Peters, is the "entwinement of preservation and destruction", that "invites us to make a transition from a closed conception of the past to one that re-thinks it as an endlessly ongoing event or occurrence whereby tradition is re-originated (Benjamin) or re-opened (Heidegger)” (Peters 2). This “entwinement of preservation and destruction” takes me back to my earlier discussion of the ways in which digital photography, in particular lomographically filtered snapshots, is structured through preserving the discursive past of film while destroying its standard. The performance of M/F/M attempted to connect the augmentation of the digital image and the impact this augmentation had on conceptualizing the past through an improvisational approach to intermediality. The issue I have with the determination of images concerns their technological standardization. As long as manufacturers and technicians control this process then the practice of gathering, projecting, and experiencing digital images is predetermined by their commercial obligation. It assures that augmenting the “immense and unexpected field of action” comprising the domain of images is itself a predetermination. References Benjamin, Walter. Illuminations. New York: Schocken Books, 1985. Heble, Ajay. Landing on the Wrong Note. London: Routledge, 2000. Marker, Chris, dir. La Jetée. Argos Films. 1962. Marchessault, Janine, and Susan Lord. Fluid Screens, Expanded Cinema. Toronto: University of Toronto Press, 2007. Peters, Gary. The Philosophy of Improvisation. Chicago: University of Chicago Press, 2009.

Tesi sul tema "Near-End listening enhancement":

1

Sauert, Bastian [Verfasser]. "Near-end listening enhancement : theory and application / Bastian Sauert". Aachen : Hochschulbibliothek der Rheinisch-Westfälischen Technischen Hochschule Aachen, 2014. http://d-nb.info/1057037257/34.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

2

Gentet, Enguerrand. "Amélioration de l'intelligibilité de signaux audio de parole en contexte bruité automobile". Electronic Thesis or Diss., Institut polytechnique de Paris, 2021. http://www.theses.fr/2021IPPAT008.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

Abstract (sommario):

La quantité de diffusion de signaux de parole dans les habitacles automobiles est de plus en plus importante : télécommunications, radio, système de navigation... Cependant, malgré les efforts et les avancées mécaniques, beaucoup de bruits persistent au sein de l'habitacle dégradant fortement l'intelligibilité de ces signaux de parole. L'objectif de cette thèse est alors de développer des outils de renforcement de la parole visant à traiter les signaux avant leur dégradation afin d'assurer une bonne intelligibilité dans le bruit des habitacles automobiles. Une approche de renforcement de la parole très performante consiste à utiliser un égaliseur fréquentiel afin d’optimiser un critère d’intelligibilité : le Speech Intelligibility Index (SII). Pour faciliter l'optimisation, les méthodes actuelles se basent sur des approximations du critère. De plus, en concentrant l'énergie spectrale du signal dans des zones où l'oreille est plus sensible, ces méthodes augmentent le volume perçu ce qui peut détériorer l'expérience utilisateur. Ainsi, en plus de proposer une méthode de résolution exacte du problème de maximisation du SII, nos travaux proposent d’introduire et étudier l'influence d'une nouvelle contrainte perceptive maintenant les signaux à leur niveau perçu. La popularisation des approches d’apprentissage automatique pousse à apprendre les traitements de renforcement de la parole à partir d’exemples naturellement produits dans le bruit (parole Lombard), ou en sur-articulant (parole claire). Les travaux actuels ne parviennent pas à obtenir des gains d’intelligibilité aussi significatifs qu’avec les modifications naturelles et nous pensons que la négligence de nombreux aspects temporels pourrait en être partiellement responsable. Nos travaux proposent donc d’approfondir ces approches en exploitant des modèles d’apprentissage et des pré-traitements adaptés aux séquences temporelles longues. Nous proposons aussi une nouvelle modélisation des modifications du débit de la parole directement intégrable dans l’apprentissage machine ce qui n'avait jamais été fait auparavant
Speech is nowadays present in a number of in-car applications ranging from hands-free communications, radio programs to speech synthesis messages from the various car devices.However, despite the steady car manufacturing progress, significant noise still remains in the car interior that leads to a loss of intelligibility of speech signals. The PhD work aims at developping speech reinforcement tools in order to process the signals before they are played in a noisy in-car environment.A highly effective speech reinforcement approach is to use a frequency equalizer to optimize an intelligibility criterion : the Speech Intelligibility Index (SII). To facilitate optimization, current methods are based on approximations of the criterion. In addition, by concentrating the spectral energy of the signal in areas where the ear is more sensitive, these methods increase the perceived volume which can deteriorate the user experience. Thus, in addition to proposing an exact method of solving the SII maximization problem, our work proposes to introduce and study the influence of a new perceptual constraint in order to maintain the signals at their perceived level.The popularization of machine learning approaches pushes to learn speech reinforcement processings from examples naturally produced in noise (Lombard speech), or by over-articulation (clear speech). Current work fails to achieve intelligibility gains as significant as with natural modification, and we believe that the many temporal aspects neglect may be partially responsible. Our work therefore proposes to deepen these approaches by exploiting learning models and pre-processings adapted to long duration sequences. We also propose a new modeling of the speech rate modifications that directly fits in the machine learning model which had never been done before

Capitoli di libri sul tema "Near-End listening enhancement":

1

Herasimovich, Vadzim, Alexey Petrovsky, Vladislav Avramov e Alexander Petrovsky. "Audio/Speech Coding Based on the Perceptual Sparse Representation of the Signal with DAE Neural Network Quantizer and Near-End Listening Enhancement". In Cryptology and Network Security, 109–19. Cham: Springer International Publishing, 2018. http://dx.doi.org/10.1007/978-3-319-98678-4_13.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

Atti di convegni sul tema "Near-End listening enhancement":

1

Niermann, Markus, Peter Jax e Peter Vary. "Joint Near-End Listening Enhancement and far-end noise reduction". In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2017. http://dx.doi.org/10.1109/icassp.2017.7953102.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

2

Chermaz, Carol, Cassia Valentini-Botinhao, Henning Schepker e Simon King. "Evaluating Near End Listening Enhancement Algorithms in Realistic Environments". In Interspeech 2019. ISCA: ISCA, 2019. http://dx.doi.org/10.21437/interspeech.2019-1800.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

3

Chermaz, Carol, e Simon King. "A Sound Engineering Approach to Near End Listening Enhancement". In Interspeech 2020. ISCA: ISCA, 2020. http://dx.doi.org/10.21437/interspeech.2020-2748.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

4

Niermann, Markus, Peter Jax e Peter Vary. "Near-end listening enhancement by noise-inverse speech shaping". In 2016 24th European Signal Processing Conference (EUSIPCO). IEEE, 2016. http://dx.doi.org/10.1109/eusipco.2016.7760677.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

5

Lavanya, T., K. Mrinalini, P. Vijayalakshmi e T. Nagarajan. "Histogram Matching based Optimized Energy Redistribution for Near End Listening Enhancement". In TENCON 2019 - 2019 IEEE Region 10 Conference (TENCON). IEEE, 2019. http://dx.doi.org/10.1109/tencon.2019.8929292.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

6

Schepker, Henning, David Hülsmeier, Jan Rennies e Simon Doclo. "Model-based integration of reverberation for noise-adaptive near-end listening enhancement". In Interspeech 2015. ISCA: ISCA, 2015. http://dx.doi.org/10.21437/interspeech.2015-30.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri

7

Zorilă, Tudor-Cătălin, e Yannis Stylianou. "On the Quality and Intelligibility of Noisy Speech Processed for Near-End Listening Enhancement". In Interspeech 2017. ISCA: ISCA, 2017. http://dx.doi.org/10.21437/interspeech.2017-1225.

Testo completo

Gli stili APA, Harvard, Vancouver, ISO e altri