Academic literature on the topic "Multi-channel ASR"

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Multi-channel ASR.'

Journal articles on the topic "Multi-channel ASR"

1

Yun, Dajung, and Kyoungwon Lee. "Changes of the Korean Consonant-Vowel Syllabic Recognition Scores According to the Compression Threshold Control of Multi-Channel Hearing Aid." Audiology and Speech Research 16, no. 4 (October 31, 2020): 321–28. http://dx.doi.org/10.21848/asr.200061.

Abstract:
Purpose: While adjusting the compression threshold (CT) of the low and high frequency bands of multi-channel hearing aids in noise, we evaluated changes in Korean consonant-vowel (CV) syllabic recognition scores, sound quality, and loudness of noise with white noise. Methods: A total of twenty-one subjects with hearing loss (mean age of 66.2 years) participated voluntarily. Their pure-tone average was 46.0 dB HL and the word recognition score was 72.1%. The CTs of the low/high frequency bands were adjusted to 50/50 dB SPL, 50/65 dB SPL, 65/50 dB SPL and 60/65 dB SPL in multi-channel hearing aids. While presenting white noise to the non-test ear, Korean CV syllables were presented to the test ear to evaluate the recognition scores, clarity of conversational speech, and loudness of noise. The intensity of CV syllables presented to the subject was 50 dB HL, and the signal-to-noise ratios (SNRs) were 5 dB and 10 dB. Results: First, when the CT of the low/high frequency bands was adjusted to 50/50 dB SPL, the CV syllabic recognition scores were higher than at 65/50 dB SPL. Second, in the noise environment, the clarity of conversational speech and the loudness of the noise did not differ across CT levels. Conclusion: Based on the current findings, the CT settings of the frequency bands in multi-channel hearing aids might change the recognition of Korean speech sounds, which may inform future adjustment of frequency-band CTs in multi-channel hearing aids.
2

Wang, Zhong-Qiu, Peidong Wang, and DeLiang Wang. "Complex Spectral Mapping for Single- and Multi-Channel Speech Enhancement and Robust ASR." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 1778–87. http://dx.doi.org/10.1109/taslp.2020.2998279.

3

Shin, Suyeon, and Kyoungwon Lee. "Comparisons of Sentence Recognition Scores and Preferences according to Compression Types in a Multi-Channel Hearing Aid." Audiology and Speech Research 13, no. 1 (January 25, 2017): 88–94. http://dx.doi.org/10.21848/asr.2017.13.1.88.

4

Kim, Seeon, and Kyoungwon Lee. "Changes of Consonant Recognition and Sound Quality as a Function of Compression Threshold on Multi-Channel Hearing Aids." Audiology and Speech Research 15, no. 1 (January 31, 2019): 23–29. http://dx.doi.org/10.21848/asr.2019.15.1.23.

5

Lin, Shaoxiong, Wangyou Zhang, and Yanmin Qian. "Two-Stage Single-Channel Speech Enhancement with Multi-Frame Filtering." Applied Sciences 13, no. 8 (April 14, 2023): 4926. http://dx.doi.org/10.3390/app13084926.

Abstract:
Speech enhancement has been extensively studied and applied in the fields of automatic speech recognition (ASR), speaker recognition, etc. With the advances of deep learning, attempts to apply Deep Neural Networks (DNN) to speech enhancement have achieved remarkable results and the quality of enhanced speech has been greatly improved. In this study, we propose a two-stage model for single-channel speech enhancement. The model has two DNNs with the same architecture. In the first stage, only the first DNN is trained. In the second stage, the second DNN is trained to refine the enhanced output from the first DNN, while the first DNN is frozen. A multi-frame filter is introduced to help the second DNN reduce the distortion of the enhanced speech. Experimental results on both synthetic and real datasets show that the proposed model outperforms other enhancement models not only in terms of speech enhancement evaluation metrics and word error rate (WER), but also in its superior generalization ability. The results of the ablation experiments also demonstrate that combining the two-stage model with the multi-frame filter yields better enhancement performance and less distortion.
6

Jeong, Jihaeng, and Kyoungwon Lee. "Changes of Non-Sense Syllabic Word Scores and Preferences as a Function of Release Time on Multi-Channel Hearing Aids." Audiology and Speech Research 14, no. 4 (October 31, 2018): 250–58. http://dx.doi.org/10.21848/asr.2018.14.4.250.

7

Bang, Eojini, and Kyoungwon Lee. "A Study on the Preferred Real-Ear Insertion Gain of Multi-Channel Hearing Aid for the Korean with Sensorineural Hearing Loss." Audiology and Speech Research 16, no. 2 (April 30, 2020): 85–94. http://dx.doi.org/10.21848/asr.200006.

Abstract:
Purpose: This study aimed to compare the preferred real-ear insertion gain for Koreans (PREIG-K) wearing multi-channel hearing aids with the National Acoustic Laboratories Non-Linear version 2 (NAL-NL2; National Acoustic Laboratories) gains in order to develop a Korean hearing aid fitting formula. Methods: A total of thirty-one (62 ears) Korean hearing aid users were included in this study. All subjects wore in-the-canal or custom hearing aids in both ears. Individual hearing aid fitting procedures involved adjusting the gains for 50, 65, and 80 dB sound pressure level of speech across low, high, and wideband frequency bands based on the participant's subjective responses. In addition, only the high frequency bands of 1 kHz or more of the PREIG-K were re-adjusted to match the NAL-NL2 gain, and the word recognition scores (WRSs) were then compared before and after the gain adjustment. Results: The results showed that the PREIG-K increased up to 1.5 kHz, where it reached its maximum, and then decreased across the higher frequencies. For all half-octave frequencies, the PREIG-Ks were substantially less than the NAL-NL2 gains. When the PREIG-K of the high frequencies was re-adjusted to match the NAL-NL2 gains, the WRSs were not significantly different before and after gain adjustment. The slopes of the PREIG-K up to 1.5 kHz were steeper than the slopes of the NAL-NL2 gain, but similar to the slopes of the manufacturers' fitting formulae. Conclusion: The development of an effective hearing aid fitting formula for improving the communication abilities of hearing-impaired Koreans will require further experiments considering the language, physical characteristics, and word recognition used by Koreans.
8

Aysa, Zuhragvl, Mijit Ablimit, Hankiz Yilahun, and Askar Hamdulla. "Language Identification-Based Evaluation of Single Channel Speech Separation of Overlapped Speeches." Information 13, no. 10 (October 11, 2022): 492. http://dx.doi.org/10.3390/info13100492.

Abstract:
In multi-lingual, multi-speaker environments (e.g., international conference scenarios), speech, language, and background sounds can overlap. In real-world scenarios, source separation techniques are needed to separate target sounds. Downstream tasks, such as ASR, speaker recognition, speech recognition, VAD, etc., can be combined with speech separation tasks to gain a better understanding. Since most of the evaluation methods for monophonic separation are either single or subjective, this paper used the downstream recognition task as an overall evaluation criterion. Thus, the performance could be directly evaluated by the metrics of the downstream task. In this paper, we investigated a two-stage training scheme that combined speech separation and language identification tasks. To analyze and optimize the separation performance of single-channel overlapping speech, the separated speech was fed to a language identification engine to evaluate its accuracy. The speech separation model was a single-channel speech separation network trained with WSJ0-2mix. For the language identification system, we used an Oriental Language Dataset and a dataset synthesized by directly mixing different proportions of speech groups. The combined effect of these two models was evaluated for various overlapping speech scenarios. When the language identification network model was based on single-person single-speech frequency spectrum features, Chinese, Japanese, Korean, Indonesian, and Vietnamese had significantly improved recognition results over the mixed audio spectrum.
9

Liu, Cai Xia, Xiao Qing Tian, and Zhi Bin Zhang. "Optimal Space Management Mechanism of Sharable Multi-Channel On-Chip Memory." Advanced Materials Research 629 (December 2012): 542–47. http://dx.doi.org/10.4028/www.scientific.net/amr.629.542.

Abstract:
A shared multi-channel on-chip memory CMP architecture is proposed in this article to efficiently support embedded applications. Because multi-channel on-chip memory is a scarce resource, an optimal space management mechanism is proposed, comprising an automatic space allocation strategy based on the application parallelization mapping pattern and an optimal space utilization scheme. ILP-model-based analysis of system performance verifies that the proposed mechanism can fully exploit multi-channel on-chip memory to improve system performance.
10

Kareem, B., and A. A. Aderoba. "Development of a Heuristics Model for Simplifying a Multi-Channel Queuing System." Advanced Materials Research 367 (October 2011): 647–52. http://dx.doi.org/10.4028/www.scientific.net/amr.367.647.

Abstract:
Queuing models have been discussed widely in the literature. The structures of queuing systems are broadly divided into three, namely single, multi-channel, and mixed. Equations for solving these queuing problems vary in complexity. The most complex of them is the multi-channel queuing problem. A heuristically simplified equation based on relative comparison, using the proportionality principle, of the measured effectiveness from the single and multi-channel models seems promising in solving this complex problem. In this study, six different queuing models were used, of which five are single-channel systems while the remaining one is multi-channel. Equations for solving these models were identified based on their properties. The queuing models' performance parameters were measured using the relative proportionality principle, from which the complexity of the multi-channel system was transformed to a simple linear relation of the form = . This showed that the performance obtained from single channel model has a linear relationship with corresponding to multi-channel, and is a factor which varies with the structure of queuing system. The model was tested with practical data collected on the arrival and departure of customers from a cocoa processing factory. The performances obtained based on average number of customers on line, average number of customers in the system, average waiting time in line and average waiting time in the system, under certain conditions, showed no significant difference between the heuristic and analytical models.

Dissertations / Theses on the topic "Multi-channel ASR"

1

Koepke, Corbin G. (Corbin Gene) 1977. "Multi-mission optimized re-planning in air mobility command's channel route execution." Thesis, Massachusetts Institute of Technology, 2004. http://hdl.handle.net/1721.1/17726.

Abstract:
Thesis (S.M.)--Massachusetts Institute of Technology, Sloan School of Management, Operations Research Center, 2004.
Includes bibliographical references (p. 143-145).
The United States Air Force's Air Mobility Command is responsible for creating a schedule and executing that schedule for a large-scale air mobility network that encompasses different mission areas. One of the mission areas is channel route. Channel route execution often experiences disruptions that motivate a need for changes in the current channel route schedule. Traditionally, re-planning the channel route schedule has been a manual process that usually stops after the first feasible set of changes is found, due to the challenges of large amounts of data and urgency for a re-plan. Other challenges include subjective trade-offs and a desire for minimal changes to the channel route schedule. We re-plan the channel route schedule using a set of integer programs and heuristics that overcome these challenges. The integer programs' variables incorporate many of Air Mobility Command's operating constraints, so they do not have to be explicitly included in the formulations. The re-plan uses opportunities in the other mission areas and reroutes channel route aircraft. Finally, our methods can quickly find a solution, allow for "what-if" analysis and interaction with the user, and can be adapted to an evolution in Air Mobility Command's operations while the underlying models remain constant.
by Corbin G. Koepke.
S.M.
2

Belhabib, Mounia. "Investigation on radio channel over the air emulation by multi-probe setup." Thesis, Rennes 1, 2017. http://www.theses.fr/2017REN1S070/document.

Abstract:
The need for high data-rate, reliable and low-latency transmission in wireless communication systems has motivated a multitude of wireless standards, spanning from 3G-4G and WLAN to the upcoming fifth generation (5G) of mobile networks. In this context, technology providers, as well as operators, need to develop accurate and cost-effective standard test methods to evaluate device performance. Over-The-Air (OTA) test methodologies aim to reproduce radio multipath environments in the laboratory in a repeatable and controllable manner, avoiding costly field tests. The focus of this thesis is to propose a new OTA test methodology in order to emulate radio channel propagation over a wide band and to evaluate the performance of wireless systems in real environments. We start our study by introducing the basics of the radio channel and some channel models presented in the literature. Then a critical review of existing OTA methodologies in the literature is provided. Among the different methodologies, we opted for the multi-probe anechoic chamber approach, which consists of deploying a number of probes around a device and feeding them with a fading emulator. This methodology provides an accurate reproduction of spatial channel characteristics, which are needed to assess the performance of multi-antenna terminals in real environments. The most important advantage of this methodology is the capability to emulate different channel models in terms of spatial resolution and angular and temporal fading. A simulation tool was developed to investigate and determine the OTA setup under different channel conditions. In particular, the number and placement of the antennas needed and the size of the setup were investigated as a function of the electrical size of the device under test. Based on the dimensioning studies, an experimental OTA setup was realized to reproduce the channel characteristics in three-dimensional space for a frequency range from 2 to 6 GHz.
3

Christopoulos, Charitos Andreas. "Brain disease classification using multi-channel 3D convolutional neural networks." Thesis, Linköpings universitet, Statistik och maskininlärning, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-174329.

Abstract:
Functional magnetic resonance imaging (fMRI) technology has been used to investigate human brain functionality and to assist in brain disease diagnosis. While fMRI can be used to model both spatial and temporal brain functionality, the analysis of fMRI images and the discovery of patterns for certain brain diseases is still a challenging task in medical imaging. Deep learning has been used more and more in the medical field in an effort to further improve disease diagnosis, due to its effectiveness in discovering high-level features in images. Convolutional neural networks (CNNs) are a class of deep learning algorithms that have been successfully used in medical imaging to extract spatial hierarchical features. The application of CNNs to fMRI and the extraction of brain functional patterns is an open field for research. This project focuses on how fMRI can be used to improve Autism Spectrum Disorder (ASD) detection and diagnosis with 3D resting-state functional MRI (rs-fMRI) images. ASDs are a range of neurodevelopmental brain diseases that mostly affect social function. Symptoms include social and communication difficulties as well as restricted and repetitive behaviors. The symptoms appear in early childhood and tend to develop over time, so an early diagnosis is required. Finding a proper model for distinguishing between ASD and healthy subjects is a challenging task and involves a lot of hyper-parameter tuning. In this project, a grid search approach is followed in the quest for the optimal CNN architecture. Additionally, regularization and augmentation techniques are implemented in an effort to further improve the model's performance.

Books on the topic "Multi-channel ASR"

1

Mal'shina, N., and Andrey Garnov. MODERN PRINCIPLES ANALYSIS OF RESOURCE FLOWS IN CRISIS CONDITIONS: CULTURE AND CREATIVE INDUSTRY. xxu: Academus Publishing, 2020. http://dx.doi.org/10.31519/978-1-4946-0018-1.

Abstract:
The culture industry should become attractive for capital investment through the development of organizational and economic support mechanisms in the form of integrated structures, as well as through the development of mechanisms for its financing: multi-channel cultural funding systems and public-private partnerships, which would create the prerequisites for the emergence and implementation of new ideas and projects in the field of culture and contribute to the formation of the cultural sphere as a full-fledged source of state income. As a result of this project, original new fundamental theoretical assumptions and empirical data in the field of the culture industry and regional development will be obtained. The project focuses on the fundamental basics of how the culture industry functions and addresses the scientific problems of evaluating its effectiveness and justifying the directions of its support and funding; it also works out strategies and mechanisms for developing the culture industry and increasing its efficiency in accordance with the characteristics and needs of regional economies.
2

Mal'shina, Nataliya, and Andrey Garnov. Culture and creative industry: the control of flow processes. ru: INFRA-M Academic Publishing LLC., 2021. http://dx.doi.org/10.12737/1158704.

Abstract:
The cultural industry should become attractive for capital investment through the development of project activities, organizational and economic support mechanisms in the form of integrated structures, as well as through the development of mechanisms for its financing: multi-channel cultural financing systems and public-private partnerships that would create prerequisites for the emergence and implementation of new ideas and projects in the field of culture, contributing to its formation as a full-fledged source of state income. As a result of the implementation of this project, original new fundamental theoretical positions and empirical data in the field of the cultural and creative industries were obtained. The research focuses on the analysis of the fundamental foundations of the functioning of the cultural and creative industries, solves the scientific problems of evaluating its effectiveness and justifying the directions of support and financing, develops strategies and mechanisms for the development of the Russian cultural industry in accordance with the specifics and needs of regional economies. For a wide range of readers interested in the development of the cultural industry.
3

Ng, Kia, Atta Badii, and Pierfrancesco Bellini, eds. Axmedis 2006. Proceedings of the 2nd International Conference on Automated Production of Cross Media Content for Multi-channel Distribution. Volume for Workshops, Tutorials, Applications and Industrial (Leeds, UK, 13-15 December 2006). Florence: Firenze University Press, 2006. http://dx.doi.org/10.36253/88-8453-525-5.

Abstract:
The AxMEDIS 2006 International Conference seeks to promote discussion and exchange of ideas amongst researchers, practitioners, developers and users of tools, technology transfer experts, and project managers. This conference series brings together a variety of participants from the academic, business and industrial worlds, to address the emergent research and technological issues as well as the engineering and commercial challenges of large-scale collaborative production and distribution of media as experienced by the associated industrial sectors in the emergent media markets. The conference focuses on the outstanding problems to be resolved in the new age of media computing including cross-domain production, protection, representation, formatting, aggregation, workflow, distribution and business and transaction models i.e. all lifecycle aspects of the new media value chain management. Additionally it explores the integration of new forms of content, content management systems and distribution chains, with particular emphasis on cost structures re-engineering to support the reduction of costs and the integration of innovative solutions to facilitate complex creative collaboration in cross-domain media production with benefit realisation to all stakeholders through optimised rights-protective multichannel distribution.

Book chapters on the topic "Multi-channel ASR"

1

Gabler, Manuel, Knut Paasche, and Ragnheiður Traustadóttir. "Experiences from motorized GPR surveys in Iceland." In Advances in On- and Offshore Archaeological Prospection, 221–30. Kiel: Universitätsverlag Kiel | Kiel University Publishing, 2023. http://dx.doi.org/10.38072/978-3-928794-83-1/p23.

Abstract:
Motorized multi-channel GPR investigations in Iceland in July 2022 showed great potential for large-scale archaeological geophysical surveys. Investigations on selected sites demonstrated effective fieldwork opportunities, with excellent signal penetration through thick layers of ash and sand and very good contrast between turf houses and the surrounding soil. Although Iceland has very challenging landscapes, many places are perfectly suited to motorized surveys, which makes them very efficient for landscape archaeological investigations.
2

Jung Kim, Hyung, Deock Gu Jee, Man Ho Park, Byung Sik Yoon, and Song In Choi. "The Real-Time Implementation of Multi-channel AMR Codec Using TMS320C62xx DSP." In Mobile Communications, 373–78. Berlin, Heidelberg: Springer Berlin Heidelberg, 2003. http://dx.doi.org/10.1007/3-540-36555-9_38.

3

Duan, Fuqiao, Kai Mao, Qiuming Zhu, Yanheng Qiu, Xiaomin Chen, and Zhipeng Lin. "Design and Optimization of Sounding Sequence for Multi-antenna Air-to-ground Channel Measurements." In Lecture Notes in Electrical Engineering, 347–55. Singapore: Springer Nature Singapore, 2023. http://dx.doi.org/10.1007/978-981-99-1260-5_44.

4

Fan, Xianguang, Hailing Mao, Chengxiang Zhu, Juntao Wu, Yingjie Xu, and Xin Wang. "Design of Multi-channel Pressure Data Acquisition System Based on Resonant Pressure Sensor for FADS." In Proceeding of 2021 International Conference on Wireless Communications, Networking and Applications, 446–55. Singapore: Springer Nature Singapore, 2022. http://dx.doi.org/10.1007/978-981-19-2456-9_46.

Abstract:
Resonant pressure sensors have high accuracy and are widely used in meteorological data acquisition, aerospace and other fields. The design and testing of a multi-channel pressure data acquisition system based on resonant pressure sensors, intended for the flush air data sensing (FADS) system, are described. A DSP and FPGA hardware architecture is applied to the data acquisition system. A digital cymometer and a 16-bit analog-to-digital converter are used to measure the output signal of the sensor. The data acquisition system is shown to perform well within the operating temperature range. The maximum experimental error is less than 0.02%FS over the range 2–350 kPa. The period of sampling and fitting is less than 8 ms. The frequency and voltage measurements meet accuracy requirements. The calculated pressure and the standard pressure show excellent linearity, reaching up to 0.9999.
5

Maltsev, Alexander, Olesya Bolkhovskaya, Vadim Sergeev, Anton Elokhin, and Alexander Kokarev. "3D Air-Ground Channel Model Reconstruction Based on the Experimental Data and Q-D Method." In Machine Learning and Artificial Intelligence. IOS Press, 2020. http://dx.doi.org/10.3233/faia200799.

Abstract:
This paper proposes using the Q-D (Quasi-Deterministic) method to reconstruct the angular and frequency characteristics of the 3D Air-to-Ground (A-G) channel from the available experimental data. This method extends the applicability of tapped delay line (TDL) channel models to performance investigation of aviation radio systems with directional antennas and multi-element antenna arrays. The developed 3D A-G channel models also make it possible to take into account the impact of Doppler spread on communication system performance.
6

Sikirda, Yuliya, Mykola Kasatkin, and Dmytro Tkachenko. "Intelligent Automated System for Supporting the Collaborative Decision Making by Operators of the Air Navigation System During Flight Emergencies." In Handbook of Research on Artificial Intelligence Applications in the Aviation and Aerospace Industries, 66–90. IGI Global, 2020. http://dx.doi.org/10.4018/978-1-7998-1415-3.ch003.

Abstract:
This chapter researches pilot and air traffic controller collaborative decision making (CDM) during flight emergencies for maximum synchronization of operators' technological procedures. Deterministic models of CDM by the Air Navigation System's human operators were obtained by network planning methods; their adequacy is confirmed by full-scale modeling on a complex flight simulator. For the sequential optimization of the collaborative two-channel network “Air traffic controller-Pilot” to achieve the end-to-end effectiveness of joint solutions, a multi-criteria approach was used: ensuring the minimum time to parry flight emergency with maximum safety/maximum consistency over the time of operators' actions. With the help of the multiplicative function, the influence of organizational risk factors on flight safety in the air traffic control was evaluated. A conceptual model of System for control and forecasting the flight emergency development on the base of Intelligent Automated System for supporting the CDM by operators was developed.
7

Guire, Joseph Lee, C. Byron Winn, and Patrick J. Burns. "A MULTI-CHANNEL, LOW VELOCITY, HOT FILM ANEMOMETRY SYSTEM FOR MEASURING AIR FLOWS IN BUILDINGS." In Advances In Solar Energy Technology, 3165–69. Elsevier, 1988. http://dx.doi.org/10.1016/b978-0-08-034315-0.50579-6.


Conference papers on the topic "Multi-channel ASR"

1

Gaudesi, Marco, Felix Weninger, Dushyant Sharma, and Puming Zhan. "ChannelAugment: Improving Generalization of Multi-Channel ASR by Training with Input Channel Randomization." In 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE, 2021. http://dx.doi.org/10.1109/asru51503.2021.9687902.

2

Wolf, Martin, and Climent Nadeu. "Channel selection using n-best hypothesis for multi-microphone ASR." In Interspeech 2013. ISCA, 2013. http://dx.doi.org/10.21437/interspeech.2013-266.

3

Yu, Fan, Shiliang Zhang, Pengcheng Guo, Yuhao Liang, Zhihao Du, Yuxiao Lin, and Lei Xie. "MFCCA: Multi-Frame Cross-Channel Attention for Multi-Speaker ASR in Multi-Party Meeting Scenario." In 2022 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2023. http://dx.doi.org/10.1109/slt54892.2023.10022715.

4

Kim, Hyeongju, Hyeonseung Lee, Woo Hyun Kang, Hyung Yong Kim, and Nam Soo Kim. "Robust Front-End for Multi-Channel ASR using Flow-Based Density Estimation." In Twenty-Ninth International Joint Conference on Artificial Intelligence and Seventeenth Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI-20). California: International Joint Conferences on Artificial Intelligence Organization, 2020. http://dx.doi.org/10.24963/ijcai.2020/518.

Abstract:
For multi-channel speech recognition, speech enhancement techniques such as denoising or dereverberation are conventionally applied as a front-end processor. Deep learning-based front-ends using such techniques require aligned clean and noisy speech pairs, which are generally obtained via data simulation. Recently, several joint optimization techniques have been proposed to train the front-end without parallel data within an end-to-end automatic speech recognition (ASR) scheme. However, the ASR objective is sub-optimal and insufficient for fully training the front-end, which still leaves room for improvement. In this paper, we propose a novel approach which incorporates flow-based density estimation for the robust front-end using non-parallel clean and noisy speech. Experimental results on the CHiME-4 dataset show that the proposed method outperforms the conventional techniques where the front-end is trained only with the ASR objective.
5

Heymann, Jahn, Lukas Drude, Christoph Boeddeker, Patrick Hanebrink, and Reinhold Haeb-Umbach. "Beamnet: End-to-end training of a beamformer-supported multi-channel ASR system." In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2017. http://dx.doi.org/10.1109/icassp.2017.7953173.

6

Tu, Yan-Hui, Jun Du, Nan Zhou, and Chin-Hui Lee. "Online LSTM-based Iterative Mask Estimation for Multi-Channel Speech Enhancement and ASR." In 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). IEEE, 2018. http://dx.doi.org/10.23919/apsipa.2018.8659564.

7

Wang, Xiaofei, Naoyuki Kanda, Yashesh Gaur, Zhuo Chen, Zhong Meng, and Takuya Yoshioka. "Exploring End-to-End Multi-Channel ASR with Bias Information for Meeting Transcription." In 2021 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2021. http://dx.doi.org/10.1109/slt48900.2021.9383500.

8

Ollerenshaw, A., M. A. Jalal, and T. Hain. "Insights of Neural Representations in Multi-Banded and Multi-Channel Convolutional Transformers for End-to-End ASR." In 2022 30th European Signal Processing Conference (EUSIPCO). IEEE, 2022. http://dx.doi.org/10.23919/eusipco55093.2022.9909875.

9

Pasa, Luca, Giovanni Morrone, and Leonardo Badino. "An Analysis of Speech Enhancement and Recognition Losses in Limited Resources Multi-Talker Single Channel Audio-Visual ASR." In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2020. http://dx.doi.org/10.1109/icassp40776.2020.9054697.

10

Keskin, Gokce, Minhua Wu, Brian King, Harish Mallidi, Yang Gao, Jasha Droppo, Ariya Rastrow, and Roland Maas. "Do You Listen with one or two Microphones? A Unified ASR Model for Single and Multi-Channel Audio." In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE, 2022. http://dx.doi.org/10.1109/iwaenc53105.2022.9914704.


Reports on the topic "Multi-channel ASR"

1

Goeckeritz, Joel, Nathan Schank, Ryan L Wood, Beverly L Roeder, and Alonzo D Cook. Use of Urinary Bladder Matrix Conduits in a Rat Model of Sciatic Nerve Regeneration after Nerve Transection Injury. Science Repository, December 2022. http://dx.doi.org/10.31487/j.rgm.2022.03.01.

Abstract:
Previous research has demonstrated the use of single-channel porcine-derived urinary bladder matrix (UBM) conduits in segmental-loss, peripheral nerve repairs as comparable to criterion-standard nerve autografts. This study aimed to replicate and expand upon this research with additional novel UBM conduits and coupled therapies. Fifty-four Wistar Albino rats were divided into 6 groups, and each underwent a surgical neurectomy to remove a 7-millimeter section of the sciatic nerve. Bridging of this nerve gap and treatment for each group was as follows: i) reverse autograft—the segmented nerve was reversed 180 degrees and used to reconnect the proximal and distal nerve stumps; ii) the nerve gap was bridged via a silicone conduit; iii) a single-channel UBM conduit; iv) a multi-channel UBM conduit; v) a single-channel UBM conduit identical to group 3 coupled with fortnightly transcutaneous electrical nerve stimulation (TENS); vi) or, a multi-channel UBM conduit identical to group 4 coupled with fortnightly TENS. The extent of nerve recovery was assessed by behavioural parameters: foot fault asymmetry scoring measured weekly for six weeks; electrophysiological parameters: compound muscle action potential (CMAP) amplitudes, measured at weeks 0 and 6; and morphological parameters: total fascicle areas, myelinated fiber counts, fiber densities, and fiber sizes measured at week 6. All the above parameters demonstrated recovery of the test groups (3-6) as being either comparable or less than that of reverse autograft, but none were shown to outperform reverse autograft. As such, UBM conduits may yet prove to be an effective treatment to repair relatively short segmental peripheral nerve injuries, but further research is required to demonstrate greater efficacy over nerve autografts.
2

Goeckeritz, Joel, Nathan Schank, Ryan L Wood, Beverly L Roeder, and Alonzo D Cook. Use of Urinary Bladder Matrix Conduits in a Rat Model of Sciatic Nerve Regeneration after Nerve Transection Injury. Science Repository, December 2022. http://dx.doi.org/10.31487/j.rgm.2022.03.01.sup.

3

Ehiabhi, Jolly, and Haifeng Wang. A Systematic Review of Machine Learning Models in Mental Health Analysis Based on Multi-Channel Multi-Modal Biometric Signals. INPLASY - International Platform of Registered Systematic Review and Meta-analysis Protocols, February 2023. http://dx.doi.org/10.37766/inplasy2023.2.0003.

Abstract:
Review question / Objective: A systematic review of the diagnosis and prognosis of mental disorders using machine learning (ML) techniques with information from biometric signals. The review covers the trend and status of these ML techniques in mental health diagnosis and investigates how these signals are used to help increase the efficiency of mental health disease diagnosis. It considers the use of ML techniques to classify mental health diseases, as opposed to using only expert knowledge for diagnosis, and feature extraction from biometric signals that helps classify sleep disorders. Rationale: To review the application of ML techniques to multimodal and multichannel PSG datasets obtained from biosensors typically used in hospitals, and to help professionals grasp the steps of using machine learning to classify mental health diseases.
4

Budzich, Jeffrey. PR-685-184506-R01 Development of Risk Assessment Procedures and Tools. Chantilly, Virginia: Pipeline Research Council International, Inc. (PRCI), April 2020. http://dx.doi.org/10.55274/r0011668.

Abstract:
Evaluating risks for hydrotechnical threats at pipeline waterway crossings is important to maintaining pipeline integrity. Establishing standards and best practices for evaluating the risks associated with scour, bank erosion, and channel avulsion is essential to developing a consistent process that can be applied across a network of pipeline crossings and provide data to make decisions about prioritizing mitigation and monitoring. A multi-step process including inventory, screening, desktop analysis, and site visit can be applied to compile regional and local data from existing sources as well as site-specific information that informs risk assessment and identification of mitigation measures and monitoring approach.
5

Budzich, Jeffrey. PR-685-184506-R09 Improve Upon Existing Tools to Estimate Hydrotechnical Concerns. Chantilly, Virginia: Pipeline Research Council International, Inc. (PRCI), January 2023. http://dx.doi.org/10.55274/r0012249.

Abstract:
Evaluating risks for hydrotechnical threats at pipeline waterway crossings is important to maintaining pipeline integrity. Establishing standards and best practices for evaluating the risks associated with scour, bank erosion, and channel avulsion is essential to developing a consistent process that can be applied across a network of pipeline crossings and provide data to make decisions about prioritizing mitigation and monitoring. A detailed multi-step process, supported by improved evaluation tools (including a pilot platform), that incorporates inventory, screening, desktop analysis, and site inspections can be applied to compile regional and local data from existing sources as well as site-specific information that informs risk assessment and identification of appropriate monitoring approaches and mitigation measures.
6

Bell, Gary, David Abraham, Nathan Clifton, and Lamkin Kenneth. Wabash and Ohio River confluence hydraulic and sediment transport model investigation : a report for US Army Corps of Engineers, Louisville District. Engineer Research and Development Center (U.S.), March 2022. http://dx.doi.org/10.21079/11681/43441.

Abstract:
Avulsions of the Wabash River in 2008 through 2011 at its confluence with the Ohio River resulted in significant shoaling in the Ohio River. This caused a re-alignment of the navigation channel and the need for frequent dredging. A two-dimensional numerical hydrodynamic model, Adaptive Hydraulics (AdH), was developed to simulate base (existing) conditions and then altered to simulate multiple alternative scenarios to address these sediment issues. The study was conducted in two phases, Phase 1 in 2013 – 2015 and Phase 2 in 2018 – 2020. Field data were collected and consisted of multi-beam bathymetric elevations, bed sediment samples, suspended sediment samples, and discharge and velocity measurements. The model hydrodynamic and sediment transport computations adequately replicated the water surface slope, flow splits, bed sediment gradations, and suspended sediment concentrations when compared with field data. Thus, it was shown to be dependable as a predictive tool. The alternative that produced the most desirable results included a combination of three level-crested emergent dikes on Wabash Island and four submerged dikes on the Illinois shore with a level crest from the bank to the tip of the dike. The selected alternative produced an improved sailing line while maintaining authorized channel depths.
7

Greinert, Jens. Mine Monitoring in the German Baltic Sea 2020; Dumped munition monitoring AL548, 03rd – 16th November 2020, Kiel (Germany) – Kiel (Germany) „MineMoni-II 2020“. GEOMAR Helmholtz Centre for Ocean Research Kiel, 2021. http://dx.doi.org/10.3289/cr_al548.

Abstract:
ALKOR cruise AL548 took place as part of the EMFF (European Maritime and Fisheries Fund)-funded project BASTA (Boost Applied munition detection through Smart data inTegration and AI workflows; https://www.basta-munition.eu) and as continuation of the munition monitoring started within the BMBF-funded project UDEMM (Environmental Monitoring for the Delaboration of Munition in the Sea; https://udemm.geomar.de/). In October 2018, a first cruise (POS530 MineMoni2018) was conducted, to gather data for a broad baseline study in the German Baltic Sea. Results show a moderate contamination level on regional and coastal scale, but indicate higher levels for specific local areas. Within UDEMM, expertise was developed to detect, exactly locate and monitor munition (e.g. torpedoes, sea mines, ground mines) on the seafloor using optical and hydroacoustic means. In addition, chemical analyses of dissolved contaminants in the water and sediments was performed. Data acquired during this cruise are used in BASTA, which aims for enhanced munition detection via AUV-based artificial intelligence applied on multi-sensor datasets. At the same time, the project ExPloTect (Ex-situ, near-real-time exPlosive compound deTection in seawater) (also EMFF-funded) addresses the need for an innovative approach to detect explosive compounds in seawater. A prototype system was used and successfully tested for the first time during this cruise. The main focus was placed onto the two already known dumpsites Kolberger Heide and Lübeck Bight. Additionally, new areas Falshöft (Schleswig-Holstein) and Cadet Channel, Trollegrund and Großklützhöved (Mecklenburg-Vorpommern) were explored. In each area high-resolution multibeam mapping was performed and contact lists, indicating potential munition objects were produced on board. AUV surveys were conducted to ground-truth possible contacts via detailed photograph and magnetometer mapping. This was complemented with towed video (TV)-CTD profiles. 
The transits to and between those sites were planned along former constraint routes during WWII. These routes were main targets of the British Air Force, and mines and bombs can be expected along them. During transits, water samples were taken at regular intervals with Niskin bottles mounted on a CTD (conductivity, temperature, depth) rosette, in order to obtain a comprehensive picture of munition compound (inter alia trinitrotoluene (TNT)) measurements across the German Baltic Sea.
8

Hudgens, Bian, Jene Michaud, Megan Ross, Pamela Scheffler, Anne Brasher, Megan Donahue, Alan Friedlander, et al. Natural resource condition assessment: Puʻuhonua o Hōnaunau National Historical Park. National Park Service, September 2022. http://dx.doi.org/10.36967/2293943.

Abstract:
Natural Resource Condition Assessments (NRCAs) evaluate current conditions of natural resources and resource indicators in national park units (parks). NRCAs are meant to complement—not replace—traditional issue- and threat-based resource assessments. NRCAs employ a multi-disciplinary, hierarchical framework within which reference conditions for natural resource indicators are developed for comparison against current conditions. NRCAs do not set management targets for study indicators, and reference conditions are not necessarily ideal or target conditions. The goal of a NRCA is to deliver science-based information that will assist park managers in their efforts to describe and quantify a park’s desired resource conditions and management targets, and inform management practices related to natural resource stewardship. The resources and indicators emphasized in a given NRCA depend on the park’s resource setting, status of resource stewardship planning and science in identifying high-priority indicators, and availability of data and expertise to assess current conditions for a variety of potential study resources and indicators. Puʻuhonua o Hōnaunau National Historical Park (hereafter Puʻuhonua o Hōnaunau NHP) encompasses 1.7 km2 (0.7 mi2) at the base of the Mauna Loa Volcano on the Kona coast of the island of Hawaiʻi. The Kona coast of Hawaiʻi Island is characterized by calm winds that increase in the late morning to evening hours, especially in the summer when there is also a high frequency of late afternoon or early evening showers. The climate is mild, with mean high temperature of 26.2° C (79.2° F) and a mean low temperature of 16.6° C (61.9° F) and receiving on average 66 cm (26 in) of rainfall per year. The Kona coast is the only region in Hawaiʻi where more precipitation falls in the summer than in the winter. 
There is limited surface water runoff or stream development at Puʻuhonua o Hōnaunau NHP due to the relatively recent lava flows (less than 1,500 years old) overlaying much of the park. Kiʻilae Stream is the only watercourse within the park. Kiʻilae Stream is ephemeral, with occasional flows and a poorly characterized channel within the park. A stream gauge was located uphill from the park, but no measurements have been taken since 1982. Floods in Kiʻilae Stream do occur, resulting in transport of fluvial sediment to the ocean, but there are no data documenting this phenomenon. There are a small number of naturally occurring anchialine pools occupying cracks and small depressions in the lava flows, including the Royal Fishponds; an anchialine pool modified for the purpose of holding fish. Although the park’s legal boundaries end at the high tide mark, the sense of place, story, and visitor experience would be completely different without the marine waters adjacent to the park. Six resource elements were chosen for evaluation: air and night sky, water-related processes, terrestrial vegetation, vertebrates, anchialine pools, and marine resources. Resource conditions were determined through reviewing existing literature, meta-analysis, and where appropriate, analysis of unpublished short- and long-term datasets. However, in a number of cases, data were unavailable or insufficient to either establish a quantitative reference condition or conduct a formal statistical comparison of the status of a resource within the park to a quantitative reference condition. In those cases, data gaps are noted, and comparisons were made based on qualitative descriptions. Overall, the condition of natural resources within Puʻuhonua o Hōnaunau NHP reflects the surrounding landscape. The coastal lands immediately surrounding Puʻuhonua o Hōnaunau NHP are zoned for conservation, while adjacent lands away from the coast are agricultural. 
The condition of most natural resources at Puʻuhonua o Hōnaunau NHP reflects the overall condition of ecological communities on the west Hawaiʻi coast. Although little of the park’s vegetation...