Journal articles on the topic 'Speech diarization'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Speech diarization.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Mertens, Robert, Po-Sen Huang, Luke Gottlieb, Gerald Friedland, Ajay Divakaran, and Mark Hasegawa-Johnson. "On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks." International Journal of Multimedia Data Engineering and Management 3, no. 3 (July 2012): 1–19. http://dx.doi.org/10.4018/jmdem.2012070101.
Full textAstapov, Sergei, Aleksei Gusev, Marina Volkova, Aleksei Logunov, Valeriia Zaluskaia, Vlada Kapranova, Elena Timofeeva, Elena Evseeva, Vladimir Kabarov, and Yuri Matveev. "Application of Fusion of Various Spontaneous Speech Analytics Methods for Improving Far-Field Neural-Based Diarization." Mathematics 9, no. 23 (November 23, 2021): 2998. http://dx.doi.org/10.3390/math9232998.
Full textLyu, Ke-Ming, Ren-yuan Lyu, and Hsien-Tsung Chang. "Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation." PeerJ Computer Science 10 (March 29, 2024): e1973. http://dx.doi.org/10.7717/peerj-cs.1973.
Full textPrabhala, Jagat Chaitanya, Venkatnareshbabu K, and Ragoju Ravi. "OPTIMIZING SIMILARITY THRESHOLD FOR ABSTRACT SIMILARITY METRIC IN SPEECH DIARIZATION SYSTEMS: A MATHEMATICAL FORMULATION." Applied Mathematics and Sciences An International Journal (MathSJ) 10, no. 1/2 (June 26, 2023): 1–10. http://dx.doi.org/10.5121/mathsj.2023.10201.
Full textV, Sethuram, Ande Prasad, and R. Rajeswara Rao. "Metaheuristic adapted convolutional neural network for Telugu speaker diarization." Intelligent Decision Technologies 15, no. 4 (January 10, 2022): 561–77. http://dx.doi.org/10.3233/idt-211005.
Full textMurali, Abhejay, Satwik Dutta, Meena Chandra Shekar, Dwight Irvin, Jay Buzhardt, and John H. Hansen. "Towards developing speaker diarization for parent-child interactions." Journal of the Acoustical Society of America 152, no. 4 (October 2022): A61. http://dx.doi.org/10.1121/10.0015551.
Full textTaha, Thaer Mufeed, Zaineb Ben Messaoud, and Mondher Frikha. "Convolutional Neural Network Architectures for Gender, Emotional Detection from Speech and Speaker Diarization." International Journal of Interactive Mobile Technologies (iJIM) 18, no. 03 (February 9, 2024): 88–103. http://dx.doi.org/10.3991/ijim.v18i03.43013.
Full textKothalkar, Prasanna V., John H. L. Hansen, Dwight Irvin, and Jay Buzhardt. "Child-adult speech diarization in naturalistic conditions of preschool classrooms using room-independent ResNet model and automatic speech recognition-based re-segmentation." Journal of the Acoustical Society of America 155, no. 2 (February 1, 2024): 1198–215. http://dx.doi.org/10.1121/10.0024353.
Full textKshirod, Kshirod Sarmah. "Speaker Diarization with Deep Learning Techniques." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 11, no. 3 (December 15, 2020): 2570–82. http://dx.doi.org/10.61841/turcomat.v11i3.14309.
Full textLleida, Eduardo, Alfonso Ortega, Antonio Miguel, Virginia Bazán-Gil, Carmen Pérez, Manuel Gómez, and Alberto de Prada. "Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media." Applied Sciences 9, no. 24 (December 11, 2019): 5412. http://dx.doi.org/10.3390/app9245412.
Full textAhmad, Rehan, Syed Zubair, and Hani Alquhayz. "Speech Enhancement for Multimodal Speaker Diarization System." IEEE Access 8 (2020): 126671–80. http://dx.doi.org/10.1109/access.2020.3007312.
Full textKothalkar, Prasanna V., Dwight Irvin, Jay Buzhardt, and John H. Hansen. "End-to-end child-adult speech diarization in naturalistic conditions of preschool classrooms." Journal of the Acoustical Society of America 153, no. 3_supplement (March 1, 2023): A174. http://dx.doi.org/10.1121/10.0018568.
Full textKaur, Sukhvinder, and J. S. Sohal. "Speech Activity Detection and its Evaluation in Speaker Diarization System." INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY 16, no. 1 (March 13, 2017): 7567–72. http://dx.doi.org/10.24297/ijct.v16i1.5893.
Full textHansen, John H., Aditya Joglekar, and Meena Chandra Shekar. "Fearless steps Apollo: Advancements in robust speech technologies and naturalistic corpus development from Earth to the Moon." Journal of the Acoustical Society of America 152, no. 4 (October 2022): A61. http://dx.doi.org/10.1121/10.0015549.
Full textSultan, Wael Ali, Mourad Samir Semary, and Sherif Mahdy Abdou. "An Efficient Speaker Diarization Pipeline for Conversational Speech." Benha Journal of Applied Sciences 9, no. 5 (May 29, 2024): 141–46. http://dx.doi.org/10.21608/bjas.2024.284482.1414.
Full textKone, Tenon Charly, Sebastian Ghinet, Sayed Ahmed Dana, and Anant Grewal. "Speech detection models for effective communicable disease risk assessment in air travel environments." Journal of the Acoustical Society of America 155, no. 3_Supplement (March 1, 2024): A277. http://dx.doi.org/10.1121/10.0027492.
Full textZelenak, Martin, Carlos Segura, Jordi Luque, and Javier Hernando. "Simultaneous Speech Detection With Spatial Features for Speaker Diarization." IEEE Transactions on Audio, Speech, and Language Processing 20, no. 2 (February 2012): 436–46. http://dx.doi.org/10.1109/tasl.2011.2160167.
Full textViñals, Ignacio, Alfonso Ortega, Antonio Miguel, and Eduardo Lleida. "The Domain Mismatch Problem in the Broadcast Speaker Attribution Task." Applied Sciences 11, no. 18 (September 14, 2021): 8521. http://dx.doi.org/10.3390/app11188521.
Full textIndu D. "A Methodology for Speaker Diazaration System Based on LSTM and MFCC Coefficients." Journal of Electrical Systems 20, no. 6s (May 2, 2024): 2938–45. http://dx.doi.org/10.52783/jes.3299.
Full textSathyapriya, S., and A. Indhumathi. "An Efficient Speaker Diarization using Privacy Preserving Audio Features Based of Speech/Non Speech Detection." International Journal of Computer Trends and Technology 9, no. 4 (March 25, 2014): 184–87. http://dx.doi.org/10.14445/22312803/ijctt-v9p136.
Full textHuang, Zili, Marc Delcroix, Leibny Paola Garcia, Shinji Watanabe, Desh Raj, and Sanjeev Khudanpur. "Joint speaker diarization and speech recognition based on region proposal networks." Computer Speech & Language 72 (March 2022): 101316. http://dx.doi.org/10.1016/j.csl.2021.101316.
Full textKhoma, Volodymyr, Yuriy Khoma, Vitalii Brydinskyi, and Alexander Konovalov. "Development of Supervised Speaker Diarization System Based on the PyAnnote Audio Processing Library." Sensors 23, no. 4 (February 13, 2023): 2082. http://dx.doi.org/10.3390/s23042082.
Full textJung, Dahae, Min-Kyoung Bae, Man Yong Choi, Eui Chul Lee, and Jinoo Joung. "Speaker diarization method of telemarketer and client for improving speech dictation performance." Journal of Supercomputing 72, no. 5 (July 3, 2015): 1757–69. http://dx.doi.org/10.1007/s11227-015-1470-4.
Full textZhu, Qiushi, Jie Zhang, Yu Gu, Yuchen Hu, and Lirong Dai. "Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 17 (March 24, 2024): 19768–76. http://dx.doi.org/10.1609/aaai.v38i17.29951.
Full textPapala, Gowtham, Aniket Ransing, and Pooja Jain. "Sentiment Analysis and Speaker Diarization in Hindi and Marathi Using using Finetuned Whisper." Scalable Computing: Practice and Experience 24, no. 4 (November 17, 2023): 835–46. http://dx.doi.org/10.12694/scpe.v24i4.2248.
Full textSenoussaoui, Mohammed, Patrick Kenny, Themos Stafylakis, and Pierre Dumouchel. "A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization." IEEE/ACM Transactions on Audio, Speech, and Language Processing 22, no. 1 (January 2014): 217–27. http://dx.doi.org/10.1109/taslp.2013.2285474.
Full textVryzas, Nikolaos, Nikolaos Tsipas, and Charalampos Dimoulas. "Web Radio Automation for Audio Stream Management in the Era of Big Data." Information 11, no. 4 (April 11, 2020): 205. http://dx.doi.org/10.3390/info11040205.
Full textLleida, Eduardo, Luis Javier Rodriguez-Fuentes, Javier Tejedor, Alfonso Ortega, Antonio Miguel, Virginia Bazán, Carmen Pérez, et al. "An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies." Applied Sciences 13, no. 15 (July 25, 2023): 8577. http://dx.doi.org/10.3390/app13158577.
Full textHansen, John H. L., Maryam Najafian, Rasa Lileikyte, Dwight Irvin, and Beth Rous. "Speech and language processing for assessing child–adult interaction based on diarization and location." International Journal of Speech Technology 22, no. 3 (June 5, 2019): 697–709. http://dx.doi.org/10.1007/s10772-019-09590-0.
Full textCerva, Petr, Jan Silovsky, Jindrich Zdansky, Jan Nouza, and Ladislav Seps. "Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives." Speech Communication 55, no. 10 (November 2013): 1033–46. http://dx.doi.org/10.1016/j.specom.2013.06.017.
Full textJoglekar, Aditya, Ivan Lopez-Espejo, and John H. Hansen. "Fearless Steps APOLLO: Challenges in keyword spotting and topic detection for naturalistic audio streams." Journal of the Acoustical Society of America 153, no. 3_supplement (March 1, 2023): A173. http://dx.doi.org/10.1121/10.0018566.
Full textXiao, Bo, Chewei Huang, Zac E. Imel, David C. Atkins, Panayiotis Georgiou, and Shrikanth S. Narayanan. "A technology prototype system for rating therapist empathy from audio recordings in addiction counseling." PeerJ Computer Science 2 (April 20, 2016): e59. http://dx.doi.org/10.7717/peerj-cs.59.
Full textKalanadhabhatta, Manasa, Mohammad Mehdi Rastikerdar, Tauhidur Rahman, Adam S. Grabell, and Deepak Ganesan. "Playlogue: Dataset and Benchmarks for Analyzing Adult-Child Conversations During Play." Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 8, no. 4 (November 21, 2024): 1–34. http://dx.doi.org/10.1145/3699775.
Full textDi Cesare, Michele Giuseppe, David Perpetuini, Daniela Cardone, and Arcangelo Merla. "Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson’s Disease: A Study on Speaker Diarization and Classification Techniques." Sensors 24, no. 5 (February 26, 2024): 1499. http://dx.doi.org/10.3390/s24051499.
Full textYella, Sree Harsha, and Herve Bourlard. "Overlapping Speech Detection Using Long-Term Conversational Features for Speaker Diarization in Meeting Room Conversations." IEEE/ACM Transactions on Audio, Speech, and Language Processing 22, no. 12 (December 2014): 1688–700. http://dx.doi.org/10.1109/taslp.2014.2346315.
Full textGhorbani, Shahram, and John H. L. Hansen. "Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition." Journal of the Acoustical Society of America 155, no. 6 (June 1, 2024): 3848–60. http://dx.doi.org/10.1121/10.0026235.
Full textAnmella, Gerard, Michele De Prisco, Jeremiah B. Joyce, Claudia Valenzuela-Pascual, Ariadna Mas-Musons, Vincenzo Oliva, Giovanna Fico, et al. "Automated Speech Analysis in Bipolar Disorder: The CALIBER Study Protocol and Preliminary Results." Journal of Clinical Medicine 13, no. 17 (August 23, 2024): 4997. http://dx.doi.org/10.3390/jcm13174997.
Full textZeulner, Tobias, Gerhard Johann Hagerer, Moritz Müller, Ignacio Vazquez, and Peter A. Gloor. "Predicting Individual Well-Being in Teamwork Contexts Based on Speech Features." Information 15, no. 4 (April 12, 2024): 217. http://dx.doi.org/10.3390/info15040217.
Full textKaur, Sukhvinder, Chander Prabha, Ravinder Pal Singh, Deepali Gupta, Sapna Juneja, Punit Gupta, and Ali Nauman. "Optimized technique for speaker changes detection in multispeaker audio recording using pyknogram and efficient distance metric." PLOS ONE 19, no. 11 (November 20, 2024): e0314073. http://dx.doi.org/10.1371/journal.pone.0314073.
Full textDelgado, Héctor, Anna Matamala, and Javier Serrano. "Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?" Cadernos de Tradução 35, no. 2 (June 17, 2015): 308. http://dx.doi.org/10.5007/2175-7968.2015v35n2p308.
Full textDiez, Mireia, Lukas Burget, Federico Landini, and Jan Cernocky. "Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 355–68. http://dx.doi.org/10.1109/taslp.2019.2955293.
Full textDawalatabad, Nauman, Srikanth Madikeri, C. Chandra Sekhar, and Hema A. Murthy. "Novel Architectures for Unsupervised Information Bottleneck Based Speaker Diarization of Meetings." IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2021): 14–27. http://dx.doi.org/10.1109/taslp.2020.3036231.
Full textO’Malley, Ronan, Bahman Mirhedari, Kirsty Harkness, Markus Reuber, Annalena Venneri, Heidi Christensen, and Daniel Blackburn. "055 The digital doctor: a fully automated stratification and monitoring system for patients with memory complaints." Journal of Neurology, Neurosurgery & Psychiatry 90, no. 12 (November 14, 2019): A23.2—A23. http://dx.doi.org/10.1136/jnnp-2019-abn-2.76.
Full textDing, Huitong, Adrian Lister, Cody Karjadi, Rhoda Au, Honghuang Lin, Brian Bischoff, and Phillip Hwang. "EARLY DETECTION OF ALZHEIMER’S DISEASE AND RELATED DEMENTIAS FROM VOICE RECORDINGS: THE FRAMINGHAM HEART STUDY." Innovation in Aging 7, Supplement_1 (December 1, 2023): 1024. http://dx.doi.org/10.1093/geroni/igad104.3291.
Full textPraharaj, Sambit, Maren Scheffel, Marcel Schmitz, Marcus Specht, and Hendrik Drachsler. "Towards Automatic Collaboration Analytics for Group Speech Data Using Learning Analytics." Sensors 21, no. 9 (May 2, 2021): 3156. http://dx.doi.org/10.3390/s21093156.
Full textHershkovich, Leeor, Sabyasachi Bandyopadhyay, Jack Wittmayer, Patrick Tighe, David J. Libon, Catherine C. Price, and Parisa Rashidi. "96 Proof of Principle: Can Paragraph Recall Pauses and Speech Frequencies Correctly Classify Cognitively Compromised Older Adults?" Journal of the International Neuropsychological Society 29, s1 (November 2023): 767–68. http://dx.doi.org/10.1017/s1355617723009530.
Full textMcDonald, Margarethe, Taeahn Kwon, Hyunji Kim, Youngki Lee, and Eon-Suk Ko. "Evaluating the Language ENvironment Analysis System for Korean." Journal of Speech, Language, and Hearing Research 64, no. 3 (March 17, 2021): 792–808. http://dx.doi.org/10.1044/2020_jslhr-20-00489.
Full textKumar, Krishna. "Speaker Diarization: A Review." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 07, no. 06 (June 24, 2023). http://dx.doi.org/10.55041/ijsrem24075.
Full textXu, Sean Shensheng, Xiaoquan Ke, Man-Wai Mak, Ka Ho Wong, Helen Meng, Timothy C. Y. Kwok, Jason Gu, Jian Zhang, Wei Tao, and Chunqi Chang. "Speaker-turn aware diarization for speech-based cognitive assessments." Frontiers in Neuroscience 17 (January 16, 2024). http://dx.doi.org/10.3389/fnins.2023.1351848.
Full textRoberto Sánchez Cárdenas and Marvin Coto-Jiménez. "Application of Fischer semi discriminant analysis for speaker diarization in costa rican radio broadcasts." Revista Tecnología en Marcha, November 16, 2022. http://dx.doi.org/10.18845/tm.v35i8.6464.
Full text