Artículos de revistas sobre el tema "Speech diarization"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte los 50 mejores artículos de revistas para su investigación sobre el tema "Speech diarization".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Explore artículos de revistas sobre una amplia variedad de disciplinas y organice su bibliografía correctamente.
Mertens, Robert, Po-Sen Huang, Luke Gottlieb, Gerald Friedland, Ajay Divakaran y Mark Hasegawa-Johnson. "On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks". International Journal of Multimedia Data Engineering and Management 3, n.º 3 (julio de 2012): 1–19. http://dx.doi.org/10.4018/jmdem.2012070101.
Texto completoAstapov, Sergei, Aleksei Gusev, Marina Volkova, Aleksei Logunov, Valeriia Zaluskaia, Vlada Kapranova, Elena Timofeeva, Elena Evseeva, Vladimir Kabarov y Yuri Matveev. "Application of Fusion of Various Spontaneous Speech Analytics Methods for Improving Far-Field Neural-Based Diarization". Mathematics 9, n.º 23 (23 de noviembre de 2021): 2998. http://dx.doi.org/10.3390/math9232998.
Texto completoLyu, Ke-Ming, Ren-yuan Lyu y Hsien-Tsung Chang. "Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation". PeerJ Computer Science 10 (29 de marzo de 2024): e1973. http://dx.doi.org/10.7717/peerj-cs.1973.
Texto completoPrabhala, Jagat Chaitanya, Venkatnareshbabu K y Ragoju Ravi. "OPTIMIZING SIMILARITY THRESHOLD FOR ABSTRACT SIMILARITY METRIC IN SPEECH DIARIZATION SYSTEMS: A MATHEMATICAL FORMULATION". Applied Mathematics and Sciences An International Journal (MathSJ) 10, n.º 1/2 (26 de junio de 2023): 1–10. http://dx.doi.org/10.5121/mathsj.2023.10201.
Texto completoV, Sethuram, Ande Prasad y R. Rajeswara Rao. "Metaheuristic adapted convolutional neural network for Telugu speaker diarization". Intelligent Decision Technologies 15, n.º 4 (10 de enero de 2022): 561–77. http://dx.doi.org/10.3233/idt-211005.
Texto completoMurali, Abhejay, Satwik Dutta, Meena Chandra Shekar, Dwight Irvin, Jay Buzhardt y John H. Hansen. "Towards developing speaker diarization for parent-child interactions". Journal of the Acoustical Society of America 152, n.º 4 (octubre de 2022): A61. http://dx.doi.org/10.1121/10.0015551.
Texto completoTaha, Thaer Mufeed, Zaineb Ben Messaoud y Mondher Frikha. "Convolutional Neural Network Architectures for Gender, Emotional Detection from Speech and Speaker Diarization". International Journal of Interactive Mobile Technologies (iJIM) 18, n.º 03 (9 de febrero de 2024): 88–103. http://dx.doi.org/10.3991/ijim.v18i03.43013.
Texto completoKothalkar, Prasanna V., John H. L. Hansen, Dwight Irvin y Jay Buzhardt. "Child-adult speech diarization in naturalistic conditions of preschool classrooms using room-independent ResNet model and automatic speech recognition-based re-segmentation". Journal of the Acoustical Society of America 155, n.º 2 (1 de febrero de 2024): 1198–215. http://dx.doi.org/10.1121/10.0024353.
Texto completoKshirod, Kshirod Sarmah. "Speaker Diarization with Deep Learning Techniques". Turkish Journal of Computer and Mathematics Education (TURCOMAT) 11, n.º 3 (15 de diciembre de 2020): 2570–82. http://dx.doi.org/10.61841/turcomat.v11i3.14309.
Texto completoLleida, Eduardo, Alfonso Ortega, Antonio Miguel, Virginia Bazán-Gil, Carmen Pérez, Manuel Gómez y Alberto de Prada. "Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media". Applied Sciences 9, n.º 24 (11 de diciembre de 2019): 5412. http://dx.doi.org/10.3390/app9245412.
Texto completoAhmad, Rehan, Syed Zubair y Hani Alquhayz. "Speech Enhancement for Multimodal Speaker Diarization System". IEEE Access 8 (2020): 126671–80. http://dx.doi.org/10.1109/access.2020.3007312.
Texto completoKothalkar, Prasanna V., Dwight Irvin, Jay Buzhardt y John H. Hansen. "End-to-end child-adult speech diarization in naturalistic conditions of preschool classrooms". Journal of the Acoustical Society of America 153, n.º 3_supplement (1 de marzo de 2023): A174. http://dx.doi.org/10.1121/10.0018568.
Texto completoKaur, Sukhvinder y J. S. Sohal. "Speech Activity Detection and its Evaluation in Speaker Diarization System". INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY 16, n.º 1 (13 de marzo de 2017): 7567–72. http://dx.doi.org/10.24297/ijct.v16i1.5893.
Texto completoHansen, John H., Aditya Joglekar y Meena Chandra Shekar. "Fearless steps Apollo: Advancements in robust speech technologies and naturalistic corpus development from Earth to the Moon". Journal of the Acoustical Society of America 152, n.º 4 (octubre de 2022): A61. http://dx.doi.org/10.1121/10.0015549.
Texto completoSultan, Wael Ali, Mourad Samir Semary y Sherif Mahdy Abdou. "An Efficient Speaker Diarization Pipeline for Conversational Speech". Benha Journal of Applied Sciences 9, n.º 5 (29 de mayo de 2024): 141–46. http://dx.doi.org/10.21608/bjas.2024.284482.1414.
Texto completoKone, Tenon Charly, Sebastian Ghinet, Sayed Ahmed Dana y Anant Grewal. "Speech detection models for effective communicable disease risk assessment in air travel environments". Journal of the Acoustical Society of America 155, n.º 3_Supplement (1 de marzo de 2024): A277. http://dx.doi.org/10.1121/10.0027492.
Texto completoZelenak, Martin, Carlos Segura, Jordi Luque y Javier Hernando. "Simultaneous Speech Detection With Spatial Features for Speaker Diarization". IEEE Transactions on Audio, Speech, and Language Processing 20, n.º 2 (febrero de 2012): 436–46. http://dx.doi.org/10.1109/tasl.2011.2160167.
Texto completoViñals, Ignacio, Alfonso Ortega, Antonio Miguel y Eduardo Lleida. "The Domain Mismatch Problem in the Broadcast Speaker Attribution Task". Applied Sciences 11, n.º 18 (14 de septiembre de 2021): 8521. http://dx.doi.org/10.3390/app11188521.
Texto completoIndu D. "A Methodology for Speaker Diazaration System Based on LSTM and MFCC Coefficients". Journal of Electrical Systems 20, n.º 6s (2 de mayo de 2024): 2938–45. http://dx.doi.org/10.52783/jes.3299.
Texto completoSathyapriya, S. y A. Indhumathi. "An Efficient Speaker Diarization using Privacy Preserving Audio Features Based of Speech/Non Speech Detection". International Journal of Computer Trends and Technology 9, n.º 4 (25 de marzo de 2014): 184–87. http://dx.doi.org/10.14445/22312803/ijctt-v9p136.
Texto completoHuang, Zili, Marc Delcroix, Leibny Paola Garcia, Shinji Watanabe, Desh Raj y Sanjeev Khudanpur. "Joint speaker diarization and speech recognition based on region proposal networks". Computer Speech & Language 72 (marzo de 2022): 101316. http://dx.doi.org/10.1016/j.csl.2021.101316.
Texto completoKhoma, Volodymyr, Yuriy Khoma, Vitalii Brydinskyi y Alexander Konovalov. "Development of Supervised Speaker Diarization System Based on the PyAnnote Audio Processing Library". Sensors 23, n.º 4 (13 de febrero de 2023): 2082. http://dx.doi.org/10.3390/s23042082.
Texto completoJung, Dahae, Min-Kyoung Bae, Man Yong Choi, Eui Chul Lee y Jinoo Joung. "Speaker diarization method of telemarketer and client for improving speech dictation performance". Journal of Supercomputing 72, n.º 5 (3 de julio de 2015): 1757–69. http://dx.doi.org/10.1007/s11227-015-1470-4.
Texto completoZhu, Qiushi, Jie Zhang, Yu Gu, Yuchen Hu y Lirong Dai. "Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation". Proceedings of the AAAI Conference on Artificial Intelligence 38, n.º 17 (24 de marzo de 2024): 19768–76. http://dx.doi.org/10.1609/aaai.v38i17.29951.
Texto completoPapala, Gowtham, Aniket Ransing y Pooja Jain. "Sentiment Analysis and Speaker Diarization in Hindi and Marathi Using using Finetuned Whisper". Scalable Computing: Practice and Experience 24, n.º 4 (17 de noviembre de 2023): 835–46. http://dx.doi.org/10.12694/scpe.v24i4.2248.
Texto completoSenoussaoui, Mohammed, Patrick Kenny, Themos Stafylakis y Pierre Dumouchel. "A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization". IEEE/ACM Transactions on Audio, Speech, and Language Processing 22, n.º 1 (enero de 2014): 217–27. http://dx.doi.org/10.1109/taslp.2013.2285474.
Texto completoVryzas, Nikolaos, Nikolaos Tsipas y Charalampos Dimoulas. "Web Radio Automation for Audio Stream Management in the Era of Big Data". Information 11, n.º 4 (11 de abril de 2020): 205. http://dx.doi.org/10.3390/info11040205.
Texto completoLleida, Eduardo, Luis Javier Rodriguez-Fuentes, Javier Tejedor, Alfonso Ortega, Antonio Miguel, Virginia Bazán, Carmen Pérez et al. "An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies". Applied Sciences 13, n.º 15 (25 de julio de 2023): 8577. http://dx.doi.org/10.3390/app13158577.
Texto completoHansen, John H. L., Maryam Najafian, Rasa Lileikyte, Dwight Irvin y Beth Rous. "Speech and language processing for assessing child–adult interaction based on diarization and location". International Journal of Speech Technology 22, n.º 3 (5 de junio de 2019): 697–709. http://dx.doi.org/10.1007/s10772-019-09590-0.
Texto completoCerva, Petr, Jan Silovsky, Jindrich Zdansky, Jan Nouza y Ladislav Seps. "Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives". Speech Communication 55, n.º 10 (noviembre de 2013): 1033–46. http://dx.doi.org/10.1016/j.specom.2013.06.017.
Texto completoJoglekar, Aditya, Ivan Lopez-Espejo y John H. Hansen. "Fearless Steps APOLLO: Challenges in keyword spotting and topic detection for naturalistic audio streams". Journal of the Acoustical Society of America 153, n.º 3_supplement (1 de marzo de 2023): A173. http://dx.doi.org/10.1121/10.0018566.
Texto completoXiao, Bo, Chewei Huang, Zac E. Imel, David C. Atkins, Panayiotis Georgiou y Shrikanth S. Narayanan. "A technology prototype system for rating therapist empathy from audio recordings in addiction counseling". PeerJ Computer Science 2 (20 de abril de 2016): e59. http://dx.doi.org/10.7717/peerj-cs.59.
Texto completoKalanadhabhatta, Manasa, Mohammad Mehdi Rastikerdar, Tauhidur Rahman, Adam S. Grabell y Deepak Ganesan. "Playlogue: Dataset and Benchmarks for Analyzing Adult-Child Conversations During Play". Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 8, n.º 4 (21 de noviembre de 2024): 1–34. http://dx.doi.org/10.1145/3699775.
Texto completoDi Cesare, Michele Giuseppe, David Perpetuini, Daniela Cardone y Arcangelo Merla. "Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson’s Disease: A Study on Speaker Diarization and Classification Techniques". Sensors 24, n.º 5 (26 de febrero de 2024): 1499. http://dx.doi.org/10.3390/s24051499.
Texto completoYella, Sree Harsha y Herve Bourlard. "Overlapping Speech Detection Using Long-Term Conversational Features for Speaker Diarization in Meeting Room Conversations". IEEE/ACM Transactions on Audio, Speech, and Language Processing 22, n.º 12 (diciembre de 2014): 1688–700. http://dx.doi.org/10.1109/taslp.2014.2346315.
Texto completoGhorbani, Shahram y John H. L. Hansen. "Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition". Journal of the Acoustical Society of America 155, n.º 6 (1 de junio de 2024): 3848–60. http://dx.doi.org/10.1121/10.0026235.
Texto completoAnmella, Gerard, Michele De Prisco, Jeremiah B. Joyce, Claudia Valenzuela-Pascual, Ariadna Mas-Musons, Vincenzo Oliva, Giovanna Fico et al. "Automated Speech Analysis in Bipolar Disorder: The CALIBER Study Protocol and Preliminary Results". Journal of Clinical Medicine 13, n.º 17 (23 de agosto de 2024): 4997. http://dx.doi.org/10.3390/jcm13174997.
Texto completoZeulner, Tobias, Gerhard Johann Hagerer, Moritz Müller, Ignacio Vazquez y Peter A. Gloor. "Predicting Individual Well-Being in Teamwork Contexts Based on Speech Features". Information 15, n.º 4 (12 de abril de 2024): 217. http://dx.doi.org/10.3390/info15040217.
Texto completoKaur, Sukhvinder, Chander Prabha, Ravinder Pal Singh, Deepali Gupta, Sapna Juneja, Punit Gupta y Ali Nauman. "Optimized technique for speaker changes detection in multispeaker audio recording using pyknogram and efficient distance metric". PLOS ONE 19, n.º 11 (20 de noviembre de 2024): e0314073. http://dx.doi.org/10.1371/journal.pone.0314073.
Texto completoDelgado, Héctor, Anna Matamala y Javier Serrano. "Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?" Cadernos de Tradução 35, n.º 2 (17 de junio de 2015): 308. http://dx.doi.org/10.5007/2175-7968.2015v35n2p308.
Texto completoDiez, Mireia, Lukas Burget, Federico Landini y Jan Cernocky. "Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors". IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 355–68. http://dx.doi.org/10.1109/taslp.2019.2955293.
Texto completoDawalatabad, Nauman, Srikanth Madikeri, C. Chandra Sekhar y Hema A. Murthy. "Novel Architectures for Unsupervised Information Bottleneck Based Speaker Diarization of Meetings". IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2021): 14–27. http://dx.doi.org/10.1109/taslp.2020.3036231.
Texto completoO’Malley, Ronan, Bahman Mirhedari, Kirsty Harkness, Markus Reuber, Annalena Venneri, Heidi Christensen y Daniel Blackburn. "055 The digital doctor: a fully automated stratification and monitoring system for patients with memory complaints". Journal of Neurology, Neurosurgery & Psychiatry 90, n.º 12 (14 de noviembre de 2019): A23.2—A23. http://dx.doi.org/10.1136/jnnp-2019-abn-2.76.
Texto completoDing, Huitong, Adrian Lister, Cody Karjadi, Rhoda Au, Honghuang Lin, Brian Bischoff y Phillip Hwang. "EARLY DETECTION OF ALZHEIMER’S DISEASE AND RELATED DEMENTIAS FROM VOICE RECORDINGS: THE FRAMINGHAM HEART STUDY". Innovation in Aging 7, Supplement_1 (1 de diciembre de 2023): 1024. http://dx.doi.org/10.1093/geroni/igad104.3291.
Texto completoPraharaj, Sambit, Maren Scheffel, Marcel Schmitz, Marcus Specht y Hendrik Drachsler. "Towards Automatic Collaboration Analytics for Group Speech Data Using Learning Analytics". Sensors 21, n.º 9 (2 de mayo de 2021): 3156. http://dx.doi.org/10.3390/s21093156.
Texto completoHershkovich, Leeor, Sabyasachi Bandyopadhyay, Jack Wittmayer, Patrick Tighe, David J. Libon, Catherine C. Price y Parisa Rashidi. "96 Proof of Principle: Can Paragraph Recall Pauses and Speech Frequencies Correctly Classify Cognitively Compromised Older Adults?" Journal of the International Neuropsychological Society 29, s1 (noviembre de 2023): 767–68. http://dx.doi.org/10.1017/s1355617723009530.
Texto completoMcDonald, Margarethe, Taeahn Kwon, Hyunji Kim, Youngki Lee y Eon-Suk Ko. "Evaluating the Language ENvironment Analysis System for Korean". Journal of Speech, Language, and Hearing Research 64, n.º 3 (17 de marzo de 2021): 792–808. http://dx.doi.org/10.1044/2020_jslhr-20-00489.
Texto completoKumar, Krishna. "Speaker Diarization: A Review". INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 07, n.º 06 (24 de junio de 2023). http://dx.doi.org/10.55041/ijsrem24075.
Texto completoXu, Sean Shensheng, Xiaoquan Ke, Man-Wai Mak, Ka Ho Wong, Helen Meng, Timothy C. Y. Kwok, Jason Gu, Jian Zhang, Wei Tao y Chunqi Chang. "Speaker-turn aware diarization for speech-based cognitive assessments". Frontiers in Neuroscience 17 (16 de enero de 2024). http://dx.doi.org/10.3389/fnins.2023.1351848.
Texto completoRoberto Sánchez Cárdenas y Marvin Coto-Jiménez. "Application of Fischer semi discriminant analysis for speaker diarization in costa rican radio broadcasts". Revista Tecnología en Marcha, 16 de noviembre de 2022. http://dx.doi.org/10.18845/tm.v35i8.6464.
Texto completo