Journal articles on the topic "Speech diarization"
Browse the top 50 scholarly journal articles on the topic "Speech diarization."
Mertens, Robert, Po-Sen Huang, Luke Gottlieb, Gerald Friedland, Ajay Divakaran, and Mark Hasegawa-Johnson. "On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks." International Journal of Multimedia Data Engineering and Management 3, no. 3 (July 2012): 1–19. http://dx.doi.org/10.4018/jmdem.2012070101.
Astapov, Sergei, Aleksei Gusev, Marina Volkova, Aleksei Logunov, Valeriia Zaluskaia, Vlada Kapranova, Elena Timofeeva, Elena Evseeva, Vladimir Kabarov, and Yuri Matveev. "Application of Fusion of Various Spontaneous Speech Analytics Methods for Improving Far-Field Neural-Based Diarization." Mathematics 9, no. 23 (November 23, 2021): 2998. http://dx.doi.org/10.3390/math9232998.
Lyu, Ke-Ming, Ren-yuan Lyu, and Hsien-Tsung Chang. "Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation." PeerJ Computer Science 10 (March 29, 2024): e1973. http://dx.doi.org/10.7717/peerj-cs.1973.
Prabhala, Jagat Chaitanya, Venkatnareshbabu K, and Ragoju Ravi. "OPTIMIZING SIMILARITY THRESHOLD FOR ABSTRACT SIMILARITY METRIC IN SPEECH DIARIZATION SYSTEMS: A MATHEMATICAL FORMULATION." Applied Mathematics and Sciences An International Journal (MathSJ) 10, no. 1/2 (June 26, 2023): 1–10. http://dx.doi.org/10.5121/mathsj.2023.10201.
V, Sethuram, Ande Prasad, and R. Rajeswara Rao. "Metaheuristic adapted convolutional neural network for Telugu speaker diarization." Intelligent Decision Technologies 15, no. 4 (January 10, 2022): 561–77. http://dx.doi.org/10.3233/idt-211005.
Murali, Abhejay, Satwik Dutta, Meena Chandra Shekar, Dwight Irvin, Jay Buzhardt, and John H. Hansen. "Towards developing speaker diarization for parent-child interactions." Journal of the Acoustical Society of America 152, no. 4 (October 2022): A61. http://dx.doi.org/10.1121/10.0015551.
Taha, Thaer Mufeed, Zaineb Ben Messaoud, and Mondher Frikha. "Convolutional Neural Network Architectures for Gender, Emotional Detection from Speech and Speaker Diarization." International Journal of Interactive Mobile Technologies (iJIM) 18, no. 03 (February 9, 2024): 88–103. http://dx.doi.org/10.3991/ijim.v18i03.43013.
Kothalkar, Prasanna V., John H. L. Hansen, Dwight Irvin, and Jay Buzhardt. "Child-adult speech diarization in naturalistic conditions of preschool classrooms using room-independent ResNet model and automatic speech recognition-based re-segmentation." Journal of the Acoustical Society of America 155, no. 2 (February 1, 2024): 1198–215. http://dx.doi.org/10.1121/10.0024353.
Kshirod, Kshirod Sarmah. "Speaker Diarization with Deep Learning Techniques." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 11, no. 3 (December 15, 2020): 2570–82. http://dx.doi.org/10.61841/turcomat.v11i3.14309.
Lleida, Eduardo, Alfonso Ortega, Antonio Miguel, Virginia Bazán-Gil, Carmen Pérez, Manuel Gómez, and Alberto de Prada. "Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media." Applied Sciences 9, no. 24 (December 11, 2019): 5412. http://dx.doi.org/10.3390/app9245412.
Ahmad, Rehan, Syed Zubair, and Hani Alquhayz. "Speech Enhancement for Multimodal Speaker Diarization System." IEEE Access 8 (2020): 126671–80. http://dx.doi.org/10.1109/access.2020.3007312.
Kothalkar, Prasanna V., Dwight Irvin, Jay Buzhardt, and John H. Hansen. "End-to-end child-adult speech diarization in naturalistic conditions of preschool classrooms." Journal of the Acoustical Society of America 153, no. 3_supplement (March 1, 2023): A174. http://dx.doi.org/10.1121/10.0018568.
Kaur, Sukhvinder, and J. S. Sohal. "Speech Activity Detection and its Evaluation in Speaker Diarization System." INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY 16, no. 1 (March 13, 2017): 7567–72. http://dx.doi.org/10.24297/ijct.v16i1.5893.
Hansen, John H., Aditya Joglekar, and Meena Chandra Shekar. "Fearless steps Apollo: Advancements in robust speech technologies and naturalistic corpus development from Earth to the Moon." Journal of the Acoustical Society of America 152, no. 4 (October 2022): A61. http://dx.doi.org/10.1121/10.0015549.
Sultan, Wael Ali, Mourad Samir Semary, and Sherif Mahdy Abdou. "An Efficient Speaker Diarization Pipeline for Conversational Speech." Benha Journal of Applied Sciences 9, no. 5 (May 29, 2024): 141–46. http://dx.doi.org/10.21608/bjas.2024.284482.1414.
Kone, Tenon Charly, Sebastian Ghinet, Sayed Ahmed Dana, and Anant Grewal. "Speech detection models for effective communicable disease risk assessment in air travel environments." Journal of the Acoustical Society of America 155, no. 3_Supplement (March 1, 2024): A277. http://dx.doi.org/10.1121/10.0027492.
Zelenak, Martin, Carlos Segura, Jordi Luque, and Javier Hernando. "Simultaneous Speech Detection With Spatial Features for Speaker Diarization." IEEE Transactions on Audio, Speech, and Language Processing 20, no. 2 (February 2012): 436–46. http://dx.doi.org/10.1109/tasl.2011.2160167.
Viñals, Ignacio, Alfonso Ortega, Antonio Miguel, and Eduardo Lleida. "The Domain Mismatch Problem in the Broadcast Speaker Attribution Task." Applied Sciences 11, no. 18 (September 14, 2021): 8521. http://dx.doi.org/10.3390/app11188521.
Indu D. "A Methodology for Speaker Diazaration System Based on LSTM and MFCC Coefficients." Journal of Electrical Systems 20, no. 6s (May 2, 2024): 2938–45. http://dx.doi.org/10.52783/jes.3299.
Sathyapriya, S., and A. Indhumathi. "An Efficient Speaker Diarization using Privacy Preserving Audio Features Based of Speech/Non Speech Detection." International Journal of Computer Trends and Technology 9, no. 4 (March 25, 2014): 184–87. http://dx.doi.org/10.14445/22312803/ijctt-v9p136.
Huang, Zili, Marc Delcroix, Leibny Paola Garcia, Shinji Watanabe, Desh Raj, and Sanjeev Khudanpur. "Joint speaker diarization and speech recognition based on region proposal networks." Computer Speech & Language 72 (March 2022): 101316. http://dx.doi.org/10.1016/j.csl.2021.101316.
Khoma, Volodymyr, Yuriy Khoma, Vitalii Brydinskyi, and Alexander Konovalov. "Development of Supervised Speaker Diarization System Based on the PyAnnote Audio Processing Library." Sensors 23, no. 4 (February 13, 2023): 2082. http://dx.doi.org/10.3390/s23042082.
Jung, Dahae, Min-Kyoung Bae, Man Yong Choi, Eui Chul Lee, and Jinoo Joung. "Speaker diarization method of telemarketer and client for improving speech dictation performance." Journal of Supercomputing 72, no. 5 (July 3, 2015): 1757–69. http://dx.doi.org/10.1007/s11227-015-1470-4.
Zhu, Qiushi, Jie Zhang, Yu Gu, Yuchen Hu, and Lirong Dai. "Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 17 (March 24, 2024): 19768–76. http://dx.doi.org/10.1609/aaai.v38i17.29951.
Papala, Gowtham, Aniket Ransing, and Pooja Jain. "Sentiment Analysis and Speaker Diarization in Hindi and Marathi Using using Finetuned Whisper." Scalable Computing: Practice and Experience 24, no. 4 (November 17, 2023): 835–46. http://dx.doi.org/10.12694/scpe.v24i4.2248.
Senoussaoui, Mohammed, Patrick Kenny, Themos Stafylakis, and Pierre Dumouchel. "A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization." IEEE/ACM Transactions on Audio, Speech, and Language Processing 22, no. 1 (January 2014): 217–27. http://dx.doi.org/10.1109/taslp.2013.2285474.
Vryzas, Nikolaos, Nikolaos Tsipas, and Charalampos Dimoulas. "Web Radio Automation for Audio Stream Management in the Era of Big Data." Information 11, no. 4 (April 11, 2020): 205. http://dx.doi.org/10.3390/info11040205.
Lleida, Eduardo, Luis Javier Rodriguez-Fuentes, Javier Tejedor, Alfonso Ortega, Antonio Miguel, Virginia Bazán, Carmen Pérez, et al. "An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies." Applied Sciences 13, no. 15 (July 25, 2023): 8577. http://dx.doi.org/10.3390/app13158577.
Hansen, John H. L., Maryam Najafian, Rasa Lileikyte, Dwight Irvin, and Beth Rous. "Speech and language processing for assessing child–adult interaction based on diarization and location." International Journal of Speech Technology 22, no. 3 (June 5, 2019): 697–709. http://dx.doi.org/10.1007/s10772-019-09590-0.
Cerva, Petr, Jan Silovsky, Jindrich Zdansky, Jan Nouza, and Ladislav Seps. "Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives." Speech Communication 55, no. 10 (November 2013): 1033–46. http://dx.doi.org/10.1016/j.specom.2013.06.017.
Joglekar, Aditya, Ivan Lopez-Espejo, and John H. Hansen. "Fearless Steps APOLLO: Challenges in keyword spotting and topic detection for naturalistic audio streams." Journal of the Acoustical Society of America 153, no. 3_supplement (March 1, 2023): A173. http://dx.doi.org/10.1121/10.0018566.
Xiao, Bo, Chewei Huang, Zac E. Imel, David C. Atkins, Panayiotis Georgiou, and Shrikanth S. Narayanan. "A technology prototype system for rating therapist empathy from audio recordings in addiction counseling." PeerJ Computer Science 2 (April 20, 2016): e59. http://dx.doi.org/10.7717/peerj-cs.59.
Kalanadhabhatta, Manasa, Mohammad Mehdi Rastikerdar, Tauhidur Rahman, Adam S. Grabell, and Deepak Ganesan. "Playlogue: Dataset and Benchmarks for Analyzing Adult-Child Conversations During Play." Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 8, no. 4 (November 21, 2024): 1–34. http://dx.doi.org/10.1145/3699775.
Di Cesare, Michele Giuseppe, David Perpetuini, Daniela Cardone, and Arcangelo Merla. "Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson’s Disease: A Study on Speaker Diarization and Classification Techniques." Sensors 24, no. 5 (February 26, 2024): 1499. http://dx.doi.org/10.3390/s24051499.
Yella, Sree Harsha, and Herve Bourlard. "Overlapping Speech Detection Using Long-Term Conversational Features for Speaker Diarization in Meeting Room Conversations." IEEE/ACM Transactions on Audio, Speech, and Language Processing 22, no. 12 (December 2014): 1688–700. http://dx.doi.org/10.1109/taslp.2014.2346315.
Ghorbani, Shahram, and John H. L. Hansen. "Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition." Journal of the Acoustical Society of America 155, no. 6 (June 1, 2024): 3848–60. http://dx.doi.org/10.1121/10.0026235.
Anmella, Gerard, Michele De Prisco, Jeremiah B. Joyce, Claudia Valenzuela-Pascual, Ariadna Mas-Musons, Vincenzo Oliva, Giovanna Fico, et al. "Automated Speech Analysis in Bipolar Disorder: The CALIBER Study Protocol and Preliminary Results." Journal of Clinical Medicine 13, no. 17 (August 23, 2024): 4997. http://dx.doi.org/10.3390/jcm13174997.
Zeulner, Tobias, Gerhard Johann Hagerer, Moritz Müller, Ignacio Vazquez, and Peter A. Gloor. "Predicting Individual Well-Being in Teamwork Contexts Based on Speech Features." Information 15, no. 4 (April 12, 2024): 217. http://dx.doi.org/10.3390/info15040217.
Kaur, Sukhvinder, Chander Prabha, Ravinder Pal Singh, Deepali Gupta, Sapna Juneja, Punit Gupta, and Ali Nauman. "Optimized technique for speaker changes detection in multispeaker audio recording using pyknogram and efficient distance metric." PLOS ONE 19, no. 11 (November 20, 2024): e0314073. http://dx.doi.org/10.1371/journal.pone.0314073.
Delgado, Héctor, Anna Matamala, and Javier Serrano. "Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?" Cadernos de Tradução 35, no. 2 (June 17, 2015): 308. http://dx.doi.org/10.5007/2175-7968.2015v35n2p308.
Diez, Mireia, Lukas Burget, Federico Landini, and Jan Cernocky. "Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 355–68. http://dx.doi.org/10.1109/taslp.2019.2955293.
Dawalatabad, Nauman, Srikanth Madikeri, C. Chandra Sekhar, and Hema A. Murthy. "Novel Architectures for Unsupervised Information Bottleneck Based Speaker Diarization of Meetings." IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2021): 14–27. http://dx.doi.org/10.1109/taslp.2020.3036231.
O’Malley, Ronan, Bahman Mirhedari, Kirsty Harkness, Markus Reuber, Annalena Venneri, Heidi Christensen, and Daniel Blackburn. "055 The digital doctor: a fully automated stratification and monitoring system for patients with memory complaints." Journal of Neurology, Neurosurgery & Psychiatry 90, no. 12 (November 14, 2019): A23.2–A23. http://dx.doi.org/10.1136/jnnp-2019-abn-2.76.
Ding, Huitong, Adrian Lister, Cody Karjadi, Rhoda Au, Honghuang Lin, Brian Bischoff, and Phillip Hwang. "EARLY DETECTION OF ALZHEIMER’S DISEASE AND RELATED DEMENTIAS FROM VOICE RECORDINGS: THE FRAMINGHAM HEART STUDY." Innovation in Aging 7, Supplement_1 (December 1, 2023): 1024. http://dx.doi.org/10.1093/geroni/igad104.3291.
Praharaj, Sambit, Maren Scheffel, Marcel Schmitz, Marcus Specht, and Hendrik Drachsler. "Towards Automatic Collaboration Analytics for Group Speech Data Using Learning Analytics." Sensors 21, no. 9 (May 2, 2021): 3156. http://dx.doi.org/10.3390/s21093156.
Hershkovich, Leeor, Sabyasachi Bandyopadhyay, Jack Wittmayer, Patrick Tighe, David J. Libon, Catherine C. Price, and Parisa Rashidi. "96 Proof of Principle: Can Paragraph Recall Pauses and Speech Frequencies Correctly Classify Cognitively Compromised Older Adults?" Journal of the International Neuropsychological Society 29, s1 (November 2023): 767–68. http://dx.doi.org/10.1017/s1355617723009530.
McDonald, Margarethe, Taeahn Kwon, Hyunji Kim, Youngki Lee, and Eon-Suk Ko. "Evaluating the Language ENvironment Analysis System for Korean." Journal of Speech, Language, and Hearing Research 64, no. 3 (March 17, 2021): 792–808. http://dx.doi.org/10.1044/2020_jslhr-20-00489.
Kumar, Krishna. "Speaker Diarization: A Review." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 07, no. 06 (June 24, 2023). http://dx.doi.org/10.55041/ijsrem24075.
Xu, Sean Shensheng, Xiaoquan Ke, Man-Wai Mak, Ka Ho Wong, Helen Meng, Timothy C. Y. Kwok, Jason Gu, Jian Zhang, Wei Tao, and Chunqi Chang. "Speaker-turn aware diarization for speech-based cognitive assessments." Frontiers in Neuroscience 17 (January 16, 2024). http://dx.doi.org/10.3389/fnins.2023.1351848.
Roberto Sánchez Cárdenas and Marvin Coto-Jiménez. "Application of Fischer semi discriminant analysis for speaker diarization in costa rican radio broadcasts." Revista Tecnología en Marcha, November 16, 2022. http://dx.doi.org/10.18845/tm.v35i8.6464.