Zeitschriftenartikel zum Thema „Speech diarization“
Geben Sie eine Quelle nach APA, MLA, Chicago, Harvard und anderen Zitierweisen an
Machen Sie sich mit Top-50 Zeitschriftenartikel für die Forschung zum Thema "Speech diarization" bekannt.
Neben jedem Werk im Literaturverzeichnis ist die Option "Zur Bibliographie hinzufügen" verfügbar. Nutzen Sie sie, wird Ihre bibliographische Angabe des gewählten Werkes nach der nötigen Zitierweise (APA, MLA, Harvard, Chicago, Vancouver usw.) automatisch gestaltet.
Sie können auch den vollen Text der wissenschaftlichen Publikation im PDF-Format herunterladen und eine Online-Annotation der Arbeit lesen, wenn die relevanten Parameter in den Metadaten verfügbar sind.
Sehen Sie die Zeitschriftenartikel für verschiedene Spezialgebieten durch und erstellen Sie Ihre Bibliographie auf korrekte Weise.
Mertens, Robert, Po-Sen Huang, Luke Gottlieb, Gerald Friedland, Ajay Divakaran und Mark Hasegawa-Johnson. „On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks“. International Journal of Multimedia Data Engineering and Management 3, Nr. 3 (Juli 2012): 1–19. http://dx.doi.org/10.4018/jmdem.2012070101.
Der volle Inhalt der QuelleAstapov, Sergei, Aleksei Gusev, Marina Volkova, Aleksei Logunov, Valeriia Zaluskaia, Vlada Kapranova, Elena Timofeeva, Elena Evseeva, Vladimir Kabarov und Yuri Matveev. „Application of Fusion of Various Spontaneous Speech Analytics Methods for Improving Far-Field Neural-Based Diarization“. Mathematics 9, Nr. 23 (23.11.2021): 2998. http://dx.doi.org/10.3390/math9232998.
Der volle Inhalt der QuelleLyu, Ke-Ming, Ren-yuan Lyu und Hsien-Tsung Chang. „Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation“. PeerJ Computer Science 10 (29.03.2024): e1973. http://dx.doi.org/10.7717/peerj-cs.1973.
Der volle Inhalt der QuellePrabhala, Jagat Chaitanya, Venkatnareshbabu K und Ragoju Ravi. „OPTIMIZING SIMILARITY THRESHOLD FOR ABSTRACT SIMILARITY METRIC IN SPEECH DIARIZATION SYSTEMS: A MATHEMATICAL FORMULATION“. Applied Mathematics and Sciences An International Journal (MathSJ) 10, Nr. 1/2 (26.06.2023): 1–10. http://dx.doi.org/10.5121/mathsj.2023.10201.
Der volle Inhalt der QuelleV, Sethuram, Ande Prasad und R. Rajeswara Rao. „Metaheuristic adapted convolutional neural network for Telugu speaker diarization“. Intelligent Decision Technologies 15, Nr. 4 (10.01.2022): 561–77. http://dx.doi.org/10.3233/idt-211005.
Der volle Inhalt der QuelleMurali, Abhejay, Satwik Dutta, Meena Chandra Shekar, Dwight Irvin, Jay Buzhardt und John H. Hansen. „Towards developing speaker diarization for parent-child interactions“. Journal of the Acoustical Society of America 152, Nr. 4 (Oktober 2022): A61. http://dx.doi.org/10.1121/10.0015551.
Der volle Inhalt der QuelleTaha, Thaer Mufeed, Zaineb Ben Messaoud und Mondher Frikha. „Convolutional Neural Network Architectures for Gender, Emotional Detection from Speech and Speaker Diarization“. International Journal of Interactive Mobile Technologies (iJIM) 18, Nr. 03 (09.02.2024): 88–103. http://dx.doi.org/10.3991/ijim.v18i03.43013.
Der volle Inhalt der QuelleKothalkar, Prasanna V., John H. L. Hansen, Dwight Irvin und Jay Buzhardt. „Child-adult speech diarization in naturalistic conditions of preschool classrooms using room-independent ResNet model and automatic speech recognition-based re-segmentation“. Journal of the Acoustical Society of America 155, Nr. 2 (01.02.2024): 1198–215. http://dx.doi.org/10.1121/10.0024353.
Der volle Inhalt der QuelleKshirod, Kshirod Sarmah. „Speaker Diarization with Deep Learning Techniques“. Turkish Journal of Computer and Mathematics Education (TURCOMAT) 11, Nr. 3 (15.12.2020): 2570–82. http://dx.doi.org/10.61841/turcomat.v11i3.14309.
Der volle Inhalt der QuelleLleida, Eduardo, Alfonso Ortega, Antonio Miguel, Virginia Bazán-Gil, Carmen Pérez, Manuel Gómez und Alberto de Prada. „Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media“. Applied Sciences 9, Nr. 24 (11.12.2019): 5412. http://dx.doi.org/10.3390/app9245412.
Der volle Inhalt der QuelleAhmad, Rehan, Syed Zubair und Hani Alquhayz. „Speech Enhancement for Multimodal Speaker Diarization System“. IEEE Access 8 (2020): 126671–80. http://dx.doi.org/10.1109/access.2020.3007312.
Der volle Inhalt der QuelleKothalkar, Prasanna V., Dwight Irvin, Jay Buzhardt und John H. Hansen. „End-to-end child-adult speech diarization in naturalistic conditions of preschool classrooms“. Journal of the Acoustical Society of America 153, Nr. 3_supplement (01.03.2023): A174. http://dx.doi.org/10.1121/10.0018568.
Der volle Inhalt der QuelleKaur, Sukhvinder, und J. S. Sohal. „Speech Activity Detection and its Evaluation in Speaker Diarization System“. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY 16, Nr. 1 (13.03.2017): 7567–72. http://dx.doi.org/10.24297/ijct.v16i1.5893.
Der volle Inhalt der QuelleHansen, John H., Aditya Joglekar und Meena Chandra Shekar. „Fearless steps Apollo: Advancements in robust speech technologies and naturalistic corpus development from Earth to the Moon“. Journal of the Acoustical Society of America 152, Nr. 4 (Oktober 2022): A61. http://dx.doi.org/10.1121/10.0015549.
Der volle Inhalt der QuelleSultan, Wael Ali, Mourad Samir Semary und Sherif Mahdy Abdou. „An Efficient Speaker Diarization Pipeline for Conversational Speech“. Benha Journal of Applied Sciences 9, Nr. 5 (29.05.2024): 141–46. http://dx.doi.org/10.21608/bjas.2024.284482.1414.
Der volle Inhalt der QuelleKone, Tenon Charly, Sebastian Ghinet, Sayed Ahmed Dana und Anant Grewal. „Speech detection models for effective communicable disease risk assessment in air travel environments“. Journal of the Acoustical Society of America 155, Nr. 3_Supplement (01.03.2024): A277. http://dx.doi.org/10.1121/10.0027492.
Der volle Inhalt der QuelleZelenak, Martin, Carlos Segura, Jordi Luque und Javier Hernando. „Simultaneous Speech Detection With Spatial Features for Speaker Diarization“. IEEE Transactions on Audio, Speech, and Language Processing 20, Nr. 2 (Februar 2012): 436–46. http://dx.doi.org/10.1109/tasl.2011.2160167.
Der volle Inhalt der QuelleViñals, Ignacio, Alfonso Ortega, Antonio Miguel und Eduardo Lleida. „The Domain Mismatch Problem in the Broadcast Speaker Attribution Task“. Applied Sciences 11, Nr. 18 (14.09.2021): 8521. http://dx.doi.org/10.3390/app11188521.
Der volle Inhalt der QuelleIndu D. „A Methodology for Speaker Diazaration System Based on LSTM and MFCC Coefficients“. Journal of Electrical Systems 20, Nr. 6s (02.05.2024): 2938–45. http://dx.doi.org/10.52783/jes.3299.
Der volle Inhalt der QuelleSathyapriya, S., und A. Indhumathi. „An Efficient Speaker Diarization using Privacy Preserving Audio Features Based of Speech/Non Speech Detection“. International Journal of Computer Trends and Technology 9, Nr. 4 (25.03.2014): 184–87. http://dx.doi.org/10.14445/22312803/ijctt-v9p136.
Der volle Inhalt der QuelleHuang, Zili, Marc Delcroix, Leibny Paola Garcia, Shinji Watanabe, Desh Raj und Sanjeev Khudanpur. „Joint speaker diarization and speech recognition based on region proposal networks“. Computer Speech & Language 72 (März 2022): 101316. http://dx.doi.org/10.1016/j.csl.2021.101316.
Der volle Inhalt der QuelleKhoma, Volodymyr, Yuriy Khoma, Vitalii Brydinskyi und Alexander Konovalov. „Development of Supervised Speaker Diarization System Based on the PyAnnote Audio Processing Library“. Sensors 23, Nr. 4 (13.02.2023): 2082. http://dx.doi.org/10.3390/s23042082.
Der volle Inhalt der QuelleJung, Dahae, Min-Kyoung Bae, Man Yong Choi, Eui Chul Lee und Jinoo Joung. „Speaker diarization method of telemarketer and client for improving speech dictation performance“. Journal of Supercomputing 72, Nr. 5 (03.07.2015): 1757–69. http://dx.doi.org/10.1007/s11227-015-1470-4.
Der volle Inhalt der QuelleZhu, Qiushi, Jie Zhang, Yu Gu, Yuchen Hu und Lirong Dai. „Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation“. Proceedings of the AAAI Conference on Artificial Intelligence 38, Nr. 17 (24.03.2024): 19768–76. http://dx.doi.org/10.1609/aaai.v38i17.29951.
Der volle Inhalt der QuellePapala, Gowtham, Aniket Ransing und Pooja Jain. „Sentiment Analysis and Speaker Diarization in Hindi and Marathi Using using Finetuned Whisper“. Scalable Computing: Practice and Experience 24, Nr. 4 (17.11.2023): 835–46. http://dx.doi.org/10.12694/scpe.v24i4.2248.
Der volle Inhalt der QuelleSenoussaoui, Mohammed, Patrick Kenny, Themos Stafylakis und Pierre Dumouchel. „A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization“. IEEE/ACM Transactions on Audio, Speech, and Language Processing 22, Nr. 1 (Januar 2014): 217–27. http://dx.doi.org/10.1109/taslp.2013.2285474.
Der volle Inhalt der QuelleVryzas, Nikolaos, Nikolaos Tsipas und Charalampos Dimoulas. „Web Radio Automation for Audio Stream Management in the Era of Big Data“. Information 11, Nr. 4 (11.04.2020): 205. http://dx.doi.org/10.3390/info11040205.
Der volle Inhalt der QuelleLleida, Eduardo, Luis Javier Rodriguez-Fuentes, Javier Tejedor, Alfonso Ortega, Antonio Miguel, Virginia Bazán, Carmen Pérez et al. „An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies“. Applied Sciences 13, Nr. 15 (25.07.2023): 8577. http://dx.doi.org/10.3390/app13158577.
Der volle Inhalt der QuelleHansen, John H. L., Maryam Najafian, Rasa Lileikyte, Dwight Irvin und Beth Rous. „Speech and language processing for assessing child–adult interaction based on diarization and location“. International Journal of Speech Technology 22, Nr. 3 (05.06.2019): 697–709. http://dx.doi.org/10.1007/s10772-019-09590-0.
Der volle Inhalt der QuelleCerva, Petr, Jan Silovsky, Jindrich Zdansky, Jan Nouza und Ladislav Seps. „Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives“. Speech Communication 55, Nr. 10 (November 2013): 1033–46. http://dx.doi.org/10.1016/j.specom.2013.06.017.
Der volle Inhalt der QuelleJoglekar, Aditya, Ivan Lopez-Espejo und John H. Hansen. „Fearless Steps APOLLO: Challenges in keyword spotting and topic detection for naturalistic audio streams“. Journal of the Acoustical Society of America 153, Nr. 3_supplement (01.03.2023): A173. http://dx.doi.org/10.1121/10.0018566.
Der volle Inhalt der QuelleXiao, Bo, Chewei Huang, Zac E. Imel, David C. Atkins, Panayiotis Georgiou und Shrikanth S. Narayanan. „A technology prototype system for rating therapist empathy from audio recordings in addiction counseling“. PeerJ Computer Science 2 (20.04.2016): e59. http://dx.doi.org/10.7717/peerj-cs.59.
Der volle Inhalt der QuelleKalanadhabhatta, Manasa, Mohammad Mehdi Rastikerdar, Tauhidur Rahman, Adam S. Grabell und Deepak Ganesan. „Playlogue: Dataset and Benchmarks for Analyzing Adult-Child Conversations During Play“. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 8, Nr. 4 (21.11.2024): 1–34. http://dx.doi.org/10.1145/3699775.
Der volle Inhalt der QuelleDi Cesare, Michele Giuseppe, David Perpetuini, Daniela Cardone und Arcangelo Merla. „Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson’s Disease: A Study on Speaker Diarization and Classification Techniques“. Sensors 24, Nr. 5 (26.02.2024): 1499. http://dx.doi.org/10.3390/s24051499.
Der volle Inhalt der QuelleYella, Sree Harsha, und Herve Bourlard. „Overlapping Speech Detection Using Long-Term Conversational Features for Speaker Diarization in Meeting Room Conversations“. IEEE/ACM Transactions on Audio, Speech, and Language Processing 22, Nr. 12 (Dezember 2014): 1688–700. http://dx.doi.org/10.1109/taslp.2014.2346315.
Der volle Inhalt der QuelleGhorbani, Shahram, und John H. L. Hansen. „Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition“. Journal of the Acoustical Society of America 155, Nr. 6 (01.06.2024): 3848–60. http://dx.doi.org/10.1121/10.0026235.
Der volle Inhalt der QuelleAnmella, Gerard, Michele De Prisco, Jeremiah B. Joyce, Claudia Valenzuela-Pascual, Ariadna Mas-Musons, Vincenzo Oliva, Giovanna Fico et al. „Automated Speech Analysis in Bipolar Disorder: The CALIBER Study Protocol and Preliminary Results“. Journal of Clinical Medicine 13, Nr. 17 (23.08.2024): 4997. http://dx.doi.org/10.3390/jcm13174997.
Der volle Inhalt der QuelleZeulner, Tobias, Gerhard Johann Hagerer, Moritz Müller, Ignacio Vazquez und Peter A. Gloor. „Predicting Individual Well-Being in Teamwork Contexts Based on Speech Features“. Information 15, Nr. 4 (12.04.2024): 217. http://dx.doi.org/10.3390/info15040217.
Der volle Inhalt der QuelleKaur, Sukhvinder, Chander Prabha, Ravinder Pal Singh, Deepali Gupta, Sapna Juneja, Punit Gupta und Ali Nauman. „Optimized technique for speaker changes detection in multispeaker audio recording using pyknogram and efficient distance metric“. PLOS ONE 19, Nr. 11 (20.11.2024): e0314073. http://dx.doi.org/10.1371/journal.pone.0314073.
Der volle Inhalt der QuelleDelgado, Héctor, Anna Matamala und Javier Serrano. „Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?“ Cadernos de Tradução 35, Nr. 2 (17.06.2015): 308. http://dx.doi.org/10.5007/2175-7968.2015v35n2p308.
Der volle Inhalt der QuelleDiez, Mireia, Lukas Burget, Federico Landini und Jan Cernocky. „Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors“. IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 355–68. http://dx.doi.org/10.1109/taslp.2019.2955293.
Der volle Inhalt der QuelleDawalatabad, Nauman, Srikanth Madikeri, C. Chandra Sekhar und Hema A. Murthy. „Novel Architectures for Unsupervised Information Bottleneck Based Speaker Diarization of Meetings“. IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2021): 14–27. http://dx.doi.org/10.1109/taslp.2020.3036231.
Der volle Inhalt der QuelleO’Malley, Ronan, Bahman Mirhedari, Kirsty Harkness, Markus Reuber, Annalena Venneri, Heidi Christensen und Daniel Blackburn. „055 The digital doctor: a fully automated stratification and monitoring system for patients with memory complaints“. Journal of Neurology, Neurosurgery & Psychiatry 90, Nr. 12 (14.11.2019): A23.2—A23. http://dx.doi.org/10.1136/jnnp-2019-abn-2.76.
Der volle Inhalt der QuelleDing, Huitong, Adrian Lister, Cody Karjadi, Rhoda Au, Honghuang Lin, Brian Bischoff und Phillip Hwang. „EARLY DETECTION OF ALZHEIMER’S DISEASE AND RELATED DEMENTIAS FROM VOICE RECORDINGS: THE FRAMINGHAM HEART STUDY“. Innovation in Aging 7, Supplement_1 (01.12.2023): 1024. http://dx.doi.org/10.1093/geroni/igad104.3291.
Der volle Inhalt der QuellePraharaj, Sambit, Maren Scheffel, Marcel Schmitz, Marcus Specht und Hendrik Drachsler. „Towards Automatic Collaboration Analytics for Group Speech Data Using Learning Analytics“. Sensors 21, Nr. 9 (02.05.2021): 3156. http://dx.doi.org/10.3390/s21093156.
Der volle Inhalt der QuelleHershkovich, Leeor, Sabyasachi Bandyopadhyay, Jack Wittmayer, Patrick Tighe, David J. Libon, Catherine C. Price und Parisa Rashidi. „96 Proof of Principle: Can Paragraph Recall Pauses and Speech Frequencies Correctly Classify Cognitively Compromised Older Adults?“ Journal of the International Neuropsychological Society 29, s1 (November 2023): 767–68. http://dx.doi.org/10.1017/s1355617723009530.
Der volle Inhalt der QuelleMcDonald, Margarethe, Taeahn Kwon, Hyunji Kim, Youngki Lee und Eon-Suk Ko. „Evaluating the Language ENvironment Analysis System for Korean“. Journal of Speech, Language, and Hearing Research 64, Nr. 3 (17.03.2021): 792–808. http://dx.doi.org/10.1044/2020_jslhr-20-00489.
Der volle Inhalt der QuelleKumar, Krishna. „Speaker Diarization: A Review“. INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 07, Nr. 06 (24.06.2023). http://dx.doi.org/10.55041/ijsrem24075.
Der volle Inhalt der QuelleXu, Sean Shensheng, Xiaoquan Ke, Man-Wai Mak, Ka Ho Wong, Helen Meng, Timothy C. Y. Kwok, Jason Gu, Jian Zhang, Wei Tao und Chunqi Chang. „Speaker-turn aware diarization for speech-based cognitive assessments“. Frontiers in Neuroscience 17 (16.01.2024). http://dx.doi.org/10.3389/fnins.2023.1351848.
Der volle Inhalt der QuelleRoberto Sánchez Cárdenas und Marvin Coto-Jiménez. „Application of Fischer semi discriminant analysis for speaker diarization in costa rican radio broadcasts“. Revista Tecnología en Marcha, 16.11.2022. http://dx.doi.org/10.18845/tm.v35i8.6464.
Der volle Inhalt der Quelle