Academic literature on the topic 'Speaker recognition'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Speaker recognition.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "Speaker recognition"
Sun, Linhui, Yunyi Bu, Bo Zou, Sheng Fu, and Pingan Li. "Speaker Recognition Based on Fusion of a Deep and Shallow Recombination Gaussian Supervector." Electronics 10, no. 1 (December 25, 2020): 20. http://dx.doi.org/10.3390/electronics10010020.
Full textSingh, Satyanand. "Forensic and Automatic Speaker Recognition System." International Journal of Electrical and Computer Engineering (IJECE) 8, no. 5 (October 1, 2018): 2804. http://dx.doi.org/10.11591/ijece.v8i5.pp2804-2811.
Full textMarkowitz, Judith A. "Speaker recognition." Information Security Technical Report 3, no. 1 (January 1998): 14–20. http://dx.doi.org/10.1016/s1363-4127(98)80014-9.
Full textMarkowitz, Judith A. "Speaker recognition." Information Security Technical Report 4 (January 1999): 28. http://dx.doi.org/10.1016/s1363-4127(99)80053-3.
Full textFurui, Sadaoki. "Speaker recognition." Scholarpedia 3, no. 4 (2008): 3715. http://dx.doi.org/10.4249/scholarpedia.3715.
Full textO'Shaughnessy, D. "Speaker recognition." IEEE ASSP Magazine 3, no. 4 (October 1986): 4–17. http://dx.doi.org/10.1109/massp.1986.1165388.
Full textSingh, Mahesh K., P. Mohana Satya, Vella Satyanarayana, and Sridevi Gamini. "Speaker Recognition Assessment in a Continuous System for Speaker Identification." International Journal of Electrical and Electronics Research 10, no. 4 (December 30, 2022): 862–67. http://dx.doi.org/10.37391/ijeer.100418.
Full textGonzalez-Rodriguez, Joaquin. "Evaluating Automatic Speaker Recognition systems: An overview of the NIST Speaker Recognition Evaluations (1996-2014)." Loquens 1, no. 1 (June 30, 2014): e007. http://dx.doi.org/10.3989/loquens.2014.007.
Full textMannepalli, Kasiprasad, Suman Maloji, Panyam Narahari Sastry, Swetha Danthala, and Durgaprasad Mannepalli. "Text independent emotion recognition for Telugu speech by using prosodic features." International Journal of Engineering & Technology 7, no. 2.7 (March 18, 2018): 594. http://dx.doi.org/10.14419/ijet.v7i2.7.10887.
Full textLakshmi Prasanna, P. "Attention for the speech of cleft lip and palate in speaker recognition." Open Journal of Pain Medicine 7, no. 1 (December 1, 2023): 7–1. http://dx.doi.org/10.17352/ojpm.000036.
Full textDissertations / Theses on the topic "Speaker recognition"
Chatzaras, Anargyros, and Georgios Savvidis. "Seamless speaker recognition." Thesis, KTH, Radio Systems Laboratory (RS Lab), 2015. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-159021.
Full textI ett teknologiskt avancerat samhälle så hanterar den genomsnittliga personen dussintals konton för e-post, sociala nätverk, internetbanker, och andra elektroniska tjänster. Allt eftersom antalet konton ökar, blir behovet av automatisk identifiering av användaren mer väsentlig. Biometri har länge använts för att identifiera personer och är den vanligaste (om inte den enda) metoden för att utföra denna uppgift. Smartphones har under de senaste åren blivit allt mer vanligt förekommande, de ger användaren tillgång till de flesta av sina konton och, i viss mån, även personifiering av enheterna baserat på deras profiler på sociala nätverk. Dessa enheter har inbyggda mikrofoner och används ofta av en enskild användare eller en liten grupp av användare, till exempel ett par eller en familj. Denna avhandling använder mikrofonen i en smartphone för att spela in användarens tal och identifiera honom/henne. Befintliga lösningar för talarigenkänning ber vanligtvis användaren om att ge långa röstprover för att kunna ge korrekta resultat. Detta resulterar i en dålig användarupplevelse och avskräcker användare som inte har tålamod att gå igenom en sådan process. Huvudtanken bakom den strategi för talarigenkänningen som presenteras i denna avhandling är att ge en sömlös användarupplevelse där inspelningen av användarens röst sker i bakgrunden. En Android-applikation har utvecklats som, utan att märkas, samlar in röstprover och utför talarigenkänning på dessa utan att kräva omfattande interaktion av användaren. Två varianter av verktyget har utvecklats och dessa beskrivs ingående i denna avhandling. Öpen source-ramverket Recognito används för att utföra talarigenkänningen. Analysen av Recognito visade att det inte klarar av att uppnå tillräckligt hög noggrannhet, speciellt när röstproverna innehåller bakgrundsbrus. Dessutom visade jämförelsen mellan de två arkitekturerna att de inte skiljer sig nämnvärt i fråga om prestanda.
VASILAKAKIS, VASILEIOS. "Forensic speaker recognition: speaker and height estimation techniques." Doctoral thesis, Politecnico di Torino, 2014. http://hdl.handle.net/11583/2551370.
Full textKamarauskas, Juozas. "Speaker recognition by voice." Doctoral thesis, Lithuanian Academic Libraries Network (LABT), 2009. http://vddb.library.lt/obj/LT-eLABa-0001:E.02~2009~D_20090615_093847-20773.
Full textDisertacijoje nagrinėjami kalbančiojo atpažinimo pagal balsą klausimai. Aptartos kalbančiojo atpažinimo sistemos, jų raida, atpažinimo problemos, požymių sistemos įvairovė bei kalbančiojo modeliavimo ir požymių palyginimo metodai, naudojami nuo ištarto teksto nepriklausomame bei priklausomame kalbančiojo atpažinime. Darbo metu sukurta nuo ištarto teksto nepriklausanti kalbančiojo atpažinimo sistema. Kalbėtojų modelių kūrimui ir požymių palyginimui buvo panaudoti Gauso mišinių modeliai. Pasiūlytas automatinis vokalizuotų garsų išrinkimo (segmentavimo) metodas. Šis metodas yra greitai veikiantis ir nereikalaujantis iš vartotojo jokių papildomų veiksmų, tokių kaip kalbos signalo ir triukšmo pavyzdžių nurodymas. Pasiūlyta požymių vektorių sistema, susidedanti iš žadinimo signalo bei balso trakto parametrų. Kaip žadinimo signalo parametras, panaudotas žadinimo signalo pagrindinis dažnis, kaip balso trakto parametrai, panaudotos keturios formantės bei trys antiformantės. Siekiant suvienodinti žemesnių bei aukštesnių formančių ir antiformančių dispersijas, jas pasiūlėme skaičiuoti melų skalėje. Rezultatų palyginimui sistemoje buvo realizuoti standartiniai požymiai, naudojami kalbos bei asmens atpažinime – melų skalės kepstro koeficientai (MSKK). Atlikti kalbančiojo atpažinimo eksperimentai parodė, kad panaudojus pasiūlytą požymių sistemą buvo gauti geresni atpažinimo rezultatai, nei panaudojus standartinius požymius (MSKK). Gautas lygių klaidų lygis, panaudojant pasiūlytą požymių... [toliau žr. visą tekstą]
Du, Toit Ilze. "Non-acoustic speaker recognition." Thesis, Stellenbosch : University of Stellenbosch, 2004. http://hdl.handle.net/10019.1/16315.
Full textENGLISH ABSTRACT: In this study the phoneme labels derived from a phoneme recogniser are used for phonetic speaker recognition. The time-dependencies among phonemes are modelled by using hidden Markov models (HMMs) for the speaker models. Experiments are done using firstorder and second-order HMMs and various smoothing techniques are examined to address the problem of data scarcity. The use of word labels for lexical speaker recognition is also investigated. Single word frequencies are counted and the use of various word selections as feature sets are investigated. During April 2004, the University of Stellenbosch, in collaboration with Spescom DataVoice, participated in an international speaker verification competition presented by the National Institute of Standards and Technology (NIST). The University of Stellenbosch submitted phonetic and lexical (non-acoustic) speaker recognition systems and a fused system (the primary system) that fuses the acoustic system of Spescom DataVoice with the non-acoustic systems of the University of Stellenbosch. The results were evaluated by means of a cost model. Based on the cost model, the primary system obtained second and third position in the two categories that were submitted.
AFRIKAANSE OPSOMMING: Hierdie projek maak gebruik van foneem-etikette wat geklassifiseer word deur ’n foneemherkenner en daarna gebruik word vir fonetiese sprekerherkenning. Die tyd-afhanklikhede tussen foneme word gemodelleer deur gebruik te maak van verskuilde Markov modelle (HMMs) as sprekermodelle. Daar word ge¨eksperimenteer met eerste-orde en tweede-orde HMMs en verskeie vergladdingstegnieke word ondersoek om dataskaarsheid aan te spreek. Die gebruik van woord-etikette vir sprekerherkenning word ook ondersoek. Enkelwoordfrekwensies word getel en daar word ge¨eksperimenteer met verskeie woordseleksies as kenmerke vir sprekerherkenning. Gedurende April 2004 het die Universiteit van Stellenbosch in samewerking met Spescom DataVoice deelgeneem aan ’n internasionale sprekerverifikasie kompetisie wat deur die National Institute of Standards and Technology (NIST) aangebied is. Die Universiteit van Stellenbosch het ingeskryf vir ’n fonetiese en ’n woordgebaseerde (nie-akoestiese) sprekerherkenningstelsel, asook ’n saamgesmelte stelsel wat as primˆere stelsel dien. Die saamgesmelte stelsel is ’n kombinasie van Spescom DataVoice se akoestiese stelsel en die twee nie-akoestiese stelsels van die Universiteit van Stellenbosch. Die resultate is ge¨evalueer deur gebruik te maak van ’n koste-model. Op grond van die koste-model het die primˆere stelsel tweede en derde plek behaal in die twee kategorie¨e waaraan deelgeneem is.
Hong, Z. (Zimeng). "Speaker gender recognition system." Master's thesis, University of Oulu, 2017. http://jultika.oulu.fi/Record/nbnfioulu-201706082645.
Full textAl-Kilani, Menia. "Voice-signature-based Speaker Recognition." University of the Western Cape, 2017. http://hdl.handle.net/11394/5888.
Full textPersonal identification and the protection of data are important issues because of the ubiquitousness of computing and these have thus become interesting areas of research in the field of computer science. Previously people have used a variety of ways to identify an individual and protect themselves, their property and their information. This they did mostly by means of locks, passwords, smartcards and biometrics. Verifying individuals by using their physical or behavioural features is more secure than using other data such as passwords or smartcards, because everyone has unique features which distinguish him or her from others. Furthermore the biometrics of a person are difficult to imitate or steal. Biometric technologies represent a significant component of a comprehensive digital identity solution and play an important role in security. The technologies that support identification and authentication of individuals is based on either their physiological or their behavioural characteristics. Live-‐data, in this instance the human voice, is the topic of this research. The aim is to recognize a person’s voice and to identify the user by verifying that his/her voice is the same as a record of his / her voice-‐signature in a systems database. To address the main research question: “What is the best way to identify a person by his / her voice signature?”, design science research, was employed. This methodology is used to develop an artefact for solving a problem. Initially a pilot study was conducted using visual representation of voice signatures, to check if it is possible to identify speakers without using feature extraction or matching methods. Subsequently, experiments were conducted with 6300 data sets derived from Texas Instruments and the Massachusetts Institute of Technology audio database. Two methods of feature extraction and classification were considered—mel frequency cepstrum coefficient and linear prediction cepstral coefficient feature extraction—and for classification, the Support Vector Machines method was used. The three methods were compared in terms of their effectiveness and it was found that the system using the mel frequency cepstrum coefficient, for feature extraction, gave the marginally better results for speaker recognition.
Oglesby, J. "Neural models for speaker recognition." Thesis, Swansea University, 1991. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.638359.
Full textThompson, J. "Speech variability in speaker recognition." Thesis, Swansea University, 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.639230.
Full textMukherjee, Rishiraj. "Speaker Recognition Using Shifted MFCC." Scholar Commons, 2012. http://scholarcommons.usf.edu/etd/4136.
Full textMwangi, Elijah. "Speaker independent isolated word recognition." Thesis, Loughborough University, 1987. https://dspace.lboro.ac.uk/2134/15425.
Full textBooks on the topic "Speaker recognition"
Neustein, Amy, and Hemant A. Patil, eds. Forensic Speaker Recognition. New York, NY: Springer New York, 2012. http://dx.doi.org/10.1007/978-1-4614-0263-3.
Full textAuditory speaker recognition. Hamburg: Buske, 1987.
Find full textBeigi, Homayoon. Fundamentals of Speaker Recognition. Boston, MA: Springer US, 2011. http://dx.doi.org/10.1007/978-0-387-77592-0.
Full textFundamentals of speaker recognition. New York: Springer, 2011.
Find full textLee, Chin-Hui, Frank K. Soong, and Kuldip K. Paliwal, eds. Automatic Speech and Speaker Recognition. Boston, MA: Springer US, 1996. http://dx.doi.org/10.1007/978-1-4613-1367-0.
Full textKeshet, Joseph, and Samy Bengio, eds. Automatic Speech and Speaker Recognition. Chichester, UK: John Wiley & Sons, Ltd, 2009. http://dx.doi.org/10.1002/9780470742044.
Full textZheng, Thomas Fang, and Lantian Li. Robustness-Related Issues in Speaker Recognition. Singapore: Springer Singapore, 2017. http://dx.doi.org/10.1007/978-981-10-3238-7.
Full textRao, K. Sreenivasa, and Sourjya Sarkar. Robust Speaker Recognition in Noisy Environments. Cham: Springer International Publishing, 2014. http://dx.doi.org/10.1007/978-3-319-07130-5.
Full textSpeaker separation and tracking. Konstanz: Hartung-Gorre Verlag, 2006.
Find full textChin-Hui, Lee, Soong Frank K, and Paliwal K. K, eds. Automatic speech and speaker recognition: Advanced topics. Boston: Kluwer Academic Publishers, 1996.
Find full textBook chapters on the topic "Speaker recognition"
Zhang, David D. "Speaker Recognition." In Automated Biometrics, 179–201. Boston, MA: Springer US, 2000. http://dx.doi.org/10.1007/978-1-4615-4519-4_9.
Full textFarouk, Mohamed Hesham. "Speaker Recognition." In SpringerBriefs in Electrical and Computer Engineering, 33–35. Cham: Springer International Publishing, 2013. http://dx.doi.org/10.1007/978-3-319-02732-6_8.
Full textBeigi, Homayoon. "Speaker Recognition." In Fundamentals of Speaker Recognition, 543–59. Boston, MA: Springer US, 2011. http://dx.doi.org/10.1007/978-0-387-77592-0_17.
Full textFurui, Sadaoki. "Speaker Recognition." In Computational Models of Speech Pattern Processing, 132–42. Berlin, Heidelberg: Springer Berlin Heidelberg, 1999. http://dx.doi.org/10.1007/978-3-642-60087-6_14.
Full textBeigi, Homayoon. "Speaker Recognition." In Encyclopedia of Cryptography and Security, 1232–42. Boston, MA: Springer US, 2011. http://dx.doi.org/10.1007/978-1-4419-5906-5_747.
Full textCampbell, Joseph P. "Speaker Recognition." In Biometrics, 165–89. Boston, MA: Springer US, 1996. http://dx.doi.org/10.1007/0-306-47044-6_8.
Full textSomogyi, Zoltán. "Speaker Recognition." In The Application of Artificial Intelligence, 185–96. Cham: Springer International Publishing, 2021. http://dx.doi.org/10.1007/978-3-030-60032-7_7.
Full textBeigi, Homayoon. "Speaker Recognition." In Encyclopedia of Cryptography, Security and Privacy, 1–17. Berlin, Heidelberg: Springer Berlin Heidelberg, 2021. http://dx.doi.org/10.1007/978-3-642-27739-9_747-2.
Full textDrygajlo, Andrzej. "From Speaker Recognition to Forensic Speaker Recognition." In Biometric Authentication, 93–104. Cham: Springer International Publishing, 2014. http://dx.doi.org/10.1007/978-3-319-13386-7_8.
Full textBeigi, Homayoon. "Speaker Modeling." In Fundamentals of Speaker Recognition, 525–41. Boston, MA: Springer US, 2011. http://dx.doi.org/10.1007/978-0-387-77592-0_16.
Full textConference papers on the topic "Speaker recognition"
Shantoash, C., M. Vishal, S. Shruthi, and Gopalsamy N. Bharathi. "Speech Accent Recognition." In International Research Conference on IOT, Cloud and Data Science. Switzerland: Trans Tech Publications Ltd, 2023. http://dx.doi.org/10.4028/p-irai1l.
Full textBao, Yinan, Qianwen Ma, Lingwei Wei, Wei Zhou, and Songlin Hu. "Speaker-Guided Encoder-Decoder Framework for Emotion Recognition in Conversation." In Thirty-First International Joint Conference on Artificial Intelligence {IJCAI-22}. California: International Joint Conferences on Artificial Intelligence Organization, 2022. http://dx.doi.org/10.24963/ijcai.2022/562.
Full textTripathi, Supriya, and Smriti Bhatnagar. "Speaker Recognition." In 2012 3rd International Conference on Computer and Communication Technology (ICCCT 2012). IEEE, 2012. http://dx.doi.org/10.1109/iccct.2012.64.
Full textKohler, M. A., W. D. Andrews, J. P. Campbell, and J. Herndndez-Cordero. "Phonetic speaker recognition." In Conference Record. Thirty-Fifth Asilomar Conference on Signals, Systems and Computers. IEEE, 2001. http://dx.doi.org/10.1109/acssc.2001.987748.
Full textDesai, Veena, and Hema A. Murthy. "Distributed speaker recognition." In Interspeech 2004. ISCA: ISCA, 2004. http://dx.doi.org/10.21437/interspeech.2004-536.
Full textWenndt, Stanley J., and Ronald L. Mitchell. "Familiar speaker recognition." In ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2012. http://dx.doi.org/10.1109/icassp.2012.6288854.
Full textYu, Kin, John S. Mason, and John Oglesby. "Speaker recognition models." In 4th European Conference on Speech Communication and Technology (Eurospeech 1995). ISCA: ISCA, 1995. http://dx.doi.org/10.21437/eurospeech.1995-105.
Full textAndrews, Walter D., Mary A. Kohler, and Joseph P. Campbell. "Phonetic speaker recognition." In 7th European Conference on Speech Communication and Technology (Eurospeech 2001). ISCA: ISCA, 2001. http://dx.doi.org/10.21437/eurospeech.2001-416.
Full textDoddington, George. "Speaker recognition based on idiolectal differences between speakers." In 7th European Conference on Speech Communication and Technology (Eurospeech 2001). ISCA: ISCA, 2001. http://dx.doi.org/10.21437/eurospeech.2001-417.
Full textSchulze, Lucas, Renan Sebem, and Douglas Wildgrube Bertol. "Performance of PSO and GWO Algorithms Applied in Text-Independent Speaker Identification." In Congresso Brasileiro de Inteligência Computacional. SBIC, 2021. http://dx.doi.org/10.21528/cbic2021-98.
Full textReports on the topic "Speaker recognition"
Slyh, Raymond E., Eric G. Hansen, and Timothy R. Anderson. AFRL/HECP Speaker Recognition Systems for the 2004 NIST Speaker Recognition Evaluation. Fort Belvoir, VA: Defense Technical Information Center, December 2004. http://dx.doi.org/10.21236/ada430750.
Full textWu, Jin Chu, Alvin F. Martin, Craig S. Greenberg, and Raghu N. Kacker. Measurement uncertainties in speaker recognition evaluation. Gaithersburg, MD: National Institute of Standards and Technology, 2010. http://dx.doi.org/10.6028/nist.ir.7722.
Full textQuatieri, T. F., E. Singer, R. B. Dunn, D. A. Reynolds, and J. P. Campbell. Speaker and Language Recognition Using Speech Codec Parameters. Fort Belvoir, VA: Defense Technical Information Center, January 1999. http://dx.doi.org/10.21236/ada526525.
Full textMartinson, E., and W. Lawson. Learning Speaker Recognition Models through Human-Robot Interaction. Fort Belvoir, VA: Defense Technical Information Center, May 2011. http://dx.doi.org/10.21236/ada550036.
Full textHansen, John H. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. Fort Belvoir, VA: Defense Technical Information Center, October 2015. http://dx.doi.org/10.21236/ada623029.
Full textQuatieri, T. F. Nonlinear Auditory Modeling as a Basis for Speaker Recognition. Fort Belvoir, VA: Defense Technical Information Center, May 2002. http://dx.doi.org/10.21236/ada402327.
Full textWu, Jin Chu, Alvin F. Martin, Craig S. Greenberg, and Raghu N. Kacker. Data dependency on measurement uncertainties in speaker recognition evaluation. Gaithersburg, MD: National Institute of Standards and Technology, 2011. http://dx.doi.org/10.6028/nist.ir.7810.
Full textCieri, Christopher, Joseph P. Campbell, Hirotaka Nakasone, Kevin Walker, and David Miller. The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data. Fort Belvoir, VA: Defense Technical Information Center, January 2004. http://dx.doi.org/10.21236/ada523534.
Full textKarakowski, Joseph A., and Hai H. Phu. Text Independent Speaker Recognition Using A Fuzzy Hypercube Classifier. Fort Belvoir, VA: Defense Technical Information Center, October 1998. http://dx.doi.org/10.21236/ada354792.
Full textFerrer, Luciana, Mitchell McLaren, Nicolas Scheffer, Yun Lei, Martin Graciarena, and Vikramjit Mitra. A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation. Fort Belvoir, VA: Defense Technical Information Center, August 2013. http://dx.doi.org/10.21236/ada614010.
Full text