Dissertations / Theses on the topic 'Language identification'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Language identification.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Botha, Gerrti Reinier. "Text-based language identification for the South African languages." Pretoria : [s.n.], 2007. http://upetd.up.ac.za/thesis/available/etd-090942008-133715/.
Full textYin, Bo Electrical Engineering & Telecommunications Faculty of Engineering UNSW. "Language identification with language and feature dependency." Awarded By:University of New South Wales. Electrical Engineering & Telecommunications, 2009. http://handle.unsw.edu.au/1959.4/44045.
Full textNewman, Jacob Laurence. "Language identification using visual features." Thesis, University of East Anglia, 2011. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.539371.
Full textBerkling, Kay Margarethe. "Automatic language identification with sequences of language-independent phoneme clusters /." Full text open access at:, 1996. http://content.ohsu.edu/u?/etd,204.
Full textConti, Matteo. "Machine Learning Based Programming Language Identification." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2020. http://amslaurea.unibo.it/20875/.
Full textMunday, Emma Rachel. "Language and identification in contemporary Kazakhstan." Thesis, University of Edinburgh, 2010. http://hdl.handle.net/1842/6200.
Full textNkadimeng, Calvin. "Language identification using Gaussian mixture models." Thesis, Stellenbosch : University of Stellenbosch, 2010. http://hdl.handle.net/10019.1/4170.
Full textENGLISH ABSTRACT: The importance of Language Identification for African languages is seeing a dramatic increase due to the development of telecommunication infrastructure and, as a result, an increase in volumes of data and speech traffic in public networks. By automatically processing the raw speech data the vital assistance given to people in distress can be speeded up, by referring their calls to a person knowledgeable in that language. To this effect a speech corpus was developed and various algorithms were implemented and tested on raw telephone speech data. These algorithms entailed data preparation, signal processing, and statistical analysis aimed at discriminating between languages. The statistical model of Gaussian Mixture Models (GMMs) were chosen for this research due to their ability to represent an entire language with a single stochastic model that does not require phonetic transcription. Language Identification for African languages using GMMs is feasible, although there are some few challenges like proper classification and accurate study into the relationship of langauges that need to be overcome. Other methods that make use of phonetically transcribed data need to be explored and tested with the new corpus for the research to be more rigorous.
AFRIKAANSE OPSOMMING: Die belang van die Taal identifiseer vir Afrika-tale is sien ’n dramatiese toename te danke aan die ontwikkeling van telekommunikasie-infrastruktuur en as gevolg ’n toename in volumes van data en spraak verkeer in die openbaar netwerke.Deur outomaties verwerking van die ruwe toespraak gegee die noodsaaklike hulp verleen aan mense in nood kan word vinniger-up ”, deur te verwys hul oproepe na ’n persoon ingelichte in daardie taal. Tot hierdie effek van ’n toespraak corpus het ontwikkel en die verskillende algoritmes is gemplementeer en getoets op die ruwe telefoon toespraak gegee.Hierdie algoritmes behels die data voorbereiding, seinverwerking, en statistiese analise wat gerig is op onderskei tussen tale.Die statistiese model van Gauss Mengsel Modelle (GGM) was gekies is vir hierdie navorsing as gevolg van hul vermo te verteenwoordig ’n hele taal met’ n enkele stogastiese model wat nodig nie fonetiese tanscription nie. Taal identifiseer vir die Afrikatale gebruik GGM haalbaar is, alhoewel daar enkele paar uitdagings soos behoorlike klassifikasie en akkurate ondersoek na die verhouding van TALE wat moet oorkom moet word.Ander metodes wat gebruik maak van foneties getranskribeerde data nodig om ondersoek te word en getoets word met die nuwe corpus vir die ondersoek te word strenger.
Avenberg, Anna. "Automatic language identification of short texts." Thesis, Uppsala universitet, Avdelningen för beräkningsvetenskap, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-421032.
Full textForan, Jeffrey (Jeffrey Matthew) 1977. "Missing argument referent identification in natural language." Thesis, Massachusetts Institute of Technology, 1999. http://hdl.handle.net/1721.1/80532.
Full textIncludes bibliographical references (p. 54-55).
by Jeffrey Foran.
S.B.and M.Eng.
Gambardella, Maria-Elena. "Cleartext detection and language identification in ciphers." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-446439.
Full textWilliams, A. Lynn, and Carol Stoel-Gammon. "Identification of Speech-language Disorders in Toddlers." Digital Commons @ East Tennessee State University, 2016. https://dc.etsu.edu/etsu-works/2038.
Full textYang, Xi. "Discriminative acoustic and sequence models for GMM based automatic language identification /." View abstract or full-text, 2007. http://library.ust.hk/cgi/db/thesis.pl?ECED%202007%20YANG.
Full textVindfallet, Vegar Enersen. "Language Identification Based on Detection of Phonetic Characteristics." Thesis, Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon, 2012. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-19506.
Full textdel, Castillo Iglesias Daniel. "End-to-end Learning for Singing-Language Identification." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-277837.
Full textSång-språkidentifiering (SLID) består av att identifiera språket av de sjung- ade texterna direkt från en viss musikinspelning. Denna uppgift är av sär- skilt intresse för musikströmmande företag som drar nytta av applikationer för musiklokalisering. Däremot, är språk en komplex semantisk kvalitet av musikinspelningar, vilket gör upptäckten och utnyttjandet av dess karakteris- tiska funktioner extremt utmanande. Under de senaste åren har de flesta MIR- forskningsinsatser riktats mot problem som inte är relaterade till språk, och de flesta av framstegen med metoder för språkidentifiering förblir långt ifrån musikaliska applikationer. Detta arbete undersöker SLID-problemet, dess ut- maningar och begränsningar, med syftet att hitta en ny lösning som effektivt ut- nyttjar kraften hos djupa inlärningsarkitekturer och en relativt storskalig privat datasats. Som en del av datasatsförbehandlingen föreslås en ny metod för att identifiera högnivåstrukturen av låtar. Som klassificeringsmodell utbildas och utvärderas ett Temporal Convolutional Network (TCN) på musikinspelningar som hör till sju av de mest framstående språk på den globala musikmarkna- den. Även om resultaten visar mycket lägre prestation med avseende på den nuvarande bästa-möjliga-teknik, realiseras en grundlig diskussion med syftet att utforska begränsningarna för SLID, orsakerna till dålig prestation identi- fieras och den nuvarande kunskapen om SLID problemet utökas. Framtida förbättringar och arbetslinjer a gränsas med avseende att stimulera ytterligare forskning mot denna riktning.
Hubeika, Valiantsina. "Intersession Variability Compensation in Language and Speaker Identification." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2008. http://www.nusl.cz/ntk/nusl-235432.
Full textNariyama, Shigeko. "Referent identification for ellipted arguments in Japanese." Connent to thesis, 2000. http://repository.unimelb.edu.au/10187/2870.
Full textThese mechanisms stem from three tiers of linguistic system. Each sentence is structured in such a way as to anchor the subject., (using Sentence devices following the principle of direct alignment), with argument inferring cues on the verbal predicate (using Predicate devices). These subject oriented sentences are cohesively sequenced with the topic as a pivot (using Discourse devices). These subject oriented sentences are cohesively sequenced with the topic as a pivot (using Discourse devices). It is this topicalised subject which is most prone to ellipsis. I develop an algorithm summing up these mechanisms, using naturally occurring texts. I demonstrate how it can detect the existence of ellipsis in sentences and track the referential identity of it.
A generalisation for ellipsis resolution and the way in which the algorithm is constituted is as follows. Sentence devices formulate sentences to make the subject most prone to ellipsis, discourse devices enable the interaction of wa (the topic maker) and ga (the nominative marker), which mark the majority of subjects, to provide the default reading for referent identification of ellipsis, and predicate devices furnish additional cues to verify that reading. Since Japanese is an SOV language, it is intuitively tenable from the perspective of language processing that the interplay of wa/ga representing subjects gives initial cues from predicate devices. This multiple layering of mechanisms, therefore, can determine referents for ellipted arguments more accurately.
Samperio, Sanchez Nahum. "General learning strategies : identification, transfer to language learning and effect on language achievement." Thesis, University of Southampton, 2016. https://eprints.soton.ac.uk/412008/.
Full textKnudson, Ryan Charles. "Automatic Language Identification for Metadata Records: Measuring the Effectiveness of Various Approaches." Thesis, University of North Texas, 2015. https://digital.library.unt.edu/ark:/67531/metadc801895/.
Full textRupe, Jonathan C. "Vision-based hand shape identification for sign language recognition /." Link to online version, 2005. https://ritdml.rit.edu/dspace/handle/1850/940.
Full textStrømhaug, Tommy. "Discriminating Music,Speech and other Sounds and Language Identification." Thesis, Norwegian University of Science and Technology, Department of Computer and Information Science, 2008. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-8953.
Full textThe tasks : discriminating music, speech and other sounds and language identification have a broad range of applications in todays multilingual multimedia community. Both tasks gave a lot of possibilities regarding methods and development tools which also brings some risk. The Language Identification(LID) problem ended up with two different approaches. One approach was discarded due to poor results in the pre-study while the other approach had some promising potential but did not deliver as hoped in the first place. On the other hand, the music, speech discrimination was solved with great accuracy using 3 simple time domain features and Support Vector Machines(SVM). Adding 'other sounds' to this discrimination problem did complicate the problem but the final solution delivered great results using the enormous BBC Sound Effects library as examples of non speech and music. Both tasks were tried being solved using Gaussian Mixture Models(GMM) because of it's known great ability to model arbitrary feature space segmentations. The tools used were Matlab together with a number of different toolboxes explained further in the text.
Peyton, Kari C. "Literacy programs identification and assessment of English language learners /." Menomonie, WI : University of Wisconsin--Stout, 2007. http://www.uwstout.edu/lib/thesis/2007/.
Full textClark, Jessica Celeste. "Automated Identification of Adverbial Clauses in Child Language Samples." Diss., CLICK HERE for online access, 2009. http://contentdm.lib.byu.edu/ETD/image/etd2803.pdf.
Full textBrown, Brittany Cheree. "Automated Identification of Adverbial Clauses in Child Language Samples." BYU ScholarsArchive, 2013. https://scholarsarchive.byu.edu/etd/3404.
Full textMichaelis, Hali Anne. "Automated Identification of Relative Clauses in Child Language Samples." BYU ScholarsArchive, 2009. https://scholarsarchive.byu.edu/etd/1997.
Full textManning, Britney Richey. "Automated Identification of Noun Clauses in Clinical Language Samples." BYU ScholarsArchive, 2009. https://scholarsarchive.byu.edu/etd/2197.
Full textEhlert, Erika E. "Automated Identification of Relative Clauses in Child Language Samples." BYU ScholarsArchive, 2013. https://scholarsarchive.byu.edu/etd/3615.
Full textLareau, Jonathan. "Application of shifted delta cepstral features for GMM language identification /." Electronic version of thesis, 2006. https://ritdml.rit.edu/dspace/handle/1850/2686.
Full textFord, George Harold. "Spoken Language Identification from Processing and Pattern Analysis of Spectrograms." NSUWorks, 2014. http://nsuworks.nova.edu/gscis_etd/152.
Full textRock, Jonna. "Intergenerational Memory, Language and Jewish Identification of the Sarajevo Sephardim." Doctoral thesis, Humboldt-Universität zu Berlin, 2019. http://dx.doi.org/10.18452/19793.
Full textThis study analyzes issues of language and Jewish identification pertaining to the Sephardim in Sarajevo. Complexity of the Sarajevo Sephardi history means that I explore Bosnia-Herzegovina/Yugoslavia, Israel and Spain as possible identity-creating factors for the Sephardim in Sarajevo today. My findings show that the elderly Sephardic generation insist on calling their language Serbo-Croatian, whereas the younger generations do not really know what language they speak – and laugh about the linguistic situation in Sarajevo, or rely on made-up categories such as ‘Sarajevan.’ None of the interviewees emphasize the maintenance of Judeo-Spanish as a crucial condition for the continuation of Sephardic culture in Sarajevo. Similarly, the celebration of Jewish holidays is more important for the maintenance of identity across the generations than speaking a Jewish language. At the same time, the individuals also assert alternative forms of being Bosnian, ones that encompass multiple ethnicities and religious ascriptions. All the youngest interviewees however fear that the Sarajevo Sephardic identity will disappear in a near future. Unique characteristics of Sarajevo Sephardim include the status of the Sephardim and minorities in Bosnia and Herzegovina given (1) the discriminatory Bosnian Constitution; (2) the absence of a law in Bosnia on the return of property; (3) the special situation wherein three major ethnic groups, and not just a single, ethnically homogeneous ‘majority,’ dominate the country; (4) the lack of a well-developed Jewish cultural infrastructure. Despite all of this, a rapprochement between the Sarajevo Jewish Community members and their religion and tradition is taking place. This phenomenon is partly attributable to the Community’s young religious activist and chazan, Igor Kožemjakin, who has attracted younger members to the religious services.
Wong, Kim-Yung Eddie. "Automatic spoken language identification utilizing acoustic and phonetic speech information." Thesis, Queensland University of Technology, 2004. https://eprints.qut.edu.au/37259/1/Kim-Yung_Wong_Thesis.pdf.
Full textZeberlein, Jennifer Catherine. "Examination of the Accuracy of the Social Language Development Test for Identification of Social Language Impairments." Miami University / OhioLINK, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=miami1398975745.
Full textJohnson, Marie A. F., and A. Rice. "Early Childhood Language Delay: Identification of Children At-risk, Characteristics, and Strategies for Building Language Skills." Digital Commons @ East Tennessee State University, 2010. https://dc.etsu.edu/etsu-works/1550.
Full textJohnson, Marie A. F., and A. Rice. "Early Childhood Language Delay: Identification of Children At-risk, Characteristics, and Strategies for Building Language Skills." Digital Commons @ East Tennessee State University, 2011. https://dc.etsu.edu/etsu-works/1549.
Full textSmolenska, Greta. "Complex Word Identification for Swedish." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-352349.
Full textDwyer, Edward J. "Word Identification Strategies." Digital Commons @ East Tennessee State University, 2018. https://dc.etsu.edu/etsu-works/3417.
Full textDwyer, Edward J. "Word Identification Strategies." Digital Commons @ East Tennessee State University, 2016. https://dc.etsu.edu/etsu-works/3419.
Full textChou, Christine S. (Christine Susan). "Language identification through parallel phone recognition dc by Christine S. Chou." Thesis, Massachusetts Institute of Technology, 1994. http://hdl.handle.net/1721.1/34056.
Full textXiang, Yang. "Grammatical Error Identification for Learners of Chinese as a Foreign Language." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-361927.
Full textGerl, Armin. "Modelling of a privacy language and efficient policy-based de-identification." Thesis, Lyon, 2019. http://www.theses.fr/2019LYSEI105.
Full textThe processing of personal information is omnipresent in our datadriven society enabling personalized services, which are regulated by privacy policies. Although privacy policies are strictly defined by the General Data Protection Regulation (GDPR), no systematic mechanism is in place to enforce them. Especially if data is merged from several sources into a data-set with different privacy policies associated, the management and compliance to all privacy requirements is challenging during the processing of the data-set. Privacy policies can vary hereby due to different policies for each source or personalization of privacy policies by individual users. Thus, the risk for negligent or malicious processing of personal data due to defiance of privacy policies exists. To tackle this challenge, a privacy-preserving framework is proposed. Within this framework privacy policies are expressed in the proposed Layered Privacy Language (LPL) which allows to specify legal privacy policies and privacy-preserving de-identification methods. The policies are enforced by a Policy-based De-identification (PD) process. The PD process enables efficient compliance to various privacy policies simultaneously while applying pseudonymization, personal privacy anonymization and privacy models for de-identification of the data-set. Thus, the privacy requirements of each individual privacy policy are enforced filling the gap between legal privacy policies and their technical enforcement
Asadullah, Munshi. "Identification of Function Points in Software Specifications Using Natural Language Processing." Thesis, Paris 11, 2015. http://www.theses.fr/2015PA112228/document.
Full textThe inevitable emergence of the necessity to estimate the size of a software thus estimating the probable cost and effort is a direct outcome of increasing need of complex and large software in almost every conceivable situation. Furthermore, due to the competitive nature of the software development industry, the increasing reliance on accurate size estimation at early stages of software development becoming a commonplace practice. Traditionally, estimation of software was performed a posteriori from the resultant source code and several metrics were in practice for the task. However, along with the understanding of the importance of code size estimation in the software engineering community, the realization of early stage software size estimation, became a mainstream concern. Once the code has been written, size and cost estimation primarily provides contrastive study and possibly productivity monitoring. On the other hand, if size estimation can be performed at an early development stage (the earlier the better), the benefits are virtually endless. The most important goals of the financial and management aspect of software development namely development cost and effort estimation can be performed even before the first line of code is being conceived. Furthermore, if size estimation can be performed periodically as the design and development progresses, it can provide valuable information to project managers in terms of progress, resource allocation and expectation management. This research focuses on functional size estimation metrics commonly known as Function Point Analysis (FPA) that estimates the size of a software in terms of the functionalities it is expected to deliver from a user’s point of view. One significant problem with FPA is the requirement of human counters, who need to follow a set of standard counting rules, making the process labour and cost intensive (the process is called Function Point Counting and the professional, either analysts or counters). Moreover, these rules, in many occasion, are open to interpretation, thus they often produce inconsistent counts. Furthermore, the process is entirely manual and requires Function Point (FP) counters to read large specification documents, making it a rather slow process. Some level of automation in the process can make a significant difference in the current counting practice. Automation of the process of identifying the FPs in a document accurately, will at least reduce the reading requirement of the counters, making the process faster and thus shall significantly reduce the cost. Moreover, consistent identification of FPs will allow the production of consistent raw function point counts. To the best of our knowledge, the works presented in this thesis is an unique attempt to analyse specification documents from early stages of the software development, using a generic approach adapted from well established Natural Language Processing (NLP) practices
Eyecioglu, Ozmutlu Asli. "Paraphrase identification using knowledge-lean techniques." Thesis, University of Sussex, 2016. http://sro.sussex.ac.uk/id/eprint/65497/.
Full textSardinha, Antonio Paulo Berber. "Automatic identification of segments in written texts." Thesis, University of Liverpool, 1997. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.364227.
Full textDwyer, Edward J. "Encouraging Word Identification Skills." Digital Commons @ East Tennessee State University, 2016. https://dc.etsu.edu/etsu-works/3401.
Full textAlharthi, Haifa. "Natural Language Processing for Book Recommender Systems." Thesis, Université d'Ottawa / University of Ottawa, 2019. http://hdl.handle.net/10393/39134.
Full textVdovichenko, Susan E. C. "The Beholder’s Eye: How Self-Identification and Linguistic Ideology Affect Shifting Language Attitudes and Language Maintenance in Ukraine." The Ohio State University, 2011. http://rave.ohiolink.edu/etdc/view?acc_num=osu1305582855.
Full textSin, Wan-san Dorene. "The identification and characterization of Cantonese-speaking children with specific language impairment." Click to view the E-thesis via HKUTO, 2000. http://sunzi.lib.hku.hk/hkuto/record/B3620769X.
Full text"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, May 10, 2000." Also available in print.
Combrinck, Hendrik Petrus. "A cost, complexity and performance comparison of two automatic language identification architectures." Pretoria : [s.n.], 2006. http://upetd.up.ac.za/thesis/available/etd-12212006-141335/.
Full textEsquierdo, Jennifer Joy. "Early identification of Hispanic English language learners for gifted and talented programs." Diss., Texas A&M University, 2003. http://hdl.handle.net/1969.1/3944.
Full textSegers, Vaughn Mackman. "The efficacy of the Eigenvector approach to South African sign language identification." Thesis, University of the Western Cape, 2010. http://etd.uwc.ac.za/index.php?module=etd&action=viewtitle&id=gen8Srv25Nme4_2697_1298280657.
Full textThe communication barriers between deaf and hearing society mean that interaction between these communities is kept to a minimum. The South African Sign Language research group, Integration of Signed and Verbal Communication: South African Sign Language Recognition and Animation (SASL), at the University of the Western Cape aims to create technologies to bridge the communication gap. In this thesis we address the subject of whole hand gesture recognition. We demonstrate a method to identify South African Sign Language classifiers using an eigenvector ap- proach. The classifiers researched within this thesis are based on those outlined by the Thibologa Sign Language Institute for SASL. Gesture recognition is achieved in real- time. Utilising a pre-processing method for image registration we are able to increase the recognition rates for the eigenvector approach.
Van, Der Merwe Ruan Henry. "Triplet entropy loss: improving the generalisation of short speech language identification systems." Master's thesis, Faculty of Science, 2021. http://hdl.handle.net/11427/33953.
Full text