Doctoral dissertations on the topic "Identification du langage"
Gallant, Isabelle. "Le langage graphique des émotions : identification d'émotions exprimées par le dessin". Thesis, National Library of Canada = Bibliothèque nationale du Canada, 2001. http://www.collectionscanada.ca/obj/s4/f2/dsk3/ftp04/MQ60721.pdf.
Messaoudi, Leila. "Des technolectes : présentation, identification, fonctionnement ; application linguistique à l'arabe standard". Paris 5, 1990. http://www.theses.fr/1990PA05H030.
This research addresses three main questions. First, how can we distinguish special languages (i.e., technolects) from common language? Second, what are the most important ways of identifying technolects? Third, what kinds of linguistic procedures are at work in technolects, with special attention paid to technolects in Standard Arabic? Standard Arabic is the applied domain of this study, with data collected from bilingual French-Arabic dictionaries and notices in two fields: agriculture and chemistry. The results show that technolects rely on linguistic procedures that can be grouped into two classes: formal and non-formal. The first group comprises compounds and derivatives; the second covers semantic matters such as the "mode of designation" (which seems to be specific to technolects) and some cases of semic relations within and between technolectal words. Despite the drawbacks of "componential analysis", this method proved relevant for studying semantic relations such as synonymy, polysemy and homonymy. However, the analysis of technolects in Standard Arabic runs into difficulties owing to the lack of real in situ usage, which seems to be missing in Standard Arabic.
Vahine, Théodora. "Traitements visuels précoces du langage écrit : études chez l'enfant et l'adulte jeune". Thesis, Bordeaux, 2017. http://www.theses.fr/2017BORD0857/document.
The main objective was to study the involvement of the magnocellular and parvocellular visual systems in written language processing, specifically in word identification. Four studies were carried out to document the respective roles of these two systems for different components of written word structure: the letter (Study 1), word length (Study 2), word shape (Study 3) and the orthographic neighborhood (Study 4). The dissociation of the two visual systems was based on their specific functional characteristics: sensitivity to low spatial frequencies and luminance contrast for the magnocellular system, and sensitivity to medium and high spatial frequencies and chromatic contrast for the parvocellular system. The participants were young adult normal readers and 10- to 11-year-old children who were novice readers, so that the involvement of each visual system could be considered at two stages of development: mature in young adults and still maturing in children. The results confirmed the prominent role of parvocellular processing, which is consistent with the privilege accorded to the processing of letters and letter features in visual word recognition. Word length processing, on the other hand, was shown to be a selectively magnocellular dimension. All results are discussed in the framework of the coarse-to-fine approach.
Dupoux, Emmanuel. "Identification des mots parlés : détection de phonèmes et unité prélexicale". Paris, EHESS, 1989. http://www.theses.fr/1989EHES0327.
Fourour, Nordine. "Identification et catégorisation automatique des entités nommées dans les textes français". Nantes, 2004. http://www.theses.fr/2004NANT2126.
Named entity (NE) recognition is a recurring problem across the different domains of Natural Language Processing. Building on a linguistic investigation that sets up operational parameters defining the concept of named entity, a state of the art of the domain, and a corpus investigation using referential and graphical criteria, we present Nemesis, a French named entity recognizer. The system analyzes internal and external evidence using grammar rules and trigger-word lexicons, and includes a learning process. With these processes, Nemesis achieves about 90% precision and 80% recall. To increase recall, we put forward optional modules (analysis of the wider context and use of the Web as a source of new contexts) and investigate setting up a module for disambiguation and grammar-rule inference.
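The precision and recall figures quoted in this abstract can be combined into a single F1 score, the harmonic mean of the two. The abstract itself does not report an F-measure, so the value computed below is a derived illustration rather than a published result, and the helper name `f1_score` is hypothetical.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (the balanced F-measure)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Approximate figures quoted in the abstract: about 90% precision, 80% recall.
nemesis_f1 = f1_score(0.90, 0.80)
print(f"F1 = {nemesis_f1:.3f}")  # prints "F1 = 0.847"
```

Because the harmonic mean is pulled towards the lower of the two values, raising recall (as the optional modules aim to do) would move this score more than an equal gain in precision.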
Trouilleux, François. "Identification des reprises et interprétation automatique des expressions pronominales dans des textes en français". Clermont-Ferrand 2, 2001. https://hal.archives-ouvertes.fr/tel-01152394.
Stern, Rosa. "Identification automatique d'entités pour l'enrichissement de contenus textuels". Phd thesis, Université Paris-Diderot - Paris VII, 2013. http://tel.archives-ouvertes.fr/tel-00939420.
Al, Mahrous Mazen. "Génération de test fonctionnel de circuits digitaux décrits avec un langage déclaratif : Lustre". Phd thesis, Grenoble INPG, 1990. http://tel.archives-ouvertes.fr/tel-00337894.
Kanoun, Slim. "Identification et analyse de textes arabes par approche affixale". Rouen, 2002. http://www.theses.fr/2002ROUES040.
The work presented in this dissertation tackles the problems of script differentiation and off-line text recognition in multilingual Arabic and Latin documents. The first part describes a method for differentiating Arabic from Latin text, whether printed or handwritten. The second part proposes a new approach, called the affixal approach, for Arabic word recognition and text analysis. This approach models morpho-syntactic entities (the basic morphemes of words) and integrates the morpho-phonological aspects of Arabic vocabulary into the recognition process, in contrast to traditional approaches, which model graphic entities (word, letter, pseudo-word). The tests carried out show the contribution of the approach to simplifying recognition and to the morpho-syntactic categorization of the words in an Arabic text.
Duermael, Fabrice. "Référence aux actions dans des dialogues de commande homme-machine". Vandoeuvre-les-Nancy, INPL, 1994. http://docnum.univ-lorraine.fr/public/INPL_T_1994_DUERMAEL_F.pdf.
Rosá, Aiala. "Identification de opiniónes de differentes fuentes en textos en español". Thesis, Paris 10, 2011. http://www.theses.fr/2011PA100127.
This work presents a study of linguistic expressions of opinion from different sources in Spanish texts. It includes the definition of a model for opinion predicates and their arguments (source, topic and message), the creation of a lexicon of opinion predicates annotated with information from the model, and the implementation of three systems. The first system, based on contextual rules, achieves good F-measure scores (partial match): predicate, 92%; source, 81%; topic, 75%; message, 89%; full opinion, 85%. In addition, for source identification the F-measure for exact match is 79%. The second system, based on Conditional Random Fields (CRF), was developed only for the identification of sources and gives an F-measure of 76% (exact match). The third system, which combines the two techniques (rules and CRF), gives an F-measure of 83% (exact match), showing that the combination improves on either technique alone. As regards the identification of sources, our system gives very satisfactory results compared with work on languages other than Spanish, whose scores fall between 63% and 89.5%. Moreover, in addition to the systems built for the identification of opinions, our work has led to the construction of several resources for Spanish: a lexicon of opinion predicates, a 13,000-word corpus annotated with opinions, and a 40,000-word corpus annotated with opinion predicates and sources.
Roll, Patrice. "Les épilepsies humaines associées à d'autres pathologies cérébrales : identification d'un gène responsable d'une épilepsie rolandique et d'un trouble du langage". Aix-Marseille 2, 2005. http://www.theses.fr/2005AIX20653.
Epilepsy is one of the most common neurological diseases (1-4%). Language impairments are very frequent, affecting 2-7% of children entering school. Language processing deserves particular attention in rolandic epilepsy, as discharges involve the perisylvian language areas. A gene for oral and speech dyspraxia associated with rolandic epilepsy was mapped at Xq21-q22. Systematic screening identified the disease-causing mutation within the SRPX2 gene, which encodes a secreted sushi-repeat-containing protein. The mutation created an N-glycosylation site. In cultured cells, mutant SRPX2 protein was N-glycosylated and either secreted or retained in the endoplasmic reticulum as ubiquitin-linked derivatives. In the human adult brain, SRPX2 was expressed in neurons of the rolandic area. In the murine brain, Srpx2 protein expression appeared in neurons at birth. Altogether, our data identify SRPX2 as an important molecular agent of language processing in the rolandic area.
Berthod, Christophe. "Identification paramétrique de grandes structures : réanalyse et méthode évolutionnaire". Phd thesis, Université de Franche-Comté, 1998. http://tel.archives-ouvertes.fr/tel-00011640.
Part one: Study of approximate reanalysis methods for modified mechanical structures
When the design parameters of the model vary, a reanalysis is needed to obtain the eigensolutions (modes and frequencies) of the modified system. An approximate reanalysis strategy of the Rayleigh-Ritz type is presented: it is faster and cheaper than an exact reanalysis, while offering satisfactory accuracy thanks to the contribution of static residual vectors.
Part two: Application of an evolutionary optimization method to model updating
In this part, we propose to adapt an evolutionary method to the problem of parametric identification. Inspired by the evolutionary principles of genetic algorithms, it relies on the information provided by a cost function representing the distance between an updated model and the real structure. Heuristic operators are introduced to favor the search for solutions that minimize the function.
Part three: The Proto-Dynamique software
This part presents the working environment used to program the techniques formulated in the dissertation and to run the numerical tests. Proto, written in the Matlab language, is a development platform that brings together analysis tools and model-updating methods.
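The idea in part two of this abstract, searching for model parameters that minimize a cost function measuring the distance between a model and the real structure, can be sketched with a very small elitist mutation loop. This is a toy illustration and not the method of the thesis: the two-parameter `toy_model`, the "measured" frequencies, and every name and setting below are assumptions, and the original work was programmed in Matlab rather than Python.

```python
import random

# Hypothetical "measured" eigenfrequencies of the real structure (made-up values).
MEASURED = [3.0, 5.0]

def toy_model(params):
    """Toy stand-in for a structural model: two parameters give two frequencies."""
    p0, p1 = params
    return [p0, p0 + p1]

def cost(params):
    """Squared distance between the model output and the measured frequencies."""
    return sum((m - t) ** 2 for m, t in zip(toy_model(params), MEASURED))

def evolve(start, sigma=0.3, steps=2000, seed=0):
    """Elitist mutation loop: a mutated candidate is kept only if it lowers the cost."""
    rng = random.Random(seed)
    best, best_cost = list(start), cost(start)
    for _ in range(steps):
        child = [x + rng.gauss(0.0, sigma) for x in best]
        child_cost = cost(child)
        if child_cost < best_cost:
            best, best_cost = child, child_cost
    return best, best_cost

start = [1.0, 1.0]
best, best_cost = evolve(start)
print("initial cost:", cost(start), "final cost:", best_cost)
```

The loop uses mutation only; a genetic algorithm in the usual sense would also maintain a population and apply crossover, and the heuristic operators mentioned in the abstract are not modeled here.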
Arias, Aguilar José Anibal André-Obrecht Régine. "Méthodes spectrales pour le traitement automatique de documents audio analyse, modélisation et détection automatique des disfluences dans le dialogue oral spontané contraintele cas du contrôle aérien /". Toulouse (Université Paul Sabatier, Toulouse 3), 2009. http://thesesups.ups-tlse.fr/436.
Boudjedir, Sihem. "Le rapport à la langue de jeunes "issus" de l'immigration : de l'identité à l'origine". Thesis, Strasbourg, 2018. http://www.theses.fr/2018STRAG006.
This dissertation focuses on the relations between the subject and languages, in particular in a socio-cultural context marked by cultural and linguistic plurality. It explores the link between the subject and language(s) and, more precisely, the way languages bear on identity construction. We try to clarify the notion of a language, its appropriation, sharing and transmission from the psychological point of view, and its functions, such as family heritage, from the cultural point of view. The thesis approaches the subject in its singularity, following the method of clinical analysis of research interviews, taking the narrow link between the and the culture for its starting point, and is based mainly on psychoanalytic theory. The analysis of the interviews shows that, in the words of the subjects studied, the link with the language has an effect on social and family anchoring. The language of the country the parents come from appears as the 'origin language', which has 'affiliation' as its function and makes it possible to create and maintain the link with family origins. By establishing kinship links in a movement that recreates them endlessly, the connection with 'the origin language', which is part of the genealogical line, assigns a place in the genealogical order and allows openness to other possible affiliations. Thus, the link with the language appears as a function that constitutes 'family' in terms of continuity and forms an origin that is external to the subject but central to identity construction and the feeling of belonging.
Asadullah, Munshi. "Identification of Function Points in Software Specifications Using Natural Language Processing". Thesis, Paris 11, 2015. http://www.theses.fr/2015PA112228/document.
The need to estimate the size of software, and thereby its probable cost and effort, is a direct outcome of the growing demand for large, complex software in almost every conceivable situation. Furthermore, owing to the competitive nature of the software development industry, reliance on accurate size estimation at early stages of development is becoming commonplace. Traditionally, software size was estimated a posteriori from the resulting source code, and several metrics were in use for the task. However, as the software engineering community came to understand the importance of code size estimation, early-stage size estimation became a mainstream concern. Once the code has been written, size and cost estimation mainly supports contrastive study and possibly productivity monitoring. If, on the other hand, size estimation can be performed at an early development stage (the earlier the better), the benefits are substantial. The most important financial and management goals of software development, namely development cost and effort estimation, can then be addressed even before the first line of code is written. Furthermore, if size estimation can be performed periodically as design and development progress, it can provide valuable information to project managers in terms of progress, resource allocation and expectation management. This research focuses on the functional size estimation metric commonly known as Function Point Analysis (FPA), which estimates the size of software in terms of the functionalities it is expected to deliver from a user's point of view.
One significant problem with FPA is that it requires human counters, who must follow a set of standard counting rules, which makes the process labour- and cost-intensive (the process is called Function Point Counting, and the professionals are called either analysts or counters). Moreover, these rules are in many cases open to interpretation and therefore often produce inconsistent counts. Furthermore, the process is entirely manual and requires Function Point (FP) counters to read large specification documents, which makes it rather slow. Some level of automation can make a significant difference to current counting practice. Automating the accurate identification of FPs in a document will at least reduce the amount of reading required of counters, making the process faster and thus significantly reducing its cost. Moreover, consistent identification of FPs will allow the production of consistent raw function point counts. To the best of our knowledge, the work presented in this thesis is a unique attempt to analyse specification documents from the early stages of software development using a generic approach adapted from well-established Natural Language Processing (NLP) practices.
Chabrier, Jean-Claude C. "Analyses de musiques traditionnelles : identification de systèmes acoustiques, scalaires, modaux & instrumentaux : représentation morpho-mélodique, structuro-modale & du langage instrumental". Paris (213 Av. de Versailles, 75016) : Arabesques, 1996. http://catalogue.bnf.fr/ark:/12148/cb371718879.
Devilliers, Elise. "Lecture et identification des mots selon les matières comparaison de deux profils d'enfants (enfants dyslexiques et enfants normo-lecteurs) /". Nancy : Université Henri-Poincaré, 2009. http://www.scd.uhp-nancy.fr/docnum/SCDMED_MORT_2009_DEVILLIERS_ELISE.pdf.
Braud, Chloé. "Identification automatique des relations discursives implicites à partir de corpus annotés et de données brutes". Sorbonne Paris Cité, 2015. https://hal.inria.fr/tel-01256884.
Building discourse parsers is currently a major challenge in Natural Language Processing. The main difficulty is identifying the relations (such as Explanation, Contrast, etc.) that link spans of text in a document. In particular, identifying so-called implicit relations, that is, relations that lack a discourse connective (such as but, because, etc.), is known to be a hard task, since it requires taking various factors into account and leads to specific difficulties in a classification system. In this thesis, we use raw data to improve the automatic identification of implicit relations. First, we propose to use discourse markers to annotate new data automatically. We use domain adaptation methods to deal with the distributional differences between automatically and manually annotated data: we report improvements for systems built on the French corpus ANNODIS and on the English corpus Penn Discourse Treebank. Then, we propose to use word representations built from raw data, which may be automatically annotated with discourse markers, to enrich a representation of the data based on the words found in the spans of text to be linked. We report improvements on the English corpus Penn Discourse Treebank, and in particular we show that this method alleviates the need for rich resources, which are available for only a few languages.
Capponi, Cécile. "Identification et Exploitation des Types dans un modèle de connaissances à objets". Phd thesis, Université Joseph Fourier (Grenoble), 1995. http://tel.archives-ouvertes.fr/tel-00345845.
Le, Priol Florence. "Extraction et capitalisation automatiques de connaissances à partir de documents textuels : Seek-Java : identification et interprétation de relations entre concepts". Paris 4, 2000. http://www.theses.fr/2000PA040172.
Chakkour, Fairouz Arina. "Identification de relations sémantiques dans des textes scientifiques et techniques en exploitant le raisonnement à partir de cas". Nancy 1, 2003. http://www.theses.fr/2003NAN10192.
Our approach provides a semantic interpretation of an utterance in natural language, which can be used in a textual data mining process. This analysis relies on case-based reasoning principles. The semantic interpretation of an utterance consists in identifying the semantic relation expressed by the utterance and its arguments. Semantic relations contribute substantially to the meaning of a natural-language text, and they link concepts in an ontology. The analysis in our model is based on several types of knowledge: domain knowledge (an ontology), linguistic knowledge (transformations and adaptation functions) and experiential knowledge (a case base). We adopt a case-based reasoning model founded on classification principles and similarity paths in order to guarantee adaptability. We tested the model on two corpora: the first is a corpus of texts in the domain of professional stress, and the second is a corpus of texts in agriculture.
Hatier, Sylvain. "Identification et analyse linguistique du lexique scientifique transdisciplinaire. Approche outillée sur un corpus d'articles de recherche en SHS". Thesis, Université Grenoble Alpes (ComUE), 2016. http://www.theses.fr/2016GREAL027/document.
In this dissertation we study the French cross-disciplinary scientific lexicon (CSL), a lexicon that belongs to the genre of scientific articles in the humanities and social sciences. As the CSL is commonly used in scientific texts, it is a useful gateway for exploring this genre. This lexicon also has practical applications in automatic term identification and in foreign language teaching in academic settings. To this end, we apply a corpus-driven approach to extract and structure the CSL lexical units, which are difficult to circumscribe. The method relies on criteria of cross-disciplinarity and specificity and on the lexico-syntactic properties of the CSL lexical units. As a result, we designed a lexical resource that includes lexical, syntactic and semantic information. By analyzing the combinatorial properties extracted from a parsed corpus of scientific articles, we carried out a study of the CSL based on its genre-specific use. We follow the same approach to identify cross-disciplinary meanings for the CSL nouns and to design a nominal semantic classification. This two-level typology allows us to explore rhetorical and phraseological properties of the CSL by identifying frequent syntactico-semantic patterns.
Parcollet, Titouan. "Quaternion neural networks A survey of quaternion neural networks - Chapter 2 Real to H-space Autoencoders for Theme Identification in Telephone Conversations - Chapter 7". Thesis, Avignon, 2019. http://www.theses.fr/2019AVIG0233.
In recent years, deep learning has become the leading approach to modern artificial intelligence (AI). The significant reduction in the processing time required to train AI-based models, together with the growing amount of available data, has made deep neural networks (DNN) the strongest solution for complex real-world problems. However, a major challenge for artificial neural architectures lies in better accounting for the high dimensionality of the data. To alleviate this issue, neural networks (NN) based on complex and hypercomplex algebras have been developed. The natural multidimensionality of the data is elegantly embedded within the complex and hypercomplex neurons composing the model. In particular, quaternion neural networks (QNN) have been proposed to deal with features of up to four dimensions, based on the quaternion representation of rotations and orientations. Unfortunately, and in contrast to complex-valued neural networks, which are nowadays known as a strong alternative to real-valued neural networks, QNNs suffer from numerous limitations that are carefully addressed in the different parts of this thesis. The thesis consists of three parts that gradually introduce the missing concepts of QNNs, to make them a strong alternative to real-valued NNs. The first part introduces and lists previous findings on quaternion numbers and quaternion neural networks, to define the context and a solid basis for building elaborate QNNs. The second part introduces state-of-the-art quaternion neural networks for a fair comparison with real-valued neural architectures. More precisely, QNNs were limited by their simple architectures, which were mostly composed of a single, shallow hidden layer. In this part, we propose to bridge the gap between quaternion and real-valued models by presenting different quaternion architectures. First, basic paradigms such as autoencoders and deep fully-connected neural networks are introduced.
Then, more elaborate convolutional and recurrent neural networks are extended to the quaternion domain. Experiments comparing QNNs with equivalent NNs were conducted on real-world tasks across various domains, including computer vision, spoken language understanding and speech recognition. QNNs improve performance while reducing the number of neural parameters needed compared to real-valued neural networks. QNNs are then extended to unconventional settings. In a conventional QNN scenario, input features are manually segmented into three or four components, enabling further quaternion processing. Unfortunately, there is no evidence that such manual segmentation is the representation best suited to the task considered. Moreover, manual segmentation drastically restricts the field of application of QNNs to four-dimensional use cases. The third part therefore introduces a supervised and an unsupervised model to extract meaningful and disentangled quaternion input features from any real-valued input signal, enabling the use of QNNs regardless of the dimensionality of the task considered. Experiments on speech recognition and document classification show that the proposed approaches outperform traditional quaternion features.
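Quaternion layers are commonly built on the Hamilton product, which multiplies two quaternions while mixing all four components. The abstract does not spell out this operation, so the sketch below is a generic illustration in plain Python rather than the thesis's implementation; the function name `hamilton_product` and the tuple layout `(r, i, j, k)` are assumptions.

```python
def hamilton_product(p, q):
    """Multiply two quaternions given as (r, i, j, k) tuples."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (
        a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,  # real part
        a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,  # i component
        a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,  # j component
        a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2,  # k component
    )

# Basis check: i * j = k, while j * i = -k (the product is not commutative).
print(hamilton_product((0, 1, 0, 0), (0, 0, 1, 0)))  # prints (0, 0, 0, 1)
print(hamilton_product((0, 0, 1, 0), (0, 1, 0, 0)))  # prints (0, 0, 0, -1)
print(hamilton_product((1, 2, 3, 4), (5, 6, 7, 8)))  # prints (-60, 12, 30, 24)
```

One quaternion weight carries four real numbers but acts on all four input components at once, which is one common way of explaining the reduced parameter count that the abstract reports.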
Arias, Aguilar José Anibal. "Méthodes spectrales pour le traitement automatique de documents audio". Toulouse 3, 2008. http://thesesups.ups-tlse.fr/436/.
Disfluencies are a frequent phenomenon in any spontaneous speech production; they consist of interruptions of the normal flow of speech. They have given rise to numerous studies in Natural Language Processing, since their study and precise identification are essential from both a theoretical and an applied perspective. However, most research on the subject concerns everyday uses of language: "small talk" dialogues, timetable requests, speeches, etc. But what about spontaneous speech produced in a constrained setting? To our knowledge, no study has been carried out in this context. Yet we know that using a "specialized language" within a given task leads to specific behaviours. Our thesis is devoted to the linguistic and computational study of disfluencies in such a setting: air traffic control dialogues, which entail both pragmatic and linguistic constraints. We carry out an exhaustive study of disfluency phenomena in this context. First, we conduct a fine-grained analysis of these phenomena. Next, we model them at an abstract level of representation, which yields the patterns corresponding to the different configurations observed. Finally, we propose a methodology for automatic processing. It consists of several algorithms that identify the different phenomena, even in the absence of explicit markers, and is integrated into an automatic speech processing system. The methodology is validated on a corpus of 400 utterances.
Baur, Ruedi. "Entre identité et identification : les valeurs civiques des systèmes de représentation publics". Thesis, Strasbourg, 2016. http://www.theses.fr/2016STRAC009/document.
This research documents the evolution of visual representations of institutions, events and public bodies between 1970 and 2010: designers' initial fascination with the concepts of "corporate design" and "corporate identity" systems, the progressive reinforcement of "marketing", then the switch to "branding" and the artificial values attached to brands. This evolution is analysed chronologically through a series of personal experiments reflecting an intentionally more civic-minded approach to design, one that permits the creation of real alternative methods. Analysing the relationship between identification and identity leads to a proposed shift in focus, away from brand visibility and enhancement and towards approaches that, with the help of shared visual languages, place at the centre the intelligibility of complex structures; the representation of plurality, trends and relationships; the integration of context; and the coordination of multiple expressions.
Botha, Gerrti Reinier. "Text-based language identification for the South African languages". Pretoria : [s.n.], 2007. http://upetd.up.ac.za/thesis/available/etd-090942008-133715/.
Blanc, Jean-Marc. "Traitement de la prosodie par un réseau récurrent temporel : un cadre unifié pour l'identification automatique des langues, des attitudes prosodiques, et des catégories lexicales". Lyon 2, 2005. http://theses.univ-lyon2.fr/documents/lyon2/2005/blanc_jm.
Prosody is directly accessible to us when we hear a foreign language. What mechanism is involved in the processing of speech prosody? A Temporal Recurrent Network (TRN), inspired by neurophysiological studies of sequence learning in primates, was tested on the identification of prosodic contours. Language rhythm can be defined globally. The TRN identifies five European languages based on an automatic segmentation of speech into consonants and vowels (50%), and also from a cochleogram (65%). Using the fundamental frequency, the network identifies six prosodic attitudes (syntactic modes and emotions) and distinguishes content words from function words, two lexical categories that could bootstrap syntax. When the model is distorted to reduce its temporal sensitivity, this categorization can no longer be achieved, and the pattern of responses to two rapid auditory processing tasks resembles that of children with Specific Language Impairment, in particular for syntax. In addition, the model replicates two prosodic discrimination experiments conducted with newborns: languages are distinguished according to their rhythm class, and function words are discriminated from content words. In brief, the TRN accomplishes three tasks in which prosody is defined over different temporal domains, from the language (global domain) to the word (local domain): automatic identification of languages, identification of prosodic attitudes, and discrimination of content and function words. Finally, auditory and language-specific disorders can be simulated when the temporal sensitivity of the network is weakened.
Grosser, David. "Construction itérative de bases de connaissances descriptives et classificatoires avec la plate-forme à objets IKBS : application à la systèmatique des coraux des Mascareignes". La Réunion, 2002. http://tel.archives-ouvertes.fr/tel-00003415/fr/.
Pełny tekst źródłaTurchet, Philippe. "Identification de ruptures de compréhension dialogique en contexte interculturel à partir d’indices corporels". Thesis, Paris 10, 2017. http://www.theses.fr/2017PA100174/document.
This PhD thesis is set in the context of intercultural communication and is based on the analysis of a corpus of 184,112 words (56 speakers, 13 nationalities) and 9,155 dialogic speech turns. The aim is to look for one or more mimo-gestural cues of a breakdown in comprehension in the silent speaker during the interaction, in the absence of verbalization. Part I reviews the state of the art on locating comprehension breakdowns in exolingual situations. Part II, a preliminary experiment (methodology and content analysis), delineates 177 "text-blocks" in which breakdowns in comprehension occur. A mimo-gestural item, linked to a short, fast, off-centre glance (ERBR) and often replicated, is retained. Part III is a first experiment that follows from the preliminary one. The independent variable (verbal language) and the dependent variable (mimo-gestuality) are swapped to check whether comprehension breakdowns, once related to the corresponding verbal criteria, can be identified directly from precise mimo-gestural attitudes. The ERBR cue is therefore searched for systematically in the corpus: its widely replicated presence in situations of comprehension breakdown suggests that it is a signal of linguistic non-comprehension. A strong correlation between verbal language and non-conscious gesture is thus established in a dialogic, multicultural situation. Identifying moments of non-understanding as soon as they have an impact could be a real contribution to language teaching in today's cosmopolitan societies.
Nosary, Ali. "Reconnaissance automatique de textes manuscrits par adaptation au scripteur". Rouen, 2002. http://www.theses.fr/2002ROUES007.
This thesis deals with the problem of off-line handwritten text recognition. It describes a text recognition system that exploits an original principle of adaptation to the handwriting to be recognized. The adaptation principle, inspired by contextual effects observed in human readers, is based on automatically learning, during recognition, the graphical characteristics of the handwriting (writer invariants). Word recognition follows an analytical approach based on a segmentation-recognition principle. The on-line adaptation of the recognition system relies on iterating two steps: a word recognition step, which labels the writer's representations (allographs) over the whole text, and a re-evaluation step for the character models. Implementing this adaptation strategy requires an interactive recognition scheme in which processing at various contextual levels can interact. The interaction model retained is based on the multi-agent paradigm.
Bouraoui, Jean-Léon Mehdi. "Analyse, modélisation et détection automatique des disfluences dans le dialogue oral spontané contraint : le cas du contrôle aérien". Phd thesis, Université Paul Sabatier - Toulouse III, 2008. http://tel.archives-ouvertes.fr/tel-00354772.
However, most research on the subject concerns everyday language use: casual conversation, timetable requests, speeches, and so on. But what about spontaneous oral productions in a constrained setting? To our knowledge, no study has been conducted in this context. Yet it is known that using a "specialized language" for a given task leads to specific behaviors.
Our thesis is devoted to the linguistic and computational study of disfluencies in such a setting: air traffic control dialogues, which are subject to pragmatic and linguistic constraints. We carry out an exhaustive study of disfluency phenomena in this context. First, we analyze these phenomena in detail. Next, we model them at an abstract level of representation, which yields the patterns corresponding to the different configurations observed. Finally, we propose a methodology for automatic processing. It consists of several algorithms that identify the different phenomena, even in the absence of explicit markers, and it is integrated into an automatic speech processing system. The methodology is validated on a corpus of 400 utterances.
Yin, Bo. "Language identification with language and feature dependency". University of New South Wales, Electrical Engineering & Telecommunications, 2009. http://handle.unsw.edu.au/1959.4/44045.
Droin, Nicolas. "Paysage et dépaysement dans l’œuvre de Michelangelo Antonioni : de "Blow Up" à "Identification d’une femme"". Thesis, Paris 10, 2012. http://www.theses.fr/2012PA100175/document.
The work of Antonioni is a fertile field for studying landscape in cinematographic art. This study focuses on Antonioni's cinematographic disorientation, from Blow Up (1966) to his return to Italy with Identification of a Woman (1982). My work aims to show the importance of a changing, cineplastic vision of landscape that integrates the questions of editing and motion (of the image and within the image itself), in order to highlight the rhythmic, metamorphic and plastic strengths of the image-landscape in cinema. Having shown these strengths, I examine the question of landscape through the notion of disorientation. Disorientation provides an operative material for thinking about the cinematographic image, its deterritorialisation and its motion. Disorienting landscape in Antonioni's work leads to a dialogue with art history, which implies rethinking the major aesthetic questions of the 20th century (from abstraction to informal art, by way of Land Art and performance) in the context of a cinematographic study. The question of disorientation requires new tools for rethinking landscape in cinema. I suggest the name « inter-landscape » for the constitution of a landscape that integrates what is peculiar to the image in its plastic processes, relying on notions such as interval and inter-images. A cinematographic « inter-landscape », as it can be defined from the work of Antonioni, offers a plastic mobilisation of the image-landscape which in turn allows contemporary artistic practice to be examined.
Hedjem, Louisa. "Jeanne d'Arc en France : construction identitaire et identité nationale". Thesis, Sorbonne Paris Cité, 2017. http://www.theses.fr/2017USPCC046/document.
This thesis reflects on the unprecedented action of the Maid of Orléans, which created the Joan of Arc matrix. In order to understand this intriguing and singular specificity, we hypothesise that the major changes operated in and by the anomaly of the Maid of Orléans can be explained by the adolescent process, the work of a creating adolescent. She is emblematic of the adolescent who finds themselves developing outside of the social framework of adolescence. The language of her transgressive verbal and behavioural actions, which can be viewed as phallic-narcissistic, expresses a masculine femininity in which a combination of identifications and ideals passed on by the parent figures are coalesced and reworked. In a pubescent context, we hypothesise that the voices heard by Joan of Arc are not a sign of psychosis but a sort of double of herself, a transitional intermediary, opening up the development of a relationship with herself and with others; a form of support which authorises and stimulates the creation of a self that is both original and disturbing in medieval times. In the 19th century, Republicans and Nationalists, building a new France on new foundations, established an ideological position through national storytelling which used Joan of Arc's heroic function to further patriotic idealism. A prototype of the patriotic hero, she was to be used as a support and vector of heroic identification and shared ideals. The identification-identity-ideal nexus is deployed in both the individual and the collective: it helps to understand the identity issues, to respond to the individual's need for identity and the desire to repair the trauma and damage inflicted.
Newman, Jacob Laurence. "Language identification using visual features". Thesis, University of East Anglia, 2011. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.539371.
Varnet, Léo. "Identification des indices acoustiques utilisés lors de la compréhension de la parole dégradée". Thesis, Lyon 1, 2015. http://www.theses.fr/2015LYO10221/document.
There is today a broad consensus in the scientific community regarding the involvement of acoustic cues in speech perception. Up to now, however, the precise mechanisms underlying the transformation from a continuous acoustic stream into discrete linguistic units remain largely undetermined. This is partly due to the lack of an effective method for identifying and characterizing the auditory primitives of speech. Since the earliest studies on the acoustic-phonetic interface by the Haskins Laboratories in the 1950s, a number of approaches have been proposed; they are nevertheless inherently limited by the non-naturalness of the stimuli used, the constraints of the experimental apparatus, and the a priori knowledge needed. The present thesis aimed at introducing a new method capitalizing on the speech-in-noise situation for revealing the acoustic cues used by the listeners. As a first step, we adapted the Classification Image technique, developed in the visual domain, to a phoneme categorization task in noise. The technique relies on a Generalized Linear Model to link each participant’s response to the specific configuration of noise, on a trial-by-trial basis, thereby estimating the perceptual weighting of the different time-frequency regions for the decision. We illustrated the effectiveness of our Auditory Classification Image method through 2 examples: a /aba/-/ada/ categorization and a /da/-/ga/ categorization in context /al/ or /aʁ/. Our analysis confirmed that the F2 and F3 onsets were crucial for the tasks, as suggested in previous studies, but also revealed unexpected cues. In a second step, we relied on this new method to compare the results of musical experts (N=19) or dyslexic participants (N=18) to those of controls. This enabled us to explore the specificities of each group’s listening strategies. All the results taken together show that the Auditory Classification Image method may be a more precise and more straightforward approach to investigate the mechanisms at work at the acoustic-phonetic interface.
Koch, Guillaume. "Catégorisation automatique de documents manuscrits : Application aux courriers entrants". Rouen, 2006. http://www.theses.fr/2006ROUES033.
The work described in this document deals with the design, development and evaluation of a system for categorizing unconstrained handwritten documents such as handwritten incoming mail. We demonstrate that a system developed for electronic document categorization can be adapted for handwritten document categorization.
Vaglio, Andrea. "Leveraging lyrics from audio for MIR". Electronic Thesis or Diss., Institut polytechnique de Paris, 2021. http://www.theses.fr/2021IPPAT027.
Lyrics provide a lot of information about music since they encapsulate much of the semantics of songs. Such information could help users navigate easily through a large collection of songs and help recommend new music to them. However, this information is often unavailable in its textual form. To get around this problem, singing voice recognition systems could be used to obtain transcripts directly from the audio. These approaches are generally adapted from speech recognition. Speech transcription is a decades-old domain that has lately seen significant advancements due to developments in machine learning techniques. When applied to the singing voice, however, these algorithms provide poor results. For a number of reasons, the process of lyrics transcription remains difficult. In this thesis, we investigate several scientifically and industrially difficult ’Music Information Retrieval’ problems by utilizing lyrics information generated straight from audio. The emphasis is on making approaches as relevant in real-world settings as possible. This entails testing them on vast and diverse datasets and investigating their scalability. To do so, a huge publicly available annotated lyrics dataset is used, and several state-of-the-art lyrics recognition algorithms are successfully adapted. We notably present, for the first time, a system that detects explicit content directly from audio. The first research on the creation of a multilingual lyrics-to-audio system is also described. The lyrics-to-audio alignment task is further studied in two experiments quantifying the perception of audio and lyrics synchronization. A novel phonotactic method for language identification is also presented. Finally, we provide the first cover song detection algorithm that makes explicit use of lyrics information extracted from audio.
Yang, Xi. "Discriminative acoustic and sequence models for GMM based automatic language identification /". View abstract or full-text, 2007. http://library.ust.hk/cgi/db/thesis.pl?ECED%202007%20YANG.
Conti, Matteo. "Machine Learning Based Programming Language Identification". Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2020. http://amslaurea.unibo.it/20875/.
Munday, Emma Rachel. "Language and identification in contemporary Kazakhstan". Thesis, University of Edinburgh, 2010. http://hdl.handle.net/1842/6200.
Nkadimeng, Calvin. "Language identification using Gaussian mixture models". Thesis, Stellenbosch : University of Stellenbosch, 2010. http://hdl.handle.net/10019.1/4170.
ENGLISH ABSTRACT: The importance of Language Identification for African languages is increasing dramatically due to the development of telecommunication infrastructure and, as a result, an increase in volumes of data and speech traffic in public networks. By automatically processing the raw speech data, the vital assistance given to people in distress can be sped up by referring their calls to a person knowledgeable in that language. To this effect a speech corpus was developed and various algorithms were implemented and tested on raw telephone speech data. These algorithms entailed data preparation, signal processing, and statistical analysis aimed at discriminating between languages. Gaussian Mixture Models (GMMs) were chosen as the statistical model for this research due to their ability to represent an entire language with a single stochastic model that does not require phonetic transcription. Language Identification for African languages using GMMs is feasible, although a few challenges, such as proper classification and accurate study of the relationships between languages, need to be overcome. Other methods that make use of phonetically transcribed data need to be explored and tested with the new corpus for the research to be more rigorous.
AFRIKAANSE OPSOMMING: Die belang van die Taal identifiseer vir Afrika-tale is sien ’n dramatiese toename te danke aan die ontwikkeling van telekommunikasie-infrastruktuur en as gevolg ’n toename in volumes van data en spraak verkeer in die openbaar netwerke.Deur outomaties verwerking van die ruwe toespraak gegee die noodsaaklike hulp verleen aan mense in nood kan word vinniger-up ”, deur te verwys hul oproepe na ’n persoon ingelichte in daardie taal. Tot hierdie effek van ’n toespraak corpus het ontwikkel en die verskillende algoritmes is gemplementeer en getoets op die ruwe telefoon toespraak gegee.Hierdie algoritmes behels die data voorbereiding, seinverwerking, en statistiese analise wat gerig is op onderskei tussen tale.Die statistiese model van Gauss Mengsel Modelle (GGM) was gekies is vir hierdie navorsing as gevolg van hul vermo te verteenwoordig ’n hele taal met’ n enkele stogastiese model wat nodig nie fonetiese tanscription nie. Taal identifiseer vir die Afrikatale gebruik GGM haalbaar is, alhoewel daar enkele paar uitdagings soos behoorlike klassifikasie en akkurate ondersoek na die verhouding van TALE wat moet oorkom moet word.Ander metodes wat gebruik maak van foneties getranskribeerde data nodig om ondersoek te word en getoets word met die nuwe corpus vir die ondersoek te word strenger.
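The idea in the abstract above, one GMM per language with no phonetic transcription, can be illustrated with a minimal sketch. It is not the thesis's implementation: the scalar "features", the synthetic data, and the names `fit_gmm`, `identify`, `synthetic_features`, `lang_a` and `lang_b` are all assumptions made for illustration, and a real system would use multi-dimensional acoustic feature vectors extracted from speech.

```python
import math
import random


def log_gauss(x, mean, var):
    """Log density of a one-dimensional Gaussian."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def logsumexp(values):
    """Numerically stable log(sum(exp(v)))."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))


def component_logs(x, model):
    """Log of weight times density for every mixture component."""
    weights, means, variances = model
    return [math.log(w) + log_gauss(x, m, v)
            for w, m, v in zip(weights, means, variances)]


def fit_gmm(data, k=2, iters=40, seed=0):
    """Fit a k-component 1-D GMM with EM; returns (weights, means, variances)."""
    rng = random.Random(seed)
    mean_all = sum(data) / len(data)
    var_all = max(sum((x - mean_all) ** 2 for x in data) / len(data), 1e-3)
    model = ([1.0 / k] * k, rng.sample(data, k), [var_all] * k)
    for _ in range(iters):
        # E-step: responsibility of each component for each sample.
        resp = []
        for x in data:
            logs = component_logs(x, model)
            norm = logsumexp(logs)
            resp.append([math.exp(v - norm) for v in logs])
        # M-step: re-estimate weights, means and variances.
        weights, means, variances = [], [], []
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-9  # guard against empty components
            mu = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var = sum(r[j] * (x - mu) ** 2 for r, x in zip(resp, data)) / nj
            weights.append(nj / len(data))
            means.append(mu)
            variances.append(max(var, 1e-3))  # variance floor
        model = (weights, means, variances)
    return model


def avg_log_likelihood(data, model):
    """Average per-sample log-likelihood of the data under one GMM."""
    return sum(logsumexp(component_logs(x, model)) for x in data) / len(data)


def identify(data, models):
    """Return the language whose GMM gives the highest average log-likelihood."""
    return max(models, key=lambda lang: avg_log_likelihood(data, models[lang]))


def synthetic_features(centres, n, seed):
    """Hypothetical stand-in for acoustic features: scalars around given centres."""
    rng = random.Random(seed)
    return [rng.gauss(rng.choice(centres), 0.5) for _ in range(n)]


# One model per (hypothetical) language, trained on synthetic data.
models = {
    "lang_a": fit_gmm(synthetic_features([-2.0, 1.0], 300, seed=1)),
    "lang_b": fit_gmm(synthetic_features([3.0, 6.0], 300, seed=2)),
}
utterance = synthetic_features([-2.0, 1.0], 50, seed=3)
print(identify(utterance, models))
```

Scoring by average log-likelihood keeps utterances of different lengths comparable; the variance floor is a common safeguard against components collapsing onto a single sample.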
Avenberg, Anna. "Automatic language identification of short texts". Thesis, Uppsala universitet, Avdelningen för beräkningsvetenskap, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-421032.
Berkling, Kay Margarethe. "Automatic language identification with sequences of language-independent phoneme clusters /". Full text open access at:, 1996. http://content.ohsu.edu/u?/etd,204.
Knudson, Ryan Charles. "Automatic Language Identification for Metadata Records: Measuring the Effectiveness of Various Approaches". Thesis, University of North Texas, 2015. https://digital.library.unt.edu/ark:/67531/metadc801895/.
Foran, Jeffrey (Jeffrey Matthew) 1977. "Missing argument referent identification in natural language". Thesis, Massachusetts Institute of Technology, 1999. http://hdl.handle.net/1721.1/80532.
Includes bibliographical references (p. 54-55).
by Jeffrey Foran.
S.B. and M.Eng.
Gambardella, Maria-Elena. "Cleartext detection and language identification in ciphers". Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-446439.
Williams, A. Lynn, and Carol Stoel-Gammon. "Identification of Speech-language Disorders in Toddlers". Digital Commons @ East Tennessee State University, 2016. https://dc.etsu.edu/etsu-works/2038.
Vindfallet, Vegar Enersen. "Language Identification Based on Detection of Phonetic Characteristics". Thesis, Norges teknisk-naturvitenskapelige universitet, Institutt for elektronikk og telekommunikasjon, 2012. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-19506.