Dissertations on the topic "Texte structuré"
Browse the top 50 dissertations for research on the topic "Texte structuré".
Fan, Huihui. "Text Generation with and without Retrieval." Electronic Thesis or Diss., Université de Lorraine, 2021. http://www.theses.fr/2021LORR0164.
Every day we write, from sending your mother a quick text to drafting a scientific article such as this thesis. The writing we do often goes hand-in-hand with automated assistance. For example, modern instant messaging software often suggests what word to write next, emails can be started with an autocomposer, and essays are improved with machine-suggested edits. These technologies are powered by years of research on text generation, a natural language processing field with the goal of automatically producing fluent, human-readable natural language. At a small scale, text generation systems can generate individual words or sentences, but have wide-reaching applications beyond that. For instance, systems for summarization, dialogue, and even the writing of entire Wikipedia articles are grounded in foundational text generation technology. Producing fluent, accurate, and useful natural language faces numerous challenges. Recent advances in text generation, principally leveraging training neural network architectures on large datasets, have significantly improved the surface-level readability of machine-generated text. However, current systems necessitate improvement along numerous axes, including generation beyond English and writing increasingly longer texts. While the field has seen rapid progress, much research focus has been directed towards the English language, where large-scale training and evaluation datasets for various tasks are readily available. Nevertheless, applications from autocorrect to autocomposition of text should be available universally. After all, by population, the majority of the world does not write in English.
In this work, we create text generation systems for various tasks with the capability of incorporating languages beyond English, either as algorithms that easily extend to new languages or multilingual models encompassing up to 20 languages in one model. Beyond our work in multilingual text generation, we focus on a critical piece of generation systems: knowledge. A prerequisite to writing well is knowing what to write. This concept of knowledge is incredibly important in text generation systems. For example, automatically writing an entire Wikipedia article requires extensive research on that article topic. The instinct to research is often intuitive: decades ago people would have gone to a library, replaced now by the information available on the World Wide Web. However, for automated systems, the question is not only what knowledge to use to generate text, but also how to retrieve that knowledge and best utilize it to achieve the intended communication goal. We face the challenge of retrieval-based text generation. We present several techniques for identifying relevant knowledge at different scales: from local knowledge available in a paragraph to sifting through Wikipedia, and finally identifying the needle-in-the-haystack on the scale of the full web. We describe neural network architectures that can perform large-scale retrieval efficiently, utilizing pre-computation and caching mechanisms. Beyond how to retrieve knowledge, we further investigate the form the knowledge should take, from natural language such as Wikipedia articles or text on the web to structured inputs in the form of knowledge graphs. Finally, we utilize these architectures in novel, much more challenging tasks that push the boundaries of where text generation models work well today: tasks that necessitate knowledge but also require models to produce long, structured natural language output, such as answering complex questions or writing full Wikipedia articles.
Barth, Elaine Maria Luz. "The effects of text structure instruction on EFL reader's understanding of expository texts." Repositório Institucional da UFSC, 1990. https://repositorio.ufsc.br/xmlui/handle/123456789/157653.
Carter-Thomas, Shirley. "Texte et contexte : pour une approche fonctionnelle et empirique." Habilitation à diriger des recherches, Université de la Sorbonne nouvelle - Paris III, 2009. http://tel.archives-ouvertes.fr/tel-00482108.
LUC, CHRISTOPHE. "Representation et composition des structures visuelles et rhetoriques du texte. Approche pour la generation de textes formates." Toulouse 3, 2000. http://www.theses.fr/2000TOU30086.
Haselton, Curt B., and Gregory G. Deierlein. "Assessing seismic collapse safety of modern reinforced concrete moment-frame buildings." Berkeley, Calif.: Pacific Earthquake Engineering Research Center, 2008. http://nisee.berkeley.edu/elibrary/Text/200803261.
Saint-Germain, Isabelle. "Le passage de l'article scientifique au texte vulgarisé : analyse de la structure, du contenu et de la rhétorique des textes." Mémoire, Université de Sherbrooke, 2004. http://savoirs.usherbrooke.ca/handle/11143/2361.
Saint-Germain, Isabelle. "Le passage de l'article scientifique au texte vulgarisé : analyse de la structure, du contenu et de la rhétorique des textes." Sherbrooke : Université de Sherbrooke, 2004.
Hsaio-Hui, Wu. "The effects of text structure on comprehending expository texts by EFL vocational university students in Taiwan." Thesis, Queen's University Belfast, 2016. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.707231.
Fell, Michael. "Traitement automatique des langues pour la recherche d'information musicale : analyse profonde de la structure et du contenu des paroles de chansons." Thesis, Université Côte d'Azur, 2020. http://www.theses.fr/2020COAZ4017.
Applications in Music Information Retrieval and Computational Musicology have traditionally relied on features extracted from the music content in the form of audio, but mostly ignored the song lyrics. More recently, improvements in fields such as music recommendation have been made by taking into account external metadata related to the song. In this thesis, we argue that extracting knowledge from the song lyrics is the next step to improve the user's experience when interacting with music. To extract knowledge from vast amounts of song lyrics, we show for different textual aspects (their structure, content and perception) how Natural Language Processing methods can be adapted and successfully applied to lyrics. For the structural aspect of lyrics, we derive a structural description by introducing a model that efficiently segments the lyrics into their characteristic parts (e.g. intro, verse, chorus). In a second stage, we represent the content of lyrics by summarizing them in a way that respects the characteristic lyrics structure. Finally, on the perception of lyrics, we investigate the problem of detecting explicit content in a song text. This task proves to be very hard, and we show that the difficulty partially arises from the subjective nature of perceiving lyrics in one way or another depending on the context. Furthermore, we touch on another problem of lyrics perception by presenting our preliminary results on Emotion Recognition. As a result, during the course of this thesis we have created the annotated WASABI Song Corpus, a dataset of two million songs with NLP lyrics annotations on various levels.
O'Malley, Claire E. "Structure and access : the role of structural factors in text comprehension and information." Thesis, University of Leeds, 1985. http://etheses.whiterose.ac.uk/322/.
Dzunic, Zoran. "Text structure-aware classification." Thesis, Massachusetts Institute of Technology, 2009. http://hdl.handle.net/1721.1/53315.
Includes bibliographical references (p. 73-76).
Bag-of-words representations are used in many NLP applications, such as text classification and sentiment analysis. These representations ignore relations across different sentences in a text and disregard the underlying structure of documents. In this work, we present a method for text classification that takes into account document structure and only considers segments that contain information relevant for a classification task. In contrast to the previous work, which assumes that relevance annotation is given, we perform the relevance prediction in an unsupervised fashion. We develop a Conditional Bayesian Network model that incorporates relevance as a hidden variable of a target classifier. Relevance and label predictions are performed jointly, optimizing the relevance component for the best result of the target classifier. Our work demonstrates that incorporating structural information in document analysis yields significant performance gains over bag-of-words approaches on some NLP tasks.
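The limitation of bag-of-words representations noted in this abstract can be illustrated with a minimal sketch. The tokenizer, the function name `bag_of_words`, and the two example reviews are assumptions made for illustration only; they are not taken from the thesis, and the sketch does not reproduce its Conditional Bayesian Network model.

```python
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lower-case the text, keep runs of letters as tokens, and count them.

    Hypothetical helper: word order and sentence boundaries are discarded,
    which is exactly the information a bag-of-words model ignores.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens)


# Two invented reviews with opposite meanings but the same words.
review_a = "The plot was good. The acting was not bad."
review_b = "The plot was bad. The acting was not good."

bow_a = bag_of_words(review_a)
bow_b = bag_of_words(review_b)

# Both reviews collapse to the same word counts.
identical = bow_a == bow_b
print(identical)
```

Because the two bags are equal, any classifier that sees only these counts cannot tell the reviews apart, which is the motivation for using document structure.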
S.M.
Ouentchist, Dogny Elysee. "Rôles et fonctionnement de structures signifiantes dans la modalisation de l'affichage à Abidjan." Thesis, Limoges, 2019. http://www.theses.fr/2019LIMO0056.
In this thesis, we analyse the "stage" situation and the "strategy" situation of the poster in the city of Abidjan. We show how different "meaningful structures" (support, space and time, and communication situations) interact with the icono-text in poster predication. The main hypothesis assumes a double manipulation that "puts into play" the poster and the city. We show, through semiotic and pragmatic theories, that on the one hand the poster in the city lives or survives by capturing and expressing the needs of the population. On the other hand, the city exercises political control over the poster, allowing or forbidding certain spaces. The interactions between the city and the poster make it possible to analyse the Ivorian social context. Indeed, the posters in Abidjan show diversity in many forms: diversity of enunciators, of speech, of media, of proposals, of senders, and of display spaces. Moreover, the intensity of the posters reveals an anarchy that exposes deep social tensions: the hegemony of "small trades" and their sophistication, obscenity in urban writing, and economic extroversion. It also raises the issue of secularism.
McDonald, Daniel Merrill. "Combining Text Structure and Meaning to Support Text Mining." Diss., The University of Arizona, 2006. http://hdl.handle.net/10150/194015.
Clément, Julien. "Algorithmes, mots et textes aléatoires." Habilitation à diriger des recherches, Université de Caen, 2011. http://tel.archives-ouvertes.fr/tel-00913127.
Salson, Mikaël. "Structures d'indexation compressées et dynamiques pour le texte." Rouen, 2010. http://www.theses.fr/2010ROUES042.
Hernandez, Nicolas. "Description et détection automatique de structures de texte." Paris 11, 2004. http://www.theses.fr/2004PA112329.
Information Retrieval (IR) systems are not well adapted for text browsing and visualization (dynamic summarization), yet users always need these to evaluate the relevance of a document. Our work follows a Semantic Web perspective. We aim at annotating documents with abstract information about content description and discourse organization in order to give IR systems more capabilities. Descriptive information concerns both topic identification and the semantic and rhetorical classification of text extracts (with information such as "Our aim is...", "This paper deals with..."). We implement a system to identify topical linguistic expressions based on a robust anaphora system and lexical chain building. We also propose a method to automatically acquire meta-discursive material. We detect the text structure using two complementary approaches. The first offers a top-down analysis based on the segmentation provided by lexical cohesion and by linguistic markers such as frame introducers. The second concerns local text organization, detecting informational relations (coordination and subordination) between subsequent sentences.
Tirkkonen-Condit, Sonja. "Argumentative text structure and translation." Jyväskylä : University of Jyväskylä, 1985. http://catalog.hathitrust.org/api/volumes/oclc/13332106.html.
Lemos, Carolina Lindenberg. "Condições semióticas da repetição." Universidade de São Paulo, 2015. http://www.teses.usp.br/teses/disponiveis/8/8139/tde-09062015-111352/.
Emanating from different areas of the human sciences, repetition was given a central role in this thesis of semiotic inclination. It is a widespread phenomenon in all fields of human activity and, particularly, in texts. The optional character of certain repetitions brings about the problem of its function, since, in certain cases, it seems to act directly on the rhythm of the content and the flow of entrances and exits of the phenomenal field. This regulation of the rhythm divides the research into two fronts. On the one hand, the rhythmic effect points to an underlying structure. In that sense, one can ask: what is the configuration of such structure? In what way is repetition part of it? Or even, what is its place in the semiotic model? On the other hand, repetition seems to involve a certain degree of contradiction: in what way can a phenomenon that brings no novelty, only the resumption of the same, sometimes create an effect of tension or surprise? In order to answer these questions, we undertake the revision of the role of repetition in neighboring fields: rhetoric and a specific trend in linguistics. This discussion has allowed us to detect a few insufficiencies in these approaches that may be answered by semiotics. From the semiotic perspective, we have explored the place occupied by repetition, by opposing it to concepts such as identification, text, language and to the notion of semiotics itself. Once the position of repetition in the text is established, we move on to note and discuss the textual conditions necessary to the occurrence of relevant repetitions. In addition to identification, the notion of salience, based on the opposition between figure and ground, revealed itself to be central to the explanation of the phenomenon. Finally, linearity has also proven relevant, which allowed us to re-discuss its theoretical status as one possible manifestation of the underlying syntagmatic structure. 
Having outlined the conditions for repetition, we started an investigation into the somewhat contradictory effects we had observed in repetitive incidents. We saw that repetition belongs to the order of the extent: it is counted, not measured, and as such it is a tool for manifesting the rhythm of the content that it presupposes. In these terms, repetition is subordinated to the intensive sub-dimensions: tempo and tonicity. To ensure the relevance of our arguments, we studied repetition within some selected objects, where it is made to serve the structuring of the text. Finally, the analysis of these objects shed light on the relations between repetition and the concept of aspect, and three styles of textual progression related to repetition were confirmed: circular, linear and spiraling. This path of investigation has shown us the terms to which repetition is tied and the way in which it manifests an underlying structure. It has also revealed that such structure not only explains but also generates the variations in rhythm and tempo that are felt through repetition. The apparent contradictions of the effects of repetition are explained by the very epistemological bases of the field. The analytical and relational aspects of semiotics are the basis for repetitive construction, which, without adding any new information, may lead the enunciatee to tension, climax and surprise.
NUNES, IAN MONTEIRO. "CLUSTERING TEXT STRUCTURED DATA BASED ON TEXT SIMILARITY." PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO, 2008. http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=25796@1.
COORDENAÇÃO DE APERFEIÇOAMENTO DO PESSOAL DE ENSINO SUPERIOR
PROGRAMA DE EXCELENCIA ACADEMICA
This work reports the results of a set of text clustering experiments in which a wide variety of models and algorithms were applied. The goal of the experiments is to determine the best approaches for processing the large volumes of information generated by the growing demand for data quality in many sectors of the economy. Deduplication was accelerated by dividing the data sets into subsets of similar items. In the best case, each subset contains all duplicate occurrences of each record, which brings the error in forming each group to zero. Nevertheless, an intrinsic tolerance of 5 percent after clustering was established. The experiments show that processing time is significantly lower and that accuracy reaches up to 98.92 percent. The best trade-off between accuracy and performance is obtained by applying the K-Means algorithm with a trigram-based model.
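As a rough illustration of what a trigram-based model of a record can look like, the sketch below builds character-trigram count vectors and compares them with cosine similarity. This is a deliberately simpler stand-in and not the K-Means pipeline of the thesis; the padding scheme, the function names `trigram_profile` and `cosine_similarity`, and the sample records are all assumptions made for illustration.

```python
from collections import Counter
from math import sqrt


def trigram_profile(record: str) -> Counter:
    """Count the character trigrams of a normalized, space-padded record."""
    padded = f"  {record.lower().strip()} "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two sparse trigram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Invented records: a likely duplicate with a typo, and an unrelated name.
original = trigram_profile("Maria Silva")
typo = trigram_profile("Maria Sliva")
other = trigram_profile("John Smith")

sim_typo = cosine_similarity(original, typo)
sim_other = cosine_similarity(original, other)
print(sim_typo > sim_other)
```

Vectors of this kind could then be handed to a clustering algorithm so that near-duplicates land in the same subset and only records within a subset need pairwise comparison.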
Wylie, Judith W. "Effects of prior knowledge and text structure on text memory." Thesis, Queen's University Belfast, 1993. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.359132.
Fuchs, Juliana Thiesen. "Rhetorical Structure Theory: limites e possibilidades de representação da organização textual." Universidade do Vale do Rio do Sinos, 2009. http://www.repositorio.jesuita.org.br/handle/UNISINOS/2569.
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
In this master's thesis, I aim to show the contribution of certain conceptions of textual organization to the representation of text carried out by the model of Rhetorical Structure Theory (RST; Mann and Thompson, 1988). RST is a theory that explains text structure through a model of relations that hold recursively between parts of the text labeled by the analyst as nuclei and satellites. However, although it covers relational rhetorical coherence, RST, as a theory, does not deal with other conceptions that account for the complex process of textual organization. Thus, as a model, it represents the text in a limited way. In this work, I investigate the possibility of associating RST with certain conceptions of textual organization, such as the relationship between text and context and the strategic top-down process of text construction. To do so, I carry out an investigation in two parts: a theoretical one and an analytical one. In the theoretical part, I present a theoretical framework that supports the conceptions of …
Lyra, Risto Matti Juhani. "Topical subcategory structure in text classification." Thesis, University of Sussex, 2019. http://sro.sussex.ac.uk/id/eprint/81340/.
Lafourcade, Mathieu. "Lexique et analyse sémantique de textes - structures, acquisitions, calculs, et jeux de mots." Habilitation à diriger des recherches, Université Montpellier II - Sciences et Techniques du Languedoc, 2011. http://tel.archives-ouvertes.fr/tel-00649851.
Lau, Lai Lai Cubie. "The argument structure of fund-raising texts." HKBU Institutional Repository, 2001. http://repository.hkbu.edu.hk/etd_ra/385.
Hassan, Jawad. "Structured Text Compiler Targeting XML." Thesis, Växjö University, School of Mathematics and Systems Engineering, 2010. http://urn.kb.se/resolve?urn=urn:nbn:se:lnu:diva-6441.
Young-Lai, Matthew. "Text structure recognition using a region algebra." Thesis, National Library of Canada = Bibliothèque nationale du Canada, 2001. http://www.collectionscanada.ca/obj/s4/f2/dsk3/ftp04/NQ60576.pdf.
Bisson, Marie. "Une édition numérique structurée à l’aide de la Text Encoding Initiative des textes montois de dom Thomas Le Roy : établissement critique des textes, recherches sur les sources, présentation littéraire et historique." Caen, 2015. http://www.theses.fr/2015CAEN1029.
The aim of this thesis is to establish a critical edition of the investigations of dom Thomas Le Roy into the history of Mont Saint-Michel. In the 17th century, this Maurist monk collectively wrote many versions of his work. In the 19th century, Eugène de Robillard de Beaurepaire produced the first scientific edition of this work, but relied principally on one main manuscript and omitted many passages. We therefore propose a critical edition of the three texts by dom Thomas Le Roy on the Mont's history that we possess: a long, chronological version (Caen BM, Mancel 195); a brief history of the abbey of Mont Saint-Michel (Paris BNF, Latin 13818); and a 228-page version on the topic (Paris BNF, Français 18950). We have compared the texts and tried to identify the sources the Maurist drew on to write them. Rendering the authority of each part of the text, comparing the different versions, and analysing the rewriting of the sources, we have chosen to publish the result in two forms: paper and electronic. Owing to the limited time of the thesis, the scientific annotation of the longest manuscript could not be completed; that text is therefore published electronically rather than on paper. The electronic edition is encoded in XML (eXtensible Markup Language) following the TEI (Text Encoding Initiative), a set of guidelines currently used with great success to describe the content and structure of written documents in the humanities. The introduction to this thesis gives a literary, historical and methodological presentation of our work.
Eler, Marcelo Medeiros. "Uso da técnica de teste estrutural para o teste e monitoração de serviços." Universidade de São Paulo, 2012. http://www.teses.usp.br/teses/disponiveis/55/55134/tde-04092012-141341/.
Service-oriented computing aims at developing software by the composition of services. It promotes software reuse and the implementation of dynamic, flexible and loosely coupled applications. Services provide specific business functionalities and are delivered as black boxes. Using services is only possible if the developers of service applications (integrators) trust the third-party services. Testing, in particular, is one way to gain confidence in third-party software. However, testers can only use specification-based testing techniques because the source code is unavailable. In this context, testers cannot benefit from combining specification-based and implementation-based testing techniques. This work proposes an approach that introduces the structural testing technique into the context of service-based applications without revealing the source code. The proposed approach promotes the development of testable services, which are services with high testability that expose operations through a testing interface to support structural testing. Integrators can test testable services and obtain, without access to the source code, a coverage analysis on structural criteria. Test metadata are also provided along with testable services to help integrators create more test cases and increase the coverage obtained. The proposed approach is also used to support monitoring activities. The approach is generic, and an instantiation is presented to create testable services written in Java. Formal experiments and case studies were conducted to validate the proposed approach and the instantiation. The results provide evidence of the applicability and benefits of the approach for both testing and monitoring activities compared with using only the functional approach.
Karaouza, Efthymia. "Cohesion and text structure in Attic Greek prose." Thesis, University of Birmingham, 2007. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.442640.
Bouayad-Agha, Nadjet. "The role of document structure in text generation." Thesis, University of Brighton, 2001. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.366234.
Fanning, David John. "Schoenberg's monodrama 'Erwartung': text, structure and musical language." Online version, 1985. http://ethos.bl.uk/OrderDetails.do?did=1&uin=uk.bl.ethos.353718.
Mota, Filho Antonio. "Text structure and brazilian university student's writing proficiency." Repositório Institucional da UFSC, 1989. https://repositorio.ufsc.br/xmlui/handle/123456789/157596.
Empirical research has demonstrated the importance of rhetorical organization in the comprehension and production of expository (or narrative) texts. The basic idea is that the rhetorical organization underlying a given text interacts with the reader's formal schema (their prior knowledge of and experience with rhetorical organization), influencing the comprehension and production of texts.
Holsgrove, John V. "Structure strategy use in children's comprehension of expository texts." Thesis, Edith Cowan University, Research Online, Perth, Western Australia, 2011. https://ro.ecu.edu.au/theses/398.
Silva, Patricia Andrade da. "Mapas e redes conceituais: uma proposta metodológica para a sua construção a partir de textos." Universidade de São Paulo, 2012. http://www.teses.usp.br/teses/disponiveis/81/81132/tde-20092016-105920/.
Having students write essays in response to essay questions seems to be the most conventional way of assessing their knowledge in the school context. An essay produced by a student approximately reflects his or her knowledge structure in a given domain. Reading and analysing essays takes considerable time in a teacher's or researcher's routine, and the task is harder still when the aim is to identify the most important ideas about a topic across a whole group of students. The main objective of this research is to develop a methodology that uses computational tools to transform written essays into graphic structures such as concept maps and network graphs. This methodology could be useful not only for research on teaching but also for teaching practice, since the final product of its application may support inferences about the knowledge structure of a group of students. The investigation was based on data collected in two distinct undergraduate courses at IQ-USP. The data are the written essays of 42 students in response to an essay question that supplied some pre-established concepts. Using the students' responses, tests were performed (i) with two software packages for text analysis, in order to quantify the relationships between concepts; (ii) to investigate how the quantification of relationships between concepts differs between the original text and the propositions extracted from it; and (iii) to obtain different types of graphic structures. The tests showed that Hamlet® is a more efficient and convenient program than ALA-Reader® for the objectives of this research. Furthermore, the array generated by Hamlet® to quantify the relationships between concepts depends essentially on the structure of the essay, whether the original text or the modified text.
The three types of graphic structures that were built have different focuses, but they may be considered complementary. The (V+P) network graphs are interesting representations that focus on the pre-established concepts provided in the essay questions. Network graphs based on a cut-off percentage are more useful for investigations that aim to highlight the aspects of a subject that students consider most relevant. The concept maps constructed in this work are extremely valuable representations for approximating the knowledge structure of student groups, since they make clear the nature of the propositional relationships between concepts. Concept maps built from the original texts and from the modified texts proved to be highly similar to each other. This similarity suggests that using Hamlet® to obtain arrays that quantify the relationships between concepts in an original text is more effective than the manual, time-consuming process of extracting propositions from the original text to obtain an array.
Dorante, Alessandra. "Investigação de processo de conversão automática de textos estruturados para hiperdocumentos." Universidade de São Paulo, 1997. http://www.teses.usp.br/teses/disponiveis/76/76132/tde-15092010-164303/.
This work investigates the automatic conversion of structured texts into hyperdocuments. It analyses the advantages and disadvantages of such an automatic process and details the steps involved in the conversion. As one of its results, it proposes an automatic conversion process based on formal definitions of document structure and citations. The application domain is Brazilian statutory norms. Another contribution of this work is a tool called WebifyLaw, which implements the automatic conversion process for the chosen domain. The tool was applied to the Brazilian Constitution and the Civil Code, among 42 other norms. The results obtained with this application are also presented and discussed.
Guerdoud, Mohand. "Acquisition de connaissances a partir de textes structures." Paris 6, 1997. http://www.theses.fr/1997PA066365.
Davis, Marcia H. "Effects of text markers and familiarity on component structures of text-based representations." College Park, Md. : University of Maryland, 2006. http://hdl.handle.net/1903/4086.
Thesis research directed by: Human Development. Title from t.p. of PDF. Includes bibliographical references. Published by UMI Dissertation Services, Ann Arbor, Mich. Also available in paper.
Lemarié, Julie. "La compréhension des textes visuellement structurés : le cas des énumérations." Toulouse 2, 2006. http://www.theses.fr/2006TOU20041.
Our research deals with the influence of visual signals (headings, enumerations, etc.) on text comprehension. Our general claim is that text comprehension is not restricted to interpreting the propositional content of a text but also involves interpreting its visual properties. This assumption aims to enrich existing comprehension models: we shed light on processes involved in comprehension that are specific to the cognitive processing of written texts. To test this assumption, we evaluate the contribution of the Textual Architecture Model to the study of text comprehension. This model offers means to analyse the semantic scope of visual signals in text. We investigate several assumptions derived from the model. Results indicate that texts with the same propositional content but different visual signaling devices give rise to different interpretations.
Ågren, Ola. "Finding, extracting and exploiting structure in text and hypertext /." Umeå, 2009. http://opac.nebis.ch/cgi-bin/showAbstract.pl?u20=9789172647992.
Ågren, Ola. "Finding, extracting and exploiting structure in text and hypertext." Doctoral thesis, Umeå universitet, Institutionen för datavetenskap, 2009. http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-22352.
Data mining (the English term is often used in Swedish as well) is a research area that is constantly evolving. It is about using computers to find patterns in large amounts of data, or alternatively to predict future data from data that is already available. Since more and more data is produced every year, this places ever higher demands on the efficiency of the algorithms used to find or use the information within a reasonable time. This thesis is about extracting information from semi-structured data, finding structures in large discrete data sets, and efficiently ranking web pages from a topic-based perspective. The information extraction described concerns support for keeping documentation and source code up to date at the same time. Our solution to this problem is to let parts of the documentation (mainly the algorithm description) reside as block comments in the source code and to extract them automatically with a tool. The structures found by our structure extraction algorithms take the form of subsumptions, for example that a certain keyword is more general than another. Because subsumptions are transitive, these relationships can be used to create larger structures in the form of hierarchies or directed graphs. The tool we have developed has mainly been used to create input data for a data mining system and to visualize that input data. The main part of the research described in this thesis, however, has been about ranking web pages based on both their content and the links between them. We have created a number of algorithms and shown how they behave in comparison with other algorithms in use today. These comparisons have mainly concerned convergence rate, the stability of the algorithms given uncertain data, and finally how relevant the algorithms' answer sets have been considered to be from the users' perspective.
The research has focused on efficient algorithms for retrieving and handling large data sets of discrete or text-based data. In the thesis we also present a proposal for a system of tools that work together on a database consisting of “fingerprints” and other metadata about the items indexed in the database. This data can then be used by various algorithms to extend the value of what is in the database or to find the right information efficiently.
AlgExt, CHiC, ProT
Dunning, Ted Emerson. "Finding structure in text, genome and other symbolic sequences." Thesis, University of Sheffield, 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.310811.
Oakley, Angela L. "Typesetting of integrated scientific text and chemical structure diagrams." Thesis, University of Portsmouth, 1989. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.237873.
Hirose, Koji. "Effects of text structure instruction on Japanese EFL students." Thesis, University of Leicester, 2014. http://hdl.handle.net/2381/28619.
Christensen, Jamie Lynn. "Enhancing Students' Science Content Knowledge Through Text Structure Awareness." Diss., 2008. http://contentdm.lib.byu.edu/ETD/image/etd2564.pdf.
Thomas, Karen. "Deepening Understanding of Science Content Through Text Structure Instruction." Diss., 2009. http://contentdm.lib.byu.edu/ETD/image/etd3075.pdf.
Eisenberg, Joshua Daniel. "Automatic Extraction of Narrative Structure from Long Form Text." FIU Digital Commons, 2018. https://digitalcommons.fiu.edu/etd/3912.
Van, Blommestein Erane. "Production factors for written expository texts." Thesis, University of British Columbia, 1991. http://hdl.handle.net/2429/30415.
Faculty of Medicine; School of Audiology and Speech Sciences; Graduate.
Kou, Huaizhong. "Génération d'adaptateurs web intelligents à l'aide de techniques de fouilles de texte." Versailles-St Quentin en Yvelines, 2003. http://www.theses.fr/2003VERS0011.
This thesis defines a system framework for semantically integrating Web information, called SEWISE. It can integrate text information from various Web sources belonging to an application domain into a common domain-specific concept ontology. In SEWISE, Web wrappers are built around different Web sites to automatically extract information of interest from them. Text mining technologies are then used to discover the semantics of the Web documents. SEWISE can ease topic-oriented information searches over the Web. Three problems related to document categorization are studied. First, we investigate approaches to feature selection and propose two approaches, CBA and IBA, to select features. Second, a mathematical model is proposed to estimate statistical term associations and integrate them into the document similarity model. Finally, the category score calculation algorithms used by k-NN classifiers are studied, and two weighted algorithms, CBW and IBW, are proposed to calculate the category score.
Forsyth, Richard. "Stylistic structures : a computational approach to text classification." Thesis, University of Nottingham, 1996. http://eprints.nottingham.ac.uk/13445/.
Ju, Qi. "Large-scale Structural Reranking for Hierarchical Text Categorization." Doctoral thesis, Università degli studi di Trento, 2013. https://hdl.handle.net/11572/369177.