To see the other types of publications on this topic, follow the link: Text-based.

Dissertations / Theses on the topic 'Text-based'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 dissertations / theses for your research on the topic 'Text-based.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.

1

SOARES, FABIO DE AZEVEDO. "AUTOMATIC TEXT CATEGORIZATION BASED ON TEXT MINING." PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO, 2013. http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=23213@1.

Full text
Abstract:
PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO<br>CONSELHO NACIONAL DE DESENVOLVIMENTO CIENTÍFICO E TECNOLÓGICO<br>A Categorização de Documentos, uma das tarefas desempenhadas em Mineração de Textos, pode ser descrita como a obtenção de uma função que seja capaz de atribuir a um documento uma categoria a que ele pertença. O principal objetivo de se construir uma taxonomia de documentos é tornar mais fácil a obtenção de informação relevante. Porém, a implementação e a execução de um processo de Categorização de Documentos não é uma tarefa trivial: as ferramentas de Mineração de Textos estão
APA, Harvard, Vancouver, ISO, and other styles
2

NUNES, IAN MONTEIRO. "CLUSTERING TEXT STRUCTURED DATA BASED ON TEXT SIMILARITY." PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO, 2008. http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=25796@1.

Full text
Abstract:
PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO<br>COORDENAÇÃO DE APERFEIÇOAMENTO DO PESSOAL DE ENSINO SUPERIOR<br>PROGRAMA DE EXCELENCIA ACADEMICA<br>O presente trabalho apresenta os resultados que obtivemos com a aplicação de grande número de modelos e algoritmos em um determinado conjunto de experimentos de agrupamento de texto. O objetivo de tais testes é determinar quais são as melhores abordagens para processar as grandes massas de informação geradas pelas crescentes demandas de data quality em diversos setores da economia. O processo de deduplicação foi acelerado pela divisão dos con
APA, Harvard, Vancouver, ISO, and other styles
3

Biedert, Ralf [Verfasser]. "Gaze-Based Human-Text Interaction/Text 2.0 / Ralf Biedert." München : Verlag Dr. Hut, 2014. http://d-nb.info/1050331605/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Prabowo, Rudy. "Ontology-based automatic text classification." Thesis, University of Wolverhampton, 2005. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.418665.

Full text
Abstract:
This research investigates to what extent ontologies can be used to achieve an accurate classification performance of an automatic text classifier, called the Automatic Classification Engine (ACE). The task of the classifier is to classify Web pages with respect to the Dewey Decimal Classification (DOC) and Library of Congress Classification (LCC) schemes. In particular, this research focuses on how to 1. build a set of ontologies which can provide a mechanism to enable machine reasoning; 2. define the mappings between the ontologies and the two classification schemes; 3. implement an ontology
APA, Harvard, Vancouver, ISO, and other styles
5

Lu, Su. "DCT coefficient based text detection." Access to citation, abstract and download form provided by ProQuest Information and Learning Company; downloadable PDF file, 57 p, 2008. http://proquest.umi.com/pqdweb?did=1605147371&sid=4&Fmt=2&clientId=8331&RQT=309&VName=PQD.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Couairon, Guillaume. "Text-Based Semantic Image Editing." Electronic Thesis or Diss., Sorbonne université, 2023. http://www.theses.fr/2023SORUS248.

Full text
Abstract:
L’objectif de cette thèse est de proposer des algorithmes pour la tâche d’édition d’images basée sur le texte (TIE), qui consiste à éditer des images numériques selon une instruction formulée en langage naturel. Par exemple, étant donné une image d’un chien et la requête "Changez le chien en un chat", nous voulons produire une nouvelle image où le chien a été remplacé par un chat, en gardant tous les autres aspects de l’image inchangés (couleur et pose de l’animal, arrière- plan). L’objectif de l’étoile du nord est de permettre à tout un chacun de modifier ses images en util
APA, Harvard, Vancouver, ISO, and other styles
7

Gottlieb, Michael. "Text based methods for variant prioritization." Thesis, University of British Columbia, 2017. http://hdl.handle.net/2429/60358.

Full text
Abstract:
Despite improvements in sequencing technologies, DNA sequence variant interpretation for rare genetic diseases remains challenging. In a typical workflow for the Treatable Intellectual Disability Endeavor in B.C. (TIDE BC), a geneticist examines variant calls to establish a set of candidate variants that explain a patient's phenotype. Even with a sophisticated computation pipeline for variant prioritization, they may need to consider hundreds of variants. This typically involves literature searches on individual variants to determine how well they explain the reported phenotype, which is a tim
APA, Harvard, Vancouver, ISO, and other styles
8

Zhang, Xuan. "Hardware-based text-to-braille translation." Thesis, Curtin University, 2007. http://hdl.handle.net/20.500.11937/1351.

Full text
Abstract:
Braille, as a special written method of communication for the blind, has been globally accepted for years. It gives blind people another chance to learn and communicate more efficiently with the rest of the world. It also makes possible the translation of printed languages into a written language which is recognisable for blind people. Recently, Braille is experiencing a decreasing popularity due to the use of alternative technologies, like speech synthesis. However, as a form of literacy, Braille is still playing a significant role in the education of people with visual impairments. With the
APA, Harvard, Vancouver, ISO, and other styles
9

Zhang, Xuan. "Hardware-based text-to-braille translation." Curtin University of Technology, Department of Computer Engineering, 2007. http://espace.library.curtin.edu.au:80/R/?func=dbin-jump-full&object_id=17220.

Full text
Abstract:
Braille, as a special written method of communication for the blind, has been globally accepted for years. It gives blind people another chance to learn and communicate more efficiently with the rest of the world. It also makes possible the translation of printed languages into a written language which is recognisable for blind people. Recently, Braille is experiencing a decreasing popularity due to the use of alternative technologies, like speech synthesis. However, as a form of literacy, Braille is still playing a significant role in the education of people with visual impairments. With the
APA, Harvard, Vancouver, ISO, and other styles
10

Liljeström, Monica. "Learning text talk online : Collaborative learning in asynchronous text based discussion forums." Doctoral thesis, Umeå universitet, Pedagogiska institutionen, 2010. http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-34199.

Full text
Abstract:
The desire to translate constructivist and sociocultural approaches to learning in specific learning activities is evident in most forms of training at current, not least in online education. Teachers worldwide are struggling with questions of how to create conditions in this fairly new realm of education for learners to contribute to the development of a good quality in their own and others' learning. Collaboration in forms of text talk in asynchronous, text based forums (ADF) is often used so students can participate at the location and time that suits them best given the other aspects of th
APA, Harvard, Vancouver, ISO, and other styles
11

Davis, Marcia H. "Effects of text markers and familiarity on component structures of text-based representations." College Park, Md. : University of Maryland, 2006. http://hdl.handle.net/1903/4086.

Full text
Abstract:
Thesis (Ph. D.) -- University of Maryland, College Park, 2006.<br>Thesis research directed by: Human Development. Title from t.p. of PDF. Includes bibliographical references. Published by UMI Dissertation Services, Ann Arbor, Mich. Also available in paper.
APA, Harvard, Vancouver, ISO, and other styles
12

Zhang, Nan. "TRANSFORM BASED AND SEARCH AWARE TEXT COMPRESSION SCHEMES AND COMPRESSED DOMAIN TEXT RETRIEVAL." Doctoral diss., University of Central Florida, 2005. http://digital.library.ucf.edu/cdm/ref/collection/ETD/id/3938.

Full text
Abstract:
In recent times, we have witnessed an unprecedented growth of textual information via the Internet, digital libraries and archival text in many applications. While a good fraction of this information is of transient interest, useful information of archival value will continue to accumulate. We need ways to manage, organize and transport this data from one point to the other on data communications links with limited bandwidth. We must also have means to speedily find the information we need from this huge mass of data. Sometimes, a single site may also contain large collections of data such as
APA, Harvard, Vancouver, ISO, and other styles
13

Mick, Alan A. "Knowledge based text indexing and retrieval utilizing case based reasoning /." Online version of thesis, 1994. http://hdl.handle.net/1850/11715.

Full text
APA, Harvard, Vancouver, ISO, and other styles
14

Branchetti, Simone. "Color Watermarking Techniques for Text-based Media." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2020. http://amslaurea.unibo.it/22170/.

Full text
Abstract:
The main focus in this thesis is text-based watermarking: that means embedding a secter payload of bits into a text, while keeping the text as unaltered as possible to make it harder to identifiy that a watermark happened at all. Using two Google Workplace add-ons we explore the efficacy, the ease of use and the portability of three structural watermarking techniques: Homoglyph-based watermarking, Space coloring-based watermarking and Grayscale based Watermarking. The latter two are techniques developed specifically for this thesis. Another important focus is the use of the Digital Object Iden
APA, Harvard, Vancouver, ISO, and other styles
15

Krishnan, Sharenya. "Text-Based Information Retrieval Using Relevance Feedback." Thesis, KTH, Skolan för informations- och kommunikationsteknik (ICT), 2011. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-53603.

Full text
Abstract:
Europeana, a freely accessible digital library with an idea to make Europe's cultural and scientific heritage available to the public was founded by the European Commission in 2008. The goal was to deliver a semantically enriched digital content with multilingual access to it. Even though they managed to increase the content of data they slowly faced the problem of retrieving information in an unstructured form. So to complement the Europeana portal services, ASSETS (Advanced Search Service and Enhanced Technological Solutions) was introduced with services that sought to improve the usability
APA, Harvard, Vancouver, ISO, and other styles
16

Vassiliou, Andrew. "Analysing film content : a text-based approach." Thesis, University of Surrey, 2006. http://epubs.surrey.ac.uk/2244/.

Full text
APA, Harvard, Vancouver, ISO, and other styles
17

Westmacott, Mike. "Content based image retrieval : analogies with text." Thesis, University of Southampton, 2004. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.423038.

Full text
APA, Harvard, Vancouver, ISO, and other styles
18

Wu, Yingyu. "Using Text based Visualization in Data Analysis." Kent State University / OhioLINK, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=kent1398079502.

Full text
APA, Harvard, Vancouver, ISO, and other styles
19

Wang, Xutao. "Chinese Text Classification Based On Deep Learning." Thesis, Mittuniversitetet, Avdelningen för informationssystem och -teknologi, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-35322.

Full text
Abstract:
Text classification has always been a concern in area of natural language processing, especially nowadays the data are getting massive due to the development of internet. Recurrent neural network (RNN) is one of the most popular method for natural language processing due to its recurrent architecture which give it ability to process serialized information. In the meanwhile, Convolutional neural network (CNN) has shown its ability to extract features from visual imagery. This paper combine the advantages of RNN and CNN and proposed a model called BLSTM-C for Chinese text classification. BLSTM-C
APA, Harvard, Vancouver, ISO, and other styles
20

Bafuka, Freddy Nole. "Beyond text analysis : image-based evaluation of health-related text readability using style features." Thesis, Massachusetts Institute of Technology, 2009. http://hdl.handle.net/1721.1/53121.

Full text
Abstract:
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.<br>Includes bibliographical references (p. 70-71).<br>Many studies have shown that the readability of health documents presented to consumers does not match their reading levels. An accurate assessment of the readability of health-related texts is an important step in providing material that match readers' literacy. Current readability measurements depend heavily on text analysis (NLP), but neglect style (text layout). In this study, we show that style properties are important p
APA, Harvard, Vancouver, ISO, and other styles
21

Johansson, Vida. "Depending on VR : Rule-based Text Simplification Based on Dependency Relations." Thesis, Linköpings universitet, Institutionen för datavetenskap, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-139043.

Full text
Abstract:
The amount of text that is written and made available increases all the time. However, it is not readily accessible to everyone. The goal of the research presented in this thesis was to develop a system for automatic text simplification based on dependency relations, develop a set of simplification rules for the system, and evaluate the performance of the system. The system was built on a previous tool and developments were made to ensure the that the system could perform the operations necessary for the rules included in the rule set. The rule set was developed by manual adaption of the rules
APA, Harvard, Vancouver, ISO, and other styles
22

Benveniste, Steven M. "Investigation into text classification with kernel based schemes." Thesis, Monterey, California : Naval Postgraduate School, 2010. http://edocs.nps.edu/npspubs/scholarly/theses/2010/Mar/10Mar%5FBenveniste.pdf.

Full text
Abstract:
Thesis (M.S. in Electrical Engineering)--Naval Postgraduate School, March 2010.<br>Thesis Advisor(s): Fargues, Monique P. Second Reader: Cristi, Roberto. "March 2010." Description based on title screen as viewed on May 6, 2010. Author(s) subject terms: Text Classification, Text Categorization, Kernel Based Schemes, Single Value Decomposition (SVD), Data Mining, Feature Vector Selection (FVS). Includes bibliographical references (p. 141-142). Also available in print.
APA, Harvard, Vancouver, ISO, and other styles
23

Deniz, Onur. "Ontology Based Text Mining In Turkish Radiology Reports." Master's thesis, METU, 2012. http://etd.lib.metu.edu.tr/upload/12614145/index.pdf.

Full text
Abstract:
Vast amount of radiology reports are produced in hospitals. Being in free text format and having errors due to rapid production, it continuously gets more complicated for radiologists and physicians to reach meaningful information. Though application of ontologies into bio-medical text mining has gained increasing interest in recent years, less work has been offered for ontology based retrieval tasks in Turkish language. In this work, an information extraction and retrieval system based on SNOMED-CT ontology has been proposed for Turkish radiology reports. Main purpose of this work is to uti
APA, Harvard, Vancouver, ISO, and other styles
24

Wildermoth, Brett Richard, and n/a. "Text-Independent Speaker Recognition Using Source Based Features." Griffith University. School of Microelectronic Engineering, 2001. http://www4.gu.edu.au:8080/adt-root/public/adt-QGU20040831.115646.

Full text
Abstract:
Speech signal is basically meant to carry the information about the linguistic message. But, it also contains the speaker-specific information. It is generated by acoustically exciting the cavities of the mouth and nose, and can be used to recognize (identify/verify) a person. This thesis deals with the speaker identification task; i.e., to find the identity of a person using his/her speech from a group of persons already enrolled during the training phase. Listeners use many audible cues in identifying speakers. These cues range from high level cues such as semantics and linguistics of the sp
APA, Harvard, Vancouver, ISO, and other styles
25

Boulton, David. "Fine art image classification based on text analysis." Thesis, University of Surrey, 2002. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.252478.

Full text
APA, Harvard, Vancouver, ISO, and other styles
26

Mohamed, Muhidin Abdullahi. "Automatic text summarisation using linguistic knowledge-based semantics." Thesis, University of Birmingham, 2016. http://etheses.bham.ac.uk//id/eprint/6659/.

Full text
Abstract:
Text summarisation is reducing a text document to a short substitute summary. Since the commencement of the field, almost all summarisation research works implemented to this date involve identification and extraction of the most important document/cluster segments, called extraction. This typically involves scoring each document sentence according to a composite scoring function consisting of surface level and semantic features. Enabling machines to analyse text features and understand their meaning potentially requires both text semantic analysis and equipping computers with an external sema
APA, Harvard, Vancouver, ISO, and other styles
27

Thaper, Nitin 1975. "Using compression for source-based classification of text." Thesis, Massachusetts Institute of Technology, 2001. http://hdl.handle.net/1721.1/86595.

Full text
APA, Harvard, Vancouver, ISO, and other styles
28

Stigeborn, Olivia. "Text ranking based on semantic meaning of sentences." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-300442.

Full text
Abstract:
Finding a suitable candidate to client match is an important part of consultant companies work. It takes a lot of time and effort for the recruiters at the company to read possibly hundreds of resumes to find a suitable candidate. Natural language processing is capable of performing a ranking task where the goal is to rank the resumes with the most suitable candidates ranked the highest. This ensures that the recruiters are only required to look at the top ranked resumes and can quickly get candidates out in the field. Former research has used methods that count specific keywords in resumes an
APA, Harvard, Vancouver, ISO, and other styles
29

Wildermoth, Brett Richard. "Text-Independent Speaker Recognition Using Source Based Features." Thesis, Griffith University, 2001. http://hdl.handle.net/10072/366289.

Full text
Abstract:
Speech signal is basically meant to carry the information about the linguistic message. But, it also contains the speaker-specific information. It is generated by acoustically exciting the cavities of the mouth and nose, and can be used to recognize (identify/verify) a person. This thesis deals with the speaker identification task; i.e., to find the identity of a person using his/her speech from a group of persons already enrolled during the training phase. Listeners use many audible cues in identifying speakers. These cues range from high level cues such as semantics and linguistics of the sp
APA, Harvard, Vancouver, ISO, and other styles
30

CANO, ERION. "Text-based Sentiment Analysis and Music Emotion Recognition." Doctoral thesis, Politecnico di Torino, 2018. http://hdl.handle.net/11583/2709436.

Full text
Abstract:
Nowadays, with the expansion of social media, large amounts of user-generated texts like tweets, blog posts or product reviews are shared online. Sentiment polarity analysis of such texts has become highly attractive and is utilized in recommender systems, market predictions, business intelligence and more. We also witness deep learning techniques becoming top performers on those types of tasks. There are however several problems that need to be solved for efficient use of deep neural networks on text mining and text polarity analysis. First of all, deep neural networks are data hungry.
APA, Harvard, Vancouver, ISO, and other styles
31

Wang, Jingcheng. "A Rule-based Methodology and Feature-based Methodology for Effect Relation Extraction in Chinese Unstructured Text." Thesis, The University of Sydney, 2015. http://hdl.handle.net/2123/14152.

Full text
Abstract:
The Chinese language differs significantly from English, both in lexical representation and grammatical structure. These differences lead to problems in the Chinese NLP, such as word segmentation and flexible syntactic structure. Many conventional methods and approaches in Natural Language Processing (NLP) based on English text are shown to be ineffective when attending to these language specific problems in late-started Chinese NLP. Relation Extraction is an area under NLP, looking to identify semantic relationships between entities in the text. The term “Effect Relation” is introduced in th
APA, Harvard, Vancouver, ISO, and other styles
32

Goodrum, Abby A. (Abby Ann). "Evaluation of Text-Based and Image-Based Representations for Moving Image Documents." Thesis, University of North Texas, 1997. https://digital.library.unt.edu/ark:/67531/metadc500441/.

Full text
Abstract:
Document representation is a fundamental concept in information retrieval (IR), and has been relied upon in textual IR systems since the advent of library catalogs. The reliance upon text-based representations of stored information has been perpetuated in conventional systems for the retrieval of moving images as well. Although newer systems have added image-based representations of moving image documents as aids to retrieval, there has been little research examining how humans interpret these different types of representations. Such basic research has the potential to inform IR system designe
APA, Harvard, Vancouver, ISO, and other styles
33

Meyer, David, Kurt Hornik, and Ingo Feinerer. "Text Mining Infrastructure in R." American Statistical Association, 2008. http://epub.wu.ac.at/3978/1/textmining.pdf.

Full text
Abstract:
During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package which provides a framework for text mining applications within R. We give a survey on text mining facilities in R and explain how typical application tasks can be carried out using our framework. We present techniques for count-based analysis methods, text clustering, text classiffication and string kernels. (authors' abstract)
APA, Harvard, Vancouver, ISO, and other styles
34

Hossayni, Sayyed Ali. "Foundations of uncertainty management for text-based sentiment prediction." Doctoral thesis, Universitat de Girona, 2018. http://hdl.handle.net/10803/666765.

Full text
Abstract:
Analyzing the sentiment of Social Networks users is an attractive task, well-covered by the Sentiment Analysis research communities. Alongside, predicting the rating/opinion of users in Social Networks or e-commerce platforms is another attractive task covered by the Recommender Systems research communities. However, there is a rather new field of study that takes advantage of both of the mentioned scopes to predict the “unexpressed” opinion of users, based on their written sentiments and their similarity. Although the Social Network extracted data (due to the sparsity of the addressed items b
APA, Harvard, Vancouver, ISO, and other styles
35

Stymne, Sara. "Text Harmonization Strategies for Phrase-Based Statistical Machine Translation." Doctoral thesis, Linköpings universitet, NLPLAB - Laboratoriet för databehandling av naturligt språk, 2012. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-76766.

Full text
Abstract:
In this thesis I aim to improve phrase-based statistical machine translation (PBSMT) in a number of ways by the use of text harmonization strategies. PBSMT systems are built by training statistical models on large corpora of human translations. This architecture generally performs well for languages with similar structure. If the languages are different for example with respect to word order or morphological complexity, however, the standard methods do not tend to work well. I address this problem through text harmonization, by making texts more similar before training and applying a PBSMT sys
APA, Harvard, Vancouver, ISO, and other styles
36

Han, Changan. "Neural Network Based Off-line Handwritten Text Recognition System." FIU Digital Commons, 2011. http://digitalcommons.fiu.edu/etd/363.

Full text
Abstract:
This dissertation introduces a new system for handwritten text recognition based on an improved neural network design. Most of the existing neural networks treat mean square error function as the standard error function. The system as proposed in this dissertation utilizes the mean quartic error function, where the third and fourth derivatives are non-zero. Consequently, many improvements on the training methods were achieved. The training results are carefully assessed before and after the update. To evaluate the performance of a training system, there are three essential factors to be consid
APA, Harvard, Vancouver, ISO, and other styles
37

Massey, Louis. "A lazy text-based approach to foundational knowledge acquisition." Thesis, University of Ottawa (Canada), 1995. http://hdl.handle.net/10393/10084.

Full text
Abstract:
Knowledge Acquisition (KA) from text requires that a large quantity of prior knowledge be made available to the Natural Language Processing (NLP) system. This prior knowledge is called foundational knowledge. The question of where foundational knowledge comes from in the first place is one of the biggest problem facing NLP. Conventionally, foundational knowledge has been hand-crafted on a task- and domain-specific basis. However, it is difficult to determine beforehand exactly what knowledge will be required. It has been shown within the TANKA project that a potential solution to this problem
APA, Harvard, Vancouver, ISO, and other styles
38

Schierz, Amanda Claire. "Monitoring innovation in emerging science : a text-based approach." Thesis, University of Surrey, 2005. http://epubs.surrey.ac.uk/842836/.

Full text
Abstract:
The transfer of knowledge from academia to industry is of critical importance to both academics and industrialists. It can be argued that patent documents referring to a set of well-researched concepts may be used as a measure of such a transfer. Concepts are typically articulated as terms, and shared terms in research papers and patent documents are proposed as the monitoring index. Key developments in science and engineering are usually signalled by the introduction of new terms and the exclusion of established ones; this change in the terminology may be construed as a change in the knowledg
APA, Harvard, Vancouver, ISO, and other styles
39

Hossain, Mahmud Shahriar. "Apriori approach to graph-based clustering of text documents." Thesis, Montana State University, 2008. http://etd.lib.montana.edu/etd/2008/hossain/HossainM0508.pdf.

Full text
Abstract:
This thesis report introduces a new technique of document clustering based on frequent senses. The developed system, named GDClust (Graph-Based Document Clustering) [1], works with frequent senses rather than dealing with frequent keywords used in traditional text mining techniques. GDClust presents text documents as hierarchical document-graphs and uses an Apriori paradigm to find the frequent subgraphs, which reflect frequent senses. Discovered frequent subgraphs are then utilized to generate accurate sense-based document clusters. We propose a novel multilevel Gaussian minimum support strat
APA, Harvard, Vancouver, ISO, and other styles
40

Lam, Yat-kin, and 林日堅. "Intelligent lexical access based on Chinese/English text queries." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2005. http://hub.hku.hk/bib/B30445474.

Full text
APA, Harvard, Vancouver, ISO, and other styles
41

Botha, Gerrti Reinier. "Text-based language identification for the South African languages." Pretoria : [s.n.], 2007. http://upetd.up.ac.za/thesis/available/etd-090942008-133715/.

Full text
APA, Harvard, Vancouver, ISO, and other styles
42

Curtmola, Emiran. "Democratic community-based search with XML full-text queries." Diss., [La Jolla] : University of California, San Diego, 2009. http://wwwlib.umi.com/cr/ucsd/fullcit?p3378521.

Full text
Abstract:
Thesis (Ph. D.)--University of California, San Diego, 2009.<br>Title from first page of PDF file (viewed October 22, 2009). Available via ProQuest Digital Dissertations. Vita. Includes bibliographical references (p. 184-193).
APA, Harvard, Vancouver, ISO, and other styles
43

Kamal, Hasan. "An ATMS-based architecture for stylistics-aware text generation." Thesis, University of Edinburgh, 2002. http://hdl.handle.net/1842/23067.

Full text
Abstract:
This thesis is concerned with the effect of surface stylistic constraints (SSC) on syntactic and lexical choice within a unified generation architecture. Despite the fact that these issues have been investigated by researchers in the field, little work has been done with regard to system architectures that allow surface form constraints to influence earlier linguistic or even semantic decisions made throughout the NLG process. By SSC we mean those stylistic requirements that are known beforehand but cannot be tested until after the utterance or (in some lucky cases) a proper linearised part of
APA, Harvard, Vancouver, ISO, and other styles
44

JOSHI, ARUNIMA. "CONCEPT BASED TEXT CLASSIFICATION." Thesis, 2016. http://dspace.dtu.ac.in:8080/jspui/handle/repository/15244.

Full text
Abstract:
The shift from Web 2.0 to Web 3.0 has significantly changed the perception of users for the internet and the Web. Web 2.0 has improved information sharing among the users, the contribution and collaboration of users, and Web 3.0 has improved the structure and representation of data. Web 3.0 (Semantic Web) is all about the concepts which relates more to real-world entities, which proves to be more realistic and practical. One of the most popular applications of Web 2.0 is blogging and its services. For example, Twitter has evolved as a great platform to share opinions and views on anythin
APA, Harvard, Vancouver, ISO, and other styles
45

Tran, Binh Giang. "Combining text-based and vision-based semantics." Master's thesis, 2011. http://www.nusl.cz/ntk/nusl-313292.

Full text
Abstract:
Learning and representing semantics is one of the most important tasks that significantly contribute to some growing areas, as successful stories in the recent survey of Turney and Pantel (2010). In this thesis, we present an in- novative (and first) framework for creating a multimodal distributional semantic model from state of the art text-and image-based semantic models. We evaluate this multimodal semantic model on simulating similarity judgements, concept clustering and the newly introduced BLESS benchmark. We also propose an effective algorithm, namely Parameter Estimation, to integrate
APA, Harvard, Vancouver, ISO, and other styles
46

Viswanath, Meghana. "Ontology-based automatic text summarization." 2009. http://purl.galileo.usg.edu/uga%5Fetd/viswanath%5Fmeghana%5F200912%5Fms.

Full text
APA, Harvard, Vancouver, ISO, and other styles
47

Zhong, Ming. "Concept-based biomedical text retrieval /." 2007. http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&res_dat=xri:pqdiss&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&rft_dat=xri:pqdiss:MR29634.

Full text
Abstract:
Thesis (M.Sc.)--York University, 2007. Graduate Programme in Computer Science.<br>Typescript. Includes bibliographical references (leaves 96-101). Also available on the Internet. MODE OF ACCESS via web browser by entering the following URL: http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&res_dat=xri:pqdiss&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&rft_dat=xri:pqdiss:MR29634
APA, Harvard, Vancouver, ISO, and other styles
48

TONGAR, VIPIN. "TEXT BASED SANSKRIT LANGUAGE IDENTIFICATION." Thesis, 2022. http://dspace.dtu.ac.in:8080/jspui/handle/repository/19141.

Full text
Abstract:
Text Based Identification of a language is the process of automatically detecting a certain language based on the text given in an article or document. Language identification is an established domain of research that has received considerable attention in the past. Language identification is a crucial initial step in various other works of Natural language processing, language translation, performing language specific AI models etc. It is somewhat easier to differentiate languages which do not belong to same language family or not having same script because the characteristics features
APA, Harvard, Vancouver, ISO, and other styles
49

Henriques, Daniel Filipe Rodrigues. "Automatic Completion of Text-based Tasks." Master's thesis, 2019. http://hdl.handle.net/10362/92296.

Full text
Abstract:
Crowdsourcing is a widespread problem-solving model which consists in assigning tasks to an existing pool of workers in order to solve a problem, being a scalable alternative to hiring a group of experts for labeling high volumes of data. It can provide results that are similar in quality, with the advantage of achieving such standards in a faster and more efficient manner. Modern approaches to crowdsourcing use Machine Learning models to do the labeling of the data and request the crowd to validate the results. Such approaches can only be applied if the data in which the model was train
APA, Harvard, Vancouver, ISO, and other styles
50

Debnath, Sandip. "Automatic text-based explanation of events." 2005. http://etda.libraries.psu.edu/theses/approved/WorldWideIndex/ETD-1045/index.html.

Full text
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!