Dissertations / theses on the topic “Retrieval accuracy”

Consult the top 25 scholarly dissertations / theses on the topic “Retrieval accuracy”.

1

Mohammadzadeh, Hadi. "Improving Retrieval Accuracy in Main Content Extraction from HTML Web Documents". Doctoral thesis, Universitätsbibliothek Leipzig, 2013. http://nbn-resolving.de/urn:nbn:de:bsz:15-qucosa-130500.

Abstract:
The rapid growth of text-based information on the World Wide Web, and of the applications that make use of this data, motivates the need for efficient and effective methods to identify and separate the “main content” from additional content items such as navigation menus, advertisements, design elements or legal disclaimers. Firstly, in this thesis, we study, develop, and evaluate R2L, DANA, DANAg, and AdDANAg, a family of novel algorithms for extracting the main content of web documents. The main concept behind R2L, which also provided the initial idea and motivation for the other three algorithms, is to exploit particularities of Right-to-Left languages to obtain the main content of web pages. As the English character set and the Right-to-Left character set are encoded in different intervals of the Unicode character set, we can efficiently distinguish the Right-to-Left characters from the English ones in an HTML file. This enables the R2L approach to recognize areas of the HTML file with a high density of Right-to-Left characters and a low density of characters from the English character set. Having recognized these areas, R2L can successfully separate only the Right-to-Left characters. The first extension of R2L, DANA, improves the effectiveness of the baseline algorithm by employing an HTML parser in a post-processing phase of R2L to extract the main content from areas with a high density of Right-to-Left characters. DANAg is the second extension of R2L and generalizes the idea of R2L to render it language independent. AdDANAg, the third extension of R2L, integrates a new preprocessing step to normalize the hyperlink tags. The presented approaches are analyzed with respect to efficiency and effectiveness. We compare them to several established main content extraction algorithms and show that we extend the state of the art in terms of both efficiency and effectiveness.
Secondly, automatically extracting the headline of web articles has many applications. We develop and evaluate a content-based and language-independent approach, TitleFinder, for unsupervised extraction of the headline of web articles. The proposed method achieves high performance in terms of effectiveness and efficiency and outperforms approaches operating on structural and visual features.
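The character-range idea behind R2L can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the function names, the per-line granularity, the threshold, and the restriction to the Hebrew and Arabic Unicode blocks (U+0590 to U+06FF) are all assumptions made for illustration.

```python
def is_rtl(ch: str) -> bool:
    """True for characters in the Hebrew or Arabic Unicode blocks.

    Assumption: only U+0590..U+06FF is checked; other right-to-left
    blocks are ignored in this sketch.
    """
    return "\u0590" <= ch <= "\u06ff"


def is_ascii_letter(ch: str) -> bool:
    """True for ASCII letters, which dominate HTML tags and attributes."""
    return ch.isascii() and ch.isalpha()


def rtl_density(line: str) -> float:
    """Share of right-to-left characters among RTL and ASCII letters in a line."""
    rtl = sum(1 for ch in line if is_rtl(ch))
    latin = sum(1 for ch in line if is_ascii_letter(ch))
    total = rtl + latin
    return rtl / total if total else 0.0


def candidate_lines(html: str, threshold: float = 0.5) -> list:
    """Indices of lines whose RTL density reaches the (hypothetical) threshold."""
    return [i for i, line in enumerate(html.splitlines())
            if rtl_density(line) >= threshold]


# Hypothetical page: a menu line, a Persian paragraph, and a footer line.
sample = (
    '<div class="menu"><a href="/home">Home</a></div>\n'
    "<p>سلام دنیا این متن اصلی است</p>\n"
    "<span>footer</span>"
)
main_content_lines = candidate_lines(sample)
```

Lines dominated by markup score close to zero, while lines carrying right-to-left text score close to one, so a density threshold is enough to separate them in this toy input.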
2

Berman, Garrett L. "The influence of processing instructions at encoding and retrieval on face recognition accuracy". FIU Digital Commons, 1992. http://digitalcommons.fiu.edu/etd/1628.

Abstract:
Whereas previous research has demonstrated that trait ratings of faces at encoding lead to enhanced recognition accuracy compared with feature ratings, this set of experiments examines whether ratings given after encoding and just prior to recognition influence face recognition accuracy. In Experiment 1, subjects who made feature ratings just prior to recognition were significantly less accurate than subjects who made no ratings or trait ratings. In Experiment 2, ratings were manipulated at both encoding and retrieval. The retrieval effect was smaller and nonsignificant, but a combined probability analysis showed that it was significant when results from both experiments were considered jointly. In a third experiment, exposure duration at retrieval, a potentially confounding factor in Experiments 1 and 2, had a nonsignificant effect on recognition accuracy, suggesting that it probably does not explain the results from Experiments 1 and 2. These experiments demonstrate that face recognition accuracy can be influenced by processing instructions at retrieval.
3

Zakos, John. "A Novel Concept and Context-Based Approach for Web Information Retrieval". Griffith University. School of Information and Communication Technology, 2005. http://www4.gu.edu.au:8080/adt-root/public/adt-QGU20060303.104937.

Abstract:
Web information retrieval is a relatively new research area that has attracted a significant amount of interest from researchers around the world since the emergence of the World Wide Web in the early 1990s. The problems facing successful web information retrieval are a combination of challenges that stem from traditional information retrieval and challenges characterised by the nature of the World Wide Web. The goal of any information retrieval system is to fulfil a user's information need. In a web setting, this means retrieving as many relevant web documents as possible in response to a query that typically contains only a few terms expressing the user's information need. This thesis is primarily concerned with firstly reviewing pertinent literature related to various aspects of web information retrieval research and secondly proposing and investigating a novel concept and context-based approach. The approach consists of techniques that can be used together or independently and that aim to improve retrieval accuracy over other approaches. A novel concept-based term weighting technique is proposed as a new method of deriving query term significance from ontologies that can be used for the weighting of input queries. A technique that dynamically determines the significance of terms occurring in documents based on the matching of contexts is also proposed. Other contributions of this research include techniques for combining document and query term weights for the ranking of retrieved documents. All techniques were implemented and tested on benchmark data. This provides a basis for comparison with previous top-performing web information retrieval systems. High retrieval accuracy is reported as a result of utilising the proposed approach. This is supported through comprehensive experimental evidence and favourable comparisons against previously published results.
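One simple way to combine query term weights with document term weights for ranking is a weighted sum over the terms they share. The sketch below is an illustrative assumption only: the thesis derives its query weights from ontologies and its document weights from context matching, neither of which is reproduced here, and every name and number in the code is hypothetical.

```python
def score(query_weights: dict, doc_weights: dict) -> float:
    """Sum of query weight times document weight over terms present in both."""
    return sum(w * doc_weights[t]
               for t, w in query_weights.items() if t in doc_weights)


def rank(query_weights: dict, docs: dict) -> list:
    """Document ids ordered by descending score."""
    return sorted(docs, key=lambda d: score(query_weights, docs[d]), reverse=True)


# Hypothetical weights, chosen only to exercise the functions.
query = {"jaguar": 0.9, "speed": 0.4}
docs = {
    "d1": {"jaguar": 0.5, "car": 0.8},
    "d2": {"jaguar": 0.7, "speed": 0.6},
    "d3": {"car": 0.9},
}
ranking = rank(query, docs)
```

Here the document that matches both weighted query terms ranks first, and a document sharing no query term scores zero.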
4

Zakos, John. "A Novel Concept and Context-Based Approach for Web Information Retrieval". Thesis, Griffith University, 2005. http://hdl.handle.net/10072/365878.

Thesis (PhD Doctorate)
Doctor of Philosophy (PhD)
School of Information and Communication Technology
5

Gwyer, Pat. "The effects of retrieval procedures on recall, recognition, confidence and the confidence/accuracy relationship". Thesis, University of East London, 1997. http://roar.uel.ac.uk/1281/.

Abstract:
Six separate experiments were conducted to investigate the effects of specific retrieval procedures on recall, recognition, confidence and the confidence/accuracy relationship for these retrieval domains. In experiment I the effects of retrieval procedures were considered as part of a recognised police interview technique, the Cognitive Interview (CI), while in experiments II, III, and IV the retrieval procedures involved discrete context manipulations. Experiments V and VI focused exclusively on the confidence/accuracy relationship for recall as a function of specific retrieval procedures and question type. Results indicated that although frequently improving the quality and quantity of recall, the CI and other types of context reinstatement manipulation did not reliably improve recognition accuracy from lineup presentations, nor did they have a significant moderating effect upon the confidence/accuracy relationship. However, in experiment IV, in which a long (three-month) delay was utilised, significant effects of context manipulation on recognition performance were found. With regard to confidence, experiment I indicated that the CI was responsible for a significant increase in confidence of recall but not recognition. Results from experiments II, III, and IV indicated inconsistent effects of context manipulation on confidence ratings for either recall or recognition. With regard to the confidence/accuracy relationship, results from the initial five experiments indicated that in very few instances were confidence and accuracy significantly related. However, in experiment VI confidence and accuracy were found to be reliably and consistently related. The most important finding to emerge from this research suggests that the retrieval procedure undergone by a witness (interactive interview / passive questionnaire) is an important moderator of the confidence/accuracy relationship.
As such, the results are supportive of Leippe's (1980) two premises, in which it is suggested that as reconstructional and social influences increase, the confidence/accuracy relationship will correspondingly decrease.
6

Begovic, Ena. "Encoding Style of Positive Autobiographical Memories: Relationship to Mood Repair, Memory Functioning, and Depression". Scholar Commons, 2016. http://scholarcommons.usf.edu/etd/6464.

Abstract:
The recall of positive autobiographical memories is an effective strategy for improving negative mood among healthy persons, yet individuals with a history of depression often fail to derive emotional benefits. Depressed and depression-vulnerable individuals also exhibit deficits in their autobiographical memory characteristics. Scholars have implicated deficits during autobiographical memory retrieval as a cause of mood repair and memory impairments; however, the role of memory encoding has largely been overlooked. The current study manipulated encoding style to examine subsequent effects on mood repair efficacy, memory characteristics, and memory accuracy. Fifty-five formerly depressed and 68 never-depressed participants were assigned to employ either a concrete or natural encoding style while engaging in a positive event staged in the laboratory. After a negative mood induction, participants were given the opportunity to improve their moods by recalling details of the positive event. Results failed to support the hypothesized interaction of depression status and encoding style. Interpretations of the null findings are provided and implications of the study are discussed.
7

Morgart, Arianna Paige. "Lexical access in aphasia: impacts of phonological neighborhood density on accuracy of word production". Thesis, University of Iowa, 2015. https://ir.uiowa.edu/etd/1704.

Abstract:
Verbal communication relies heavily on the ability to effortlessly produce intended words to express a meaning. This capacity is frequently impaired in individuals with aphasia, and impairment often lasts well into the chronic stages. However, the nature of anomia can vary. Phonological neighborhood density (PND) is one feature of words which has been shown to impact the ease of retrieval in speakers with aphasia; words with more similar-sounding neighbors are easier to retrieve because the neighbors help activate the target. However, it is unclear how different types of lexical access breakdowns affect the impact of PND. The aim of this project was to analyze the relationship between word retrieval accuracy, speech error patterns, and PND in individuals with aphasia. Twenty-two participants with various types and severities of aphasia named 200 single-syllable line drawings. WebFit, an online software program designed to fit naming data to a theoretical model of word retrieval, was used to characterize participants' error patterns by calculating the strength of connections within the lexicon, as well as the rate of decay. Analyses confirmed previous findings that participants with all types of breakdown achieved lower rates of overall accuracy. Weaker connections between semantic knowledge and words resulted in more errors that were close to the target, relative to errors with no relationship to the target. Individuals with more severe impairments of the semantic-lexical connections and the lexical-phonological connections produced words with many neighbors more accurately than words with fewer neighbors. Implications for initial therapy target selection and directions for further research are discussed.
8

Risius, Uda-Mareke [author]. "Memory accuracy : A 7-Tesla fMRI approach to memory accuracy ; retrieval, monitoring and control processes / Uda-Mareke Risius. Fakultät für Psychologie und Sportwissenschaft - Arbeitseinheit 14 - Physiologische Psychologie". Bielefeld : Universitätsbibliothek Bielefeld, Hochschulschriften, 2011. http://d-nb.info/1015183646/34.

9

Holmes, Jason. "Measuring the accuracy of four attributes of sound for conveying changes in a large data set". Thesis, University of North Texas, 2003. https://digital.library.unt.edu/ark:/67531/metadc4154/.

Abstract:
Human auditory perception is suited to receiving and interpreting information from the environment, but this knowledge has not been used extensively in designing computer-based information exploration tools. It is not known which aspects of sound are useful for accurately conveying information in an auditory display. An auditory display was created using PD, a graphical programming language used primarily to manipulate digital sound. The interface for the auditory display was a blank window. When the cursor was moved around in this window, the sound generated changed based on the underlying data value at any given point. An experiment was conducted to determine which attribute of sound most accurately represents data values in an auditory display. The four attributes of sound tested were frequency-sine waveform, frequency-sawtooth waveform, loudness and tempo. Twenty-four subjects were given the task of finding the highest data point by sound alone with each of the four sound treatments. Three dependent variables were measured: distance accuracy, numeric accuracy, and time on task. Repeated measures ANOVA procedures conducted on these variables did not yield statistically significant differences (α=.05). None of the sound treatments was more accurate than the others at representing the underlying data values. 52% of the trials were accurate within 50 pixels of the highest data point (target). An interesting finding was the tendency for the frequency-sine waveform to be used in the least accurate trial attempts (38%). Loudness, on the other hand, accounted for very few (12.5%) of the least accurate trial attempts. In completing the experimental task, four different search techniques were employed by the subjects: perimeter, parallel sweep, sector, and quadrant. The perimeter technique was the most commonly used.
10

Mohammadzadeh, Hadi [author], Gerhard [academic supervisor] Heyer, Gerhard [reviewer] Heyer and Jinan [reviewer] Fiaidhi. "Improving Retrieval Accuracy in Main Content Extraction from HTML Web Documents / Hadi Mohammadzadeh ; Reviewers: Gerhard Heyer, Jinan Fiaidhi ; Supervisor: Gerhard Heyer". Leipzig : Universitätsbibliothek Leipzig, 2013. http://d-nb.info/1237818303/34.

11

Quimby, Megan. "Word retrieval in bilingual individuals: how do language, test type, and self-reported fluency relate to naming accuracy in English and Hebrew". Thesis, Boston University, 2013. https://hdl.handle.net/2144/12196.

Abstract:
Thesis (M.S.)--Boston University
Background: Although research on lexical access in Hebrew-English bilinguals exists, there is a need to quantify and determine whether proficiency correlates with scores on language tasks like naming in structurally different languages, such as Hebrew and English. Objective: The aim of this study is to determine whether language, test type, and self-reported proficiency are reflected in Hebrew-English bilinguals' naming ability in the two languages. Methods: Twenty healthy Hebrew-English bilinguals completed the Mini Mental State Examination (MMSE), the Language Use Questionnaire (LUQ), the Boston Naming Test (BNT), a Hebrew naming test (Kave, 2005), a 100-item category picture naming task, and category generation in both languages. Results: There was a significant effect of language and test type on accuracy scores on picture naming tests. While language did not have a significant effect on category generation, category had a significant effect on correct words, mean semantic cluster score, and mean semantic switching score. There were several language proficiency variables that were significantly correlated with naming tasks in English (Language Ability Rating (LAR), Confidence, Lifetime Exposure, Education History), yet on the Hebrew naming tasks LUQ proficiency variables (LAR, Education History) were significantly correlated only with Kave's test. Conclusion: It appears that confrontation-naming assessments result in more variable performance in English and Hebrew than fluency tests. Based on correlations between LUQ variables and Hebrew naming performance, Kave's test appears to be the only test that captures the unique aspects that allow individuals to be proficient in Hebrew. The BNT is a useful measure for examining English naming. Therefore, using the BNT and Kave's Hebrew naming test appears to be the most effective way of measuring naming performance in Hebrew-English bilinguals.
Significant LUQ variables in English and Hebrew indicate that self-reported acquisition patterns and lifetime usage provide valuable information regarding naming performance.
12

Ekström, Linus, and Andreas Augustsson. "A comperative study of text classification models on invoices : The feasibility of different machine learning algorithms and their accuracy". Thesis, Högskolan i Skövde, Institutionen för informationsteknologi, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-15647.

Abstract:
Text classification for companies is becoming more important in a world where an increasing amount of digital data is made available. The aim is to investigate whether five different machine learning algorithms can be used to automate the classification of invoice data and to see which one achieves the highest accuracy. At a later stage, the algorithms are combined in an attempt to achieve better results. N-grams are used, and results are compared in terms of the total classification accuracy of each algorithm. A Python library, scikit-learn, which implements the chosen algorithms, was used. Data is collected and generated to represent the data present on a real invoice from which data has been extracted. Results from this thesis show that it is possible to use machine learning for this type of problem. The highest-scoring algorithm (LinearSVC from scikit-learn) classifies 86% of all samples correctly. This is a margin of 16 percentage points above the acceptable level of 70%.
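The two ingredients named in this abstract, n-gram features and total classification accuracy, can be sketched with the Python standard library alone. This is an illustration under assumptions, not the thesis's pipeline (which used scikit-learn): the function names, the whitespace tokenisation, and the sample invoice text are hypothetical.

```python
def word_ngrams(text: str, n: int) -> list:
    """Word n-grams of a whitespace-tokenised, lower-cased text."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def accuracy(predicted: list, actual: list) -> float:
    """Fraction of samples whose predicted label equals the true label."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("label lists must be non-empty and of equal length")
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


# Hypothetical invoice line, used only to show the shape of the features.
bigrams = word_ngrams("Total amount due", 2)

# Margin between the reported best accuracy and the acceptance level,
# in percentage points (both figures are taken from the abstract).
margin_points = 86 - 70
```

The margin is a difference between two percentages, so it is expressed in percentage points rather than as a relative percentage.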
13

Yesiler, M. Furkan. "Data-driven musical version identification: accuracy, scalability and bias perspectives". Doctoral thesis, Universitat Pompeu Fabra, 2022. http://hdl.handle.net/10803/673264.

Abstract:
This dissertation aims at developing audio-based musical version identification (VI) systems for industry-scale corpora. To employ such systems in industrial use cases, they must demonstrate high performance on large-scale corpora while not favoring certain musicians or tracks above others. Therefore, the three main aspects we address in this dissertation are accuracy, scalability, and algorithmic bias of VI systems. We propose a data-driven model that incorporates domain knowledge in its network architecture and training strategy. We then take two main directions to further improve our model. Firstly, we experiment with data-driven fusion methods to combine information from models that process harmonic and melodic information, which greatly enhances identification accuracy. Secondly, we investigate embedding distillation techniques to reduce the size of the embeddings produced by our model, which reduces the requirements for data storage and, more importantly, retrieval time. Lastly, we analyze the algorithmic biases of our systems.
14

Noyes, Elizabeth J. "An investigation into the accuracy of surface temperature retrievals from the AATSR". Thesis, University of Leicester, 2005. http://hdl.handle.net/2381/30690.

Abstract:
This thesis investigates the accuracy of operational sea surface temperature (SST) and land surface temperature (LST) data from the Advanced Along-Track Scanning Radiometer (AATSR). The study includes some of the first in situ validation results for these data sets, and forms an integral part of the overall AATSR validation programme. In addition, a comprehensive sensitivity study of the response of these retrievals to changes in atmospheric and surface conditions is presented as an aid to interpreting validation results. AATSR SSTs recorded during 2003 have been validated over the Caribbean Sea, using in situ observations of SST from the Marine-Atmosphere Emitted Radiance Interferometer (M-AERI). This validation experiment, which extends over a full year, is the most extensive yet performed for any of the ATSR instruments (ATSR-1, ATSR-2 and AATSR) using in situ SSTs derived from radiometric measurements. These data also provide a first opportunity to study seasonal biases using observations of this type. The results demonstrate that clear-sky SSTs obtained using the unique dual-viewing geometry of the AATSR agree with the in situ data to within 0.3 K. Nadir-only retrievals performed using three channels also provide SSTs that approach this accuracy, but the split-window retrievals are found to be warm-biased by ∼0.6 K. When tropospheric dust aerosol is present, the accuracy of the retrieved SSTs is reduced, with the nadir-view SSTs becoming cold-biased and the dual-view SSTs warm-biased. For the first time, the potential for validation of LST over heterogeneous land sites is also explored. Two methodologies are presented for upscaling point in situ LSTs to the 1-km spatial scale of the AATSR, together with initial validation results over a field site in Morocco, which suggest that the AATSR LSTs are warm-biased by at least 0.6 K over this site.
The findings of this study suggest that validation over heterogeneous sites is possible for situations where the variation of LST is characterised by several in situ point observations of LST.
15

Vijayan, Balaji. "Accurate and efficient detection, prediction and exploitation of program phases". Diss., Online access via UMI:, 2005.

16

Wallace, Roy Geoffrey. "Fast and accurate phonetic spoken term detection". Thesis, Queensland University of Technology, 2010. https://eprints.qut.edu.au/39610/1/Roy_Wallace_Thesis.pdf.

Abstract:
For the first time in human history, large volumes of spoken audio are being broadcast, made available on the internet, archived, and monitored for surveillance every day. New technologies are urgently required to unlock these vast and powerful stores of information. Spoken Term Detection (STD) systems provide access to speech collections by detecting individual occurrences of specified search terms. The aim of this work is to develop improved STD solutions based on phonetic indexing. In particular, this work aims to develop phonetic STD systems for applications that require open-vocabulary search, fast indexing and search speeds, and accurate term detection. Within this scope, novel contributions are made within two research themes, that is, accommodating phone recognition errors and, secondly, modelling uncertainty with probabilistic scores. A state-of-the-art Dynamic Match Lattice Spotting (DMLS) system is used to address the problem of accommodating phone recognition errors with approximate phone sequence matching. Extensive experimentation on the use of DMLS is carried out and a number of novel enhancements are developed that provide for faster indexing, faster search, and improved accuracy. Firstly, a novel comparison of methods for deriving a phone error cost model is presented to improve STD accuracy, resulting in up to a 33% improvement in the Figure of Merit. A method is also presented for drastically increasing the speed of DMLS search by at least an order of magnitude with no loss in search accuracy. An investigation is then presented of the effects of increasing indexing speed for DMLS, by using simpler modelling during phone decoding, with results highlighting the trade-off between indexing speed, search speed and search accuracy. The Figure of Merit is further improved by up to 25% using a novel proposal to utilise word-level language modelling during DMLS indexing. 
Analysis shows that this use of language modelling can, however, be unhelpful or even disadvantageous for terms with a very low language model probability. The DMLS approach to STD involves generating an index of phone sequences using phone recognition. An alternative approach to phonetic STD is also investigated that instead indexes probabilistic acoustic scores in the form of a posterior-feature matrix. A state-of-the-art system is described and its use for STD is explored through several experiments on spontaneous conversational telephone speech. A novel technique and framework is proposed for discriminatively training such a system to directly maximise the Figure of Merit. This results in a 13% improvement in the Figure of Merit on held-out data. The framework is also found to be particularly useful for index compression in conjunction with the proposed optimisation technique, providing for a substantial index compression factor in addition to an overall gain in the Figure of Merit. These contributions significantly advance the state-of-the-art in phonetic STD, by improving the utility of such systems in a wide range of applications.
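Approximate matching of phone sequences with per-error costs is often realised as a weighted edit distance. The sketch below shows that general technique only; it is not the DMLS system described in the abstract, and the phone symbols, the cost values, and the `sub_cost` function are hypothetical stand-ins for a phone error cost model.

```python
def weighted_edit_distance(target, observed, sub_cost, ins_cost=1.0, del_cost=1.0):
    """Minimum total cost of aligning the observed phone sequence with the target.

    sub_cost(a, b) returns the cost of observing phone b where a was intended.
    """
    rows, cols = len(target) + 1, len(observed) + 1
    dist = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):
        dist[i][0] = i * del_cost   # i target phones with no observed counterpart
    for j in range(1, cols):
        dist[0][j] = j * ins_cost   # j observed phones with no target counterpart
    for i in range(1, rows):
        for j in range(1, cols):
            dist[i][j] = min(
                dist[i - 1][j] + del_cost,
                dist[i][j - 1] + ins_cost,
                dist[i - 1][j - 1] + sub_cost(target[i - 1], observed[j - 1]),
            )
    return dist[-1][-1]


# Hypothetical cost model: confusing similar phones is cheaper than other errors.
SIMILAR = {frozenset(("p", "b")), frozenset(("t", "d")), frozenset(("s", "z"))}


def sub_cost(a, b):
    """Zero for a match, a reduced cost for a similar pair, full cost otherwise."""
    if a == b:
        return 0.0
    return 0.3 if frozenset((a, b)) in SIMILAR else 1.0


# Search term "k ae t" against an indexed sequence with one plausible confusion.
cost = weighted_edit_distance(["k", "ae", "t"], ["k", "ae", "d"], sub_cost)
```

A detection would then be accepted when the cost falls below some threshold, which trades missed terms against false alarms.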
17

REIS, JUNIOR JOSE S. B. "Métodos e softwares para análise da produção científica e detecção de frentes emergentes de pesquisa". reponame:Repositório Institucional do IPEN, 2015. http://repositorio.ipen.br:8080/xmlui/handle/123456789/26929.

Abstract:
Submitted by Marco Antonio Oliveira da Silva (maosilva@ipen.br) on 2016-12-21T15:07:24Z No. of bitstreams: 0
Made available in DSpace on 2016-12-21T15:07:24Z (GMT). No. of bitstreams: 0
The progress of earlier projects highlighted the need to address the problem of software for detecting emerging research and development trends from databases of scientific publications. It became evident that there is a shortage of efficient computational applications dedicated to this purpose, which are very useful for better planning of research and development programmes in institutions. A review of the currently available software was therefore carried out in order to clearly delineate the opportunity to develop new tools. As a result, an application called Citesnake was implemented, designed especially to support the detection and study of emerging trends through the analysis of networks of various types extracted from scientific databases. Using this robust and effective computational tool, analyses of emerging research and development fronts were conducted in the area of Generation IV Nuclear Energy Systems, so as to show, among the reactor types selected as the most promising by GIF (the Generation IV International Forum), those that developed the most over the last ten years and that currently appear the most capable of fulfilling the promises made about their innovative concepts.
Dissertation (Master's in Nuclear Technology)
IPEN/D
Instituto de Pesquisas Energéticas e Nucleares - IPEN-CNEN/SP
18

Thaiss, Laila Maria. "A comparison of the role of the frontal cortex and the anterior temporal lobe in source memory and in the accurate retrieval of episodic information /". Thesis, McGill University, 2001. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=38424.

Abstract:
It has been argued that patients with frontal lobe lesions are impaired in temporal context memory and, more generally, in retrieving the source of one's knowledge or ideas. Furthermore, it has been speculated that a failure to retrieve source information may result in an increased susceptibility to distortions of episodic memories in patients with frontal lobe lesions. The precise role of the frontal cortex, however, in source or episodic retrieval is not clear. Does this region of cortex play a primary role or a secondary, executive role in the processing of such memories? Studies of patients with temporal lobe lesions have also shown impairments in episodic memory, including difficulties in the retrieval of source information. An important issue, therefore, is whether these two brain regions make different contributions to the processing of source information and to the retrieval of episodic memories.
In the present experiments, patients with unilateral excisions restricted to frontal cortex or to the anterior temporal lobe were compared on various tasks examining source memory performance and the accurate retrieval of episodic information. The results of these studies failed to support the general contention that patients with frontal cortex excisions have source (or temporal context) memory impairments. Instead, differences between these patients and normal control subjects appeared to be contingent on whether strategic organizational or control processes were necessary for efficient processing of episodic information. The memory of patients with left temporal lobe excisions, on the other hand, was significantly impaired for both content and source information in most tasks. Furthermore, these subjects showed high rates of inaccuracies and distortions of memory. The false memories of this patient group were attributed to a combination of their poor memory for the specific items of the task and their over-reliance on semantic "gist" or on inferential knowledge about the events. Patients with right temporal lobe excisions were generally less severely impaired on the verbal memory tasks compared with those with left-sided lesions, but were impaired in their memory for the contextual aspects of an event.
19

Artchounin, Daniel. "Tuning of machine learning algorithms for automatic bug assignment". Thesis, Linköpings universitet, Programvara och system, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-139230.

Abstract:
In software development projects, bug triage consists mainly of assigning bug reports to software developers or teams (depending on the project). Partially or fully automating this task would have a positive economic impact on many software projects. This thesis introduces a systematic four-step method for finding some of the best configurations of several machine learning algorithms for the automatic bug assignment problem. The four steps are used, respectively, to select a combination of pre-processing techniques, a bug report representation, and a potential feature selection technique, and to tune several classifiers. The method has been applied to three software projects: 66 066 bug reports of a proprietary project, 24 450 bug reports of Eclipse JDT and 30 358 bug reports of Mozilla Firefox. A total of 619 configurations have been applied and compared on each of these three projects. In production, using the approach introduced in this work on the bug reports of the proprietary project would have increased the accuracy by up to 16.64 percentage points.
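One way to picture a step-by-step configuration search of this kind is a greedy staged search, in which each stage fixes the best option given the choices already made. The abstract does not say how the four steps are combined, so the greedy strategy, the option names, and the toy `evaluate` function below are assumptions made purely for illustration; in practice `evaluate` would train and validate a classifier on real bug reports.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Config = Dict[str, str]


def staged_search(
    stages: Sequence[Tuple[str, Sequence[str]]],
    evaluate: Callable[[Config], float],
) -> Tuple[Config, List[Tuple[str, str, float]]]:
    """Greedily pick the best option at each stage, keeping earlier choices fixed."""
    chosen: Config = {}
    history: List[Tuple[str, str, float]] = []
    for name, options in stages:
        scored = [(evaluate({**chosen, name: opt}), opt) for opt in options]
        best_score, best_opt = max(scored, key=lambda pair: pair[0])
        chosen[name] = best_opt
        history.append((name, best_opt, best_score))
    return chosen, history


# Hypothetical stages and options, loosely mirroring the four steps in the abstract.
STAGES = [
    ("preprocessing", ["none", "lowercase", "lowercase+stem"]),
    ("representation", ["tf", "tf-idf"]),
    ("feature_selection", ["none", "chi2"]),
    ("classifier", ["naive_bayes", "linear_svm"]),
]

# Toy scoring table standing in for a real cross-validated accuracy (not real results).
TOY_BONUS = {"lowercase": 0.02, "lowercase+stem": 0.03, "tf-idf": 0.05,
             "chi2": 0.01, "linear_svm": 0.04}


def toy_evaluate(config: Config) -> float:
    """Placeholder score: a base value plus a made-up bonus per selected option."""
    return 0.5 + sum(TOY_BONUS.get(value, 0.0) for value in config.values())


best_config, search_history = staged_search(STAGES, toy_evaluate)
```

A staged search evaluates only the sum of the option counts (9 evaluations here) rather than their product (24), at the price of possibly missing interactions between stages.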
20

Chia-Hao Liu and 劉家豪. "Using Feature-Based Pre-clustering Method for Increasing the Accuracy of Image Retrieval". Thesis, 2014. http://ndltd.ncl.edu.tw/handle/8v64d4.

Abstract:
Master's
National Cheng Kung University
Institute of Manufacturing Information and Systems
103
Based on feature matching, an image retrieval system finds similar images in a large database by using a distance measure to examine the similarity of feature information between two images. Traditionally, brute-force search is used to match feature points one by one. However, this is very time-consuming and inefficient for image retrieval. To deal with this problem, a tree-like data structure built from a set of training features is used. The idea is to use a hierarchical clustering algorithm to merge similar training features into tree-structured groups. Since each leaf of the tree contains a group of similar features, the range over which training features and query features must be compared can be narrowed down by tree traversal. Although a tree structure of training features can reduce feature matching time efficiently, more search time is often spent repeating the same path through the tree as the number of features extracted from the query image increases. Hence, based on a non-hierarchical clustering algorithm, we propose the Feature-Based Pre-clustering Method, which clusters similar descriptors of query features before feature matching. Compared with the method without pre-clustering, the results show that using K-Means as the pre-clustering method provides a similar average accuracy in a shorter retrieval time.
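The pre-clustering idea can be sketched as follows: group the query descriptors with K-Means, look up the nearest training feature once per cluster centroid, and let every query descriptor inherit its cluster's match. This is a minimal sketch under assumptions, not the thesis's implementation: the function names are hypothetical, K-Means is initialised deterministically from the first `k` points, and a brute-force nearest-neighbour lookup stands in for the tree traversal described above.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def assign(points: Sequence[Vector], centroids: Sequence[Vector]) -> List[int]:
    """Label each point with the index of its nearest centroid."""
    return [min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
            for p in points]


def kmeans(points: Sequence[Vector], k: int, iterations: int = 10) -> Tuple[List[List[float]], List[int]]:
    """Plain Lloyd's K-Means, initialised from the first k points (assumption)."""
    centroids = [list(p) for p in points[:k]]
    for _ in range(iterations):
        labels = assign(points, centroids)
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, assign(points, centroids)


def match_with_preclustering(query: Sequence[Vector], training: Sequence[Vector], k: int) -> List[int]:
    """Return, for each query descriptor, the index of the training feature matched to its cluster."""
    centroids, labels = kmeans(query, k)
    # One lookup per centroid instead of one per query descriptor.
    nearest = [min(range(len(training)), key=lambda t: math.dist(c, training[t]))
               for c in centroids]
    return [nearest[label] for label in labels]
```

With `k` clusters, the number of lookups against the training set drops from the number of query descriptors to `k`, which is where the time saving would come from; the accuracy cost depends on how tight the clusters are.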
21

Chuang, Po-Yu, and 莊博宇. "Enhancing Time-efficiency and Accuracy of Patent Prior Art Retrieval Based on Claim Structure". Thesis, 2013. http://ndltd.ncl.edu.tw/handle/02995782717233923789.

Abstract:
Master's
National Tsing Hua University
Institute of Service Science
101
Patents are among the most valuable forms of intellectual property. They not only protect inventions and ideas but are also an economic resource for enterprises. For this reason, it is important for enterprises to detect patent infringement in order to avoid legal and economic risks. The main task of patent prior art search is to identify prior patents that are relevant to a patent application. Thus, a system for retrieving patent prior art is expected to obtain the set of patents that are most relevant to a target patent. Building on existing work that incorporates patent claim structure into prior art retrieval, this research enhances the time efficiency and accuracy of the patent prior art retrieval task by using cloud computing and semantic term expansion. The major contribution of this research is an enhanced prior art retrieval system, in terms of time efficiency and accuracy, which makes it operationally viable for enterprises or patent analysts to identify the most relevant prior patents when filing or approving a new patent.
22

Wang, Jen-Shiang, and 王禎祥. "Accuracy Analysis of Ellipsometry Based Multi-layer Film Measurement Technique-Incident Angle Verification and Retrieval". Thesis, 1998. http://ndltd.ncl.edu.tw/handle/12628102973832997203.

Abstract:
Master's
National Taiwan University
Institute of Applied Mechanics
86
Ellipsometers have long been used as a major tool for studying the index of refraction and film thickness of multi-layer structures. Two major properties of ellipsometers are worth mentioning. Firstly, as the physical quantities of interest, including the index of refraction and film thickness, cannot be measured directly, a theoretical multi-layer structure model and a numerical inversion algorithm must be used to retrieve these physical quantities. Secondly, the accuracy of the angle of incidence strongly affects the accuracy of these physical quantities. An incident-angle-retrieval algorithm was devised in this thesis as part of the inversion algorithm to further improve the accuracy of the index of refraction and the film thickness and to compensate for the error induced by any small errors in the supporting mechanical structures. Starting from fundamental optical theory and the advantages and disadvantages of several powerful numerical inversion algorithms, these numerical algorithms were used to simulate many measurement conditions. They were used to evaluate the accuracy of the index of refraction and the film thickness, and to determine whether the newly devised incident-angle-retrieval algorithm can further increase the accuracy of the physical quantities to be retrieved. The simulation results with the new algorithm show significant improvement in accuracy.
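To make the dependence on the angle of incidence concrete, the sketch below uses the standard Fresnel reflection coefficients for a bare interface and the ellipsometric ratio rho = r_p / r_s = tan(Psi) * exp(i * Delta). It is a deliberately simplified illustration, not the thesis's algorithm: it assumes a single film-free ambient/substrate interface with a known substrate index (the thesis treats multi-layer structures), and it recovers the angle by a plain grid search. The function names, the search range, and the example index are assumptions.

```python
import cmath
import math


def rho(angle_deg: float, n_ambient: complex, n_substrate: complex) -> complex:
    """Ratio r_p / r_s = tan(Psi) * exp(i * Delta) for a bare, film-free interface."""
    theta0 = math.radians(angle_deg)
    cos0 = math.cos(theta0)
    # Snell's law gives the (possibly complex) cosine inside the substrate.
    sin1 = n_ambient * math.sin(theta0) / n_substrate
    cos1 = cmath.sqrt(1 - sin1 * sin1)
    r_p = (n_substrate * cos0 - n_ambient * cos1) / (n_substrate * cos0 + n_ambient * cos1)
    r_s = (n_ambient * cos0 - n_substrate * cos1) / (n_ambient * cos0 + n_substrate * cos1)
    return r_p / r_s


def retrieve_angle(rho_measured: complex, n_ambient: complex, n_substrate: complex,
                   lo: float = 40.0, hi: float = 85.0, step: float = 0.01) -> float:
    """Grid search for the incidence angle (degrees) whose modelled rho best fits the measurement."""
    steps = int(round((hi - lo) / step))
    candidates = (lo + i * step for i in range(steps + 1))
    return min(candidates,
               key=lambda a: abs(rho(a, n_ambient, n_substrate) - rho_measured))


# Simulated measurement at a nominal 70 degrees on an assumed substrate index.
N_SUBSTRATE = 3.87 - 0.02j  # illustrative value, not taken from the thesis
measured = rho(70.0, 1.0, N_SUBSTRATE)
recovered = retrieve_angle(measured, 1.0, N_SUBSTRATE)
```

Psi and Delta follow from a computed ratio as `math.atan(abs(r))` and `cmath.phase(r)`. In a real instrument the angle, the indices, and the thicknesses would have to be fitted together from several measurements, which is the harder problem the abstract refers to.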
23

"Analysis of information features in natural language queries for music information retrieval: Use patterns and accuracy". UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN, 2009. http://pqdtopen.proquest.com/#viewpdf?dispub=3337875.

24

Lee, Jin Ha. "Analysis of information features in natural language queries for music information retrieval : use patterns and accuracy /". 2008. http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqdiss&rft_dat=xri:pqdiss:3337875.

Abstract:
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2008.
Source: Dissertation Abstracts International, Volume: 69-11, Section: A, page: 4167. Adviser: Allen Renear. Includes bibliographical references (leaves 192-207). Available on microfilm from ProQuest Information and Learning.
25

Fan, Yu-Hao, and 范育豪. "Enhancing the Accuracy of the Template Matching for Content-Based Image Retrieval Based on Support Vector Machine". Thesis, 2013. http://ndltd.ncl.edu.tw/handle/99723616790989243873.

Abstract:
Master's
National Chiao Tung University
Institute of Electrical and Control Engineering
102
In this thesis, we propose a content-based image retrieval method to enhance the accuracy of acquiring images from large image databases. The term “content” refers to the information that can be extracted from the image itself. The proposed method combines template matching and a support vector machine. Given the content of the images, the similarities among them can be measured according to the chi-square distance. The advantage of template matching is that prior knowledge of the databases is unnecessary. However, the variance among images of the same object, called within-class variance, can affect the similarities among images and lead to wrong query results. To solve this problem, a support vector machine is introduced in the second stage. A support vector machine can classify images into two categories. We select the training images for the support vector machine based on the similarities computed in the first stage. The support vector machine is able to generate non-linear decision boundaries that can reduce the effect of within-class variance. In this thesis, we used the Caltech-101 database for experiments. The experimental results show that the proposed content-based image retrieval method has better accuracy than template matching.
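The first stage described above can be sketched as a ranking of database images by chi-square distance between feature histograms. The sketch below is illustrative only: it uses one common form of the chi-square distance (with a factor of 1/2; other variants exist), and the histogram values, function names, and the rule of taking the top-ranked and bottom-ranked images as training sets for the second stage are assumptions, not details from the thesis. The support vector machine training itself is left out.

```python
from typing import Dict, List, Sequence, Tuple


def chi_square_distance(h1: Sequence[float], h2: Sequence[float], eps: float = 1e-12) -> float:
    """Chi-square distance between two histograms of equal length (0 means identical)."""
    return 0.5 * sum((a - b) ** 2 / (a + b + eps) for a, b in zip(h1, h2))


def rank_by_similarity(query_hist: Sequence[float],
                       database: Dict[str, Sequence[float]]) -> List[str]:
    """Image names ordered from most to least similar to the query."""
    return sorted(database, key=lambda name: chi_square_distance(query_hist, database[name]))


def select_training_sets(ranked: List[str], n_pos: int, n_neg: int) -> Tuple[List[str], List[str]]:
    """Assumed selection rule: the closest images act as positives, the farthest as negatives."""
    return ranked[:n_pos], ranked[-n_neg:]


# Made-up three-bin histograms for illustration.
database = {
    "a": [0.5, 0.5, 0.0],
    "b": [0.4, 0.4, 0.2],
    "c": [0.0, 0.0, 1.0],
}
ranked = rank_by_similarity([0.5, 0.5, 0.0], database)
positives, negatives = select_training_sets(ranked, 1, 1)
```

The two selected sets would then be passed to a binary classifier such as a support vector machine, which re-scores the candidates in the second stage.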