Dissertations / Theses on the topic 'Label selection'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 28 dissertations / theses for your research on the topic 'Label selection.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Jungjit, Suwimol. "New multi-label correlation-based feature selection methods for multi-label classification and application in bioinformatics." Thesis, University of Kent, 2016. https://kar.kent.ac.uk/58873/.
Full textGustafsson, Robin. "Ordering Classifier Chains using filter model feature selection techniques." Thesis, Blekinge Tekniska Högskola, Institutionen för datalogi och datorsystemteknik, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:bth-14817.
Full textSandrock, Trudie. "Multi-label feature selection with application to musical instrument recognition." Thesis, Stellenbosch : Stellenbosch University, 2013. http://hdl.handle.net/10019/11071.
Full textENGLISH ABSTRACT: An area of data mining and statistics that is currently receiving considerable attention is the field of multi-label learning. Problems in this field are concerned with scenarios where each data case can be associated with a set of labels instead of only one. In this thesis, we review the field of multi-label learning and discuss the lack of suitable benchmark data available for evaluating multi-label algorithms. We propose a technique for simulating multi-label data, which allows good control over different data characteristics and which could be useful for conducting comparative studies in the multi-label field. We also discuss the explosion in data in recent years, and highlight the need for some form of dimension reduction in order to alleviate some of the challenges presented by working with large datasets. Feature (or variable) selection is one way of achieving dimension reduction, and after a brief discussion of different feature selection techniques, we propose a new technique for feature selection in a multi-label context, based on the concept of independent probes. This technique is empirically evaluated by using simulated multi-label data and it is shown to achieve classification accuracy with a reduced set of features similar to that achieved with a full set of features. The proposed technique for feature selection is then also applied to the field of music information retrieval (MIR), specifically the problem of musical instrument recognition. An overview of the field of MIR is given, with particular emphasis on the instrument recognition problem. The particular goal of (polyphonic) musical instrument recognition is to automatically identify the instruments playing simultaneously in an audio clip, which is not a simple task. We specifically consider the case of duets – in other words, where two instruments are playing simultaneously – and approach the problem as a multi-label classification one. In our empirical study, we illustrate the complexity of musical instrument data and again show that our proposed feature selection technique is effective in identifying relevant features and thereby reducing the complexity of the dataset without negatively impacting on performance.
AFRIKAANSE OPSOMMING: ‘n Area van dataontginning en statistiek wat tans baie aandag ontvang, is die veld van multi-etiket leerteorie. Probleme in hierdie veld beskou scenarios waar elke datageval met ‘n stel etikette geassosieer kan word, instede van slegs een. In hierdie skripsie gee ons ‘n oorsig oor die veld van multi-etiket leerteorie en bespreek die gebrek aan geskikte standaard datastelle beskikbaar vir die evaluering van multi-etiket algoritmes. Ons stel ‘n tegniek vir die simulasie van multi-etiket data voor, wat goeie kontrole oor verskillende data eienskappe bied en wat nuttig kan wees om vergelykende studies in die multi-etiket veld uit te voer. Ons bespreek ook die onlangse ontploffing in data, en beklemtoon die behoefte aan ‘n vorm van dimensie reduksie om sommige van die uitdagings wat deur sulke groot datastelle gestel word die hoof te bied. Veranderlike seleksie is een manier van dimensie reduksie, en na ‘n vlugtige bespreking van verskillende veranderlike seleksie tegnieke, stel ons ‘n nuwe tegniek vir veranderlike seleksie in ‘n multi-etiket konteks voor, gebaseer op die konsep van onafhanklike soek-veranderlikes. Hierdie tegniek word empiries ge-evalueer deur die gebruik van gesimuleerde multi-etiket data en daar word gewys dat dieselfde klassifikasie akkuraatheid behaal kan word met ‘n verminderde stel veranderlikes as met die volle stel veranderlikes. Die voorgestelde tegniek vir veranderlike seleksie word ook toegepas in die veld van musiek dataontginning, spesifiek die probleem van die herkenning van musiekinstrumente. ‘n Oorsig van die musiek dataontginning veld word gegee, met spesifieke klem op die herkenning van musiekinstrumente. Die spesifieke doel van (polifoniese) musiekinstrument-herkenning is om instrumente te identifiseer wat saam in ‘n oudiosnit speel. Ons oorweeg spesifiek die geval van duette – met ander woorde, waar twee instrumente saam speel – en hanteer die probleem as ‘n multi-etiket klassifikasie een. In ons empiriese studie illustreer ons die kompleksiteit van musiekinstrumentdata en wys weereens dat ons voorgestelde veranderlike seleksie tegniek effektief daarin slaag om relevante veranderlikes te identifiseer en sodoende die kompleksiteit van die datastel te verminder sonder ‘n negatiewe impak op klassifikasie akkuraatheid.
Paredes, Zevallos Daniel Leoncio. "Multi-scale image inpainting with label selection based on local statistics." Master's thesis, Pontificia Universidad Católica del Perú, 2014. http://tesis.pucp.edu.pe/repositorio/handle/123456789/5578.
Full textTesis
Duncan, Alyssa Renee. ""Nutrition facts" label use in the selection of healthier foods by undergraduate students." FIU Digital Commons, 1996. http://digitalcommons.fiu.edu/etd/3239.
Full textGonzalez, Lopez Jorge. "Distributed multi-label learning on Apache Spark." VCU Scholars Compass, 2019. https://scholarscompass.vcu.edu/etd/5775.
Full textLu, Tien-hsin. "SqueezeFit Linear Program: Fast and Robust Label-aware Dimensionality Reduction." The Ohio State University, 2020. http://rave.ohiolink.edu/etdc/view?acc_num=osu1587156777565173.
Full textGharroudi, Ouadie. "Ensemble multi-label learning in supervised and semi-supervised settings." Thesis, Lyon, 2017. http://www.theses.fr/2017LYSE1333/document.
Full textMulti-label learning is a specific supervised learning problem where each instance can be associated with multiple target labels simultaneously. Multi-label learning is ubiquitous in machine learning and arises naturally in many real-world applications such as document classification, automatic music tagging and image annotation. In this thesis, we formulate the multi-label learning as an ensemble learning problem in order to provide satisfactory solutions for both the multi-label classification and the feature selection tasks, while being consistent with respect to any type of objective loss function. We first discuss why the state-of-the art single multi-label algorithms using an effective committee of multi-label models suffer from certain practical drawbacks. We then propose a novel strategy to build and aggregate k-labelsets based committee in the context of ensemble multi-label classification. We then analyze the effect of the aggregation step within ensemble multi-label approaches in depth and investigate how this aggregation impacts the prediction performances with respect to the objective multi-label loss metric. We then address the specific problem of identifying relevant subsets of features - among potentially irrelevant and redundant features - in the multi-label context based on the ensemble paradigm. Three wrapper multi-label feature selection methods based on the Random Forest paradigm are proposed. These methods differ in the way they consider label dependence within the feature selection process. Finally, we extend the multi-label classification and feature selection problems to the semi-supervised setting and consider the situation where only few labelled instances are available. We propose a new semi-supervised multi-label feature selection approach based on the ensemble paradigm. The proposed model combines ideas from co-training and multi-label k-labelsets committee construction in tandem with an inner out-of-bag label feature importance evaluation. Satisfactorily tested on several benchmark data, the approaches developed in this thesis show promise for a variety of applications in supervised and semi-supervised multi-label learning
Narassiguin, Anil. "Apprentissage Ensembliste, Étude comparative et Améliorations via Sélection Dynamique." Thesis, Lyon, 2018. http://www.theses.fr/2018LYSE1075/document.
Full textEnsemble methods has been a very popular research topic during the last decade. Their success arises largely from the fact that they offer an appealing solution to several interesting learning problems, such as improving prediction accuracy, feature selection, metric learning, scaling inductive algorithms to large databases, learning from multiple physically distributed data sets, learning from concept-drifting data streams etc. In this thesis, we first present an extensive empirical comparison between nineteen prototypical supervised ensemble learning algorithms, that have been proposed in the literature, on various benchmark data sets. We not only compare their performance in terms of standard performance metrics (Accuracy, AUC, RMS) but we also analyze their kappa-error diagrams, calibration and bias-variance properties. We then address the problem of improving the performances of ensemble learning approaches with dynamic ensemble selection (DES). Dynamic pruning is the problem of finding given an input x, a subset of models among the ensemble that achieves the best possible prediction accuracy. The idea behind DES approaches is that different models have different areas of expertise in the instance space. Most methods proposed for this purpose estimate the individual relevance of the base classifiers within a local region of competence usually given by the nearest neighbours in the euclidean space. We propose and discuss two novel DES approaches. The first, called ST-DES, is designed for decision tree based ensemble models. This method prunes the trees using an internal supervised tree-based metric; it is motivated by the fact that in high dimensional data sets, usual metrics like euclidean distance suffer from the curse of dimensionality. The second approach, called PCC-DES, formulates the DES problem as a multi-label learning task with a specific loss function. Labels correspond to the base classifiers and multi-label training examples are formed based on the ability of each classifier to correctly classify each original training example. This allows us to take advantage of recent advances in the area of multi-label learning. PCC-DES works on homogeneous and heterogeneous ensembles as well. Its advantage is to explicitly capture the dependencies between the classifiers predictions. These algorithms are tested on a variety of benchmark data sets and the results demonstrate their effectiveness against competitive state-of-the-art alternatives
Kraus, Vivien. "Apprentissage semi-supervisé pour la régression multi-labels : application à l’annotation automatique de pneumatiques." Thesis, Lyon, 2021. https://tel.archives-ouvertes.fr/tel-03789608.
Full textWith the advent and rapid growth of digital technologies, data has become a precious asset as well as plentiful. However, with such an abundance come issues about data quality and labelling. Because of growing numbers of available data volumes, while human expert labelling is still important, it is more and more necessary to reinforce semi-supervised learning with the exploitation of unlabeled data. This problem is all the more noticeable in the multi-label learning framework, and in particular for regression, where each statistical unit is guided by many different targets, taking the form of numerical scores. This thesis focuses on this fundamental framework. First, we begin by proposing a method for semi-supervised regression, that we challenge through a detailed experimental study. Thanks to this new method, we present a second contribution, more fitted to the multi-label framework. We also show its efficiency with a comparative study on literature data sets. Furthermore, the problem dimension is always a pain point of machine learning, and reducing it sparks the interest of many researchers. Feature selection is one of the major tasks addressing this problem, and we propose to study it here in a complex framework : for semi-supervised, multi-label regression. Finally, an experimental validation is proposed on a real problem about automatic annotation of tires, to tackle the needs expressed by the industrial partner of this thesis
Spolaôr, Newton. "Seleção de atributos para aprendizagem multirrótulo." Universidade de São Paulo, 2014. http://www.teses.usp.br/teses/disponiveis/55/55134/tde-25032015-160505/.
Full textIrrelevant and/or redundant features in data can deteriorate the performance of the classifiers built from this data by machine learning algorithms. The aim of feature selection algorithms consists in identifying these features and removing them from data before constructing classifiers. Feature selection in single-label data, in which each instance in the training set is associated with only one label, has been widely studied in the literature. However, this is not the case for multi-label data, in which each instance is associated with a set of labels. Moreover, as multi-label data usually exhibit relationships among the labels in the set of labels, machine learning algorithms should take thiis relatinship into account. Therefore, label dependence should also be explored by multi-label feature selection algorithms. The filter approach is one of the most usual approaches considered by feature selection algorithms, as it has potentially lower computational cost than approaches and uses general properties from data to calculate feature importance measures, such as the feature-class correlation. The hypothesis of this work is that feature selection algorithms which consider label dependence will perform better than the ones that disregard label dependence. To this end, ths work proposes and develops filter approach multi-label feature selection algorithms which take into account relations among labels. In particular, we proposed two methods that take into account these relations by performing label construction and adapting the single-label feature selection algorith RelieF. These methods were experimentally evaluated showing good performance in terms of feature reduction and predictability of the classifiers built using the selected features.
Tan, Run Yan. "Active Learning using a Sample Selector Network." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-287312.
Full textI detta arbete sätter vi steget i en begränsad märkningsbudget och föreslår att vi använder ett provväljarnätverk för att lära och välja effektiva träningsprover, vars etiketter vi sedan skulle skaffa för att träna målmodellen som utför den nödvändiga maskininlärningsuppgiften. Vi antar att provfunktionerna, tillståndet för målmodellen och utbildningsförlusten för målmodellen är informativa för att träna provväljarnätverket. Dessutom uppskattar vi målmodellens tillstånd med dess mellanliggande och slutliga nätverksutgångar. Vi undersöker om provväljarnätverket enligt en begränsad märkningsbudget kan lära sig och välja utbildningsprover som tränar målmodellen minst lika effektivt som att använda en annan träningsdel av samma storlek som är enhetligt slumpmässigt samplad från hela utbildningsdatasystemet, det senare är det vanliga förfarandet som används för att utbilda maskininlärningsmodeller utan aktivt lärande. Vi hänvisar till denna vanliga procedur som den traditionella maskininlärning enhetliga slumpmässig sampling metod. Vi utför experiment på datasätten MNIST och CIFAR-10; och visa med empiriska bevis att under en begränsad märkningsbudget och vissa andra förhållanden, aktivt lärande med hjälp av ett provvalnätverk gör det möjligt för målmodellen att lära sig mer effektivt.
Ögren, Niklas. "Selecting/realization of Virtual Private Networks with Multiprotocol Label Switching or Virtual Local Area Networks." Thesis, KTH, Mikroelektronik och Informationsteknik, IMIT, 2002. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-93211.
Full textTolbert, Thomas J. (Thomas James) 1969. "Synthesis of RNA with selective isotopic labels for NMR structural studies." Thesis, Massachusetts Institute of Technology, 1998. http://hdl.handle.net/1721.1/50341.
Full textTomeš, Jan. "Analýza přesnosti výroby lamel formy pneumatiky vyráběných SLM technologií." Master's thesis, Vysoké učení technické v Brně. Fakulta strojního inženýrství, 2016. http://www.nusl.cz/ntk/nusl-241858.
Full textNorris, Maria. "Contesting identity and preventing belonging? : an analysis of British counter terrorism policy since the Terrorism Act 2000 and the selective use of the terrorism label by the British Government." Thesis, London School of Economics and Political Science (University of London), 2015. http://etheses.lse.ac.uk/3348/.
Full textTsai, Yue-Yang, and 蔡岳洋. "Semi-supervised Feature Selection Using Soft-label Information." Thesis, 2012. http://ndltd.ncl.edu.tw/handle/51541470430761590429.
Full text國立交通大學
資訊科學與工程研究所
100
Feature selection is an important task in machine learning. Practically, the quality of features affect the result of machine learning algorithms. In supervised feature selection, sufficient labeled data is necessary. However, labeling, a time-consuming process, is typically done manually. Conversely, unlabeled data is relatively easy to collect. Although unsupervised feature selection does not require labeled data, additional prior information should be considered when labeled data is available. Therefore, this paper proposes a semi-supervised feature selection algorithm to consider both labeled and unlabeled data. This proposed semi-supervised feature selection algorithm is called Soft-label semi-supervised feature selection algorithm. This algorithm applies Semi-supervised logistic regression algorithm to obtain soft-label information of unlabeled data, and applies proposed soft-label mutual information formula to combine label information and soft-label information to find the best feature subset. In the experimental section, we conduct experiments on several datasets, and experimental results indicate that the proposed algorithm can effectively improve classification performance.
Posinasetty, Anusha. "Multi-label Classification with Multiple Label Correlation Orders And Structures." Thesis, 2016. http://etd.iisc.ernet.in/2005/3719.
Full textMa, Long. "A Multi-label Text Classification Framework: Using Supervised and Unsupervised Feature Selection Strategy." 2017. http://scholarworks.gsu.edu/cs_diss/123.
Full textChang, Jen Fu, and 張仁輔. "Label-free Selection of Liver Cancer Stem Cells by Using Polyelectrolyte Multilayer Films." Thesis, 2013. http://ndltd.ncl.edu.tw/handle/50250671387046536855.
Full text長庚大學
生化與生醫工程研究所
101
The majority of hepatocellular carcinoma (HCC) patients present with an advanced stage for which chemotherapy and radiotherapy have limited efficacy. Early diagnosis and treatment of HCC remain challenging due to lack of highly specific and sensitive markers. Cancer stem cells (CSCs) were the source of many solid tumor types, including HCC. Currently, the latest cancer research was oriented toward cancer stem cells, to develop a new cancer therapy direction. The surface properties of materials could be regulated by layer-by-layer polyelectrolyte multilayer (PEM) films and to affect the cell behaviors. Therefore, by varying of polyelectrolyte materials and layer numbers, a series of microenvironment was establish, to control cell morphology and cell attachment. Previously, it was demonstrated that PEM films enable to select liver stem/progenitor cells. Thus, it is suggested that CSC colonies could be selected by the microenvironment controlled by PEM architecture. The aim of this study was to use glass as the based substrate, and to sequentially deposit positively charged (poly(allylamine hydrochloride ), PAH ) and negatively charged ( poly(sodium 4-styrene sulfonate ), PSS ) by using layer-by-layer technique to fabricate the series of PEM films. Liver cancer cell line (Huh7) were cultured on the series of PEM films, in order to establish a label free system for selection of liver cancer stem cells. Quartz crystal microbalance with dissipation sensor was used to investigate the oscillatory frequency and to analyze the dissipation of PEM films. Cell behaviors and colony formation were observed by using a optical microscope. In addition, the techniques for the cell cytotoxicity, CSCs marker expression including double staining of CD133/CD44 and CD133/EpCAM, drug sensitivity, and cell cycle determinant were used to analyze isolated cells from different PEM films. The results showed that the aggregation of cells were investigated obviously in which culture on (PAH/PSS)4-PAH and (PAH/PSS)6-PAH substrates. In addition, by using flow cytometry to determine the CSCs marker expression, the selected cells from the substrates of (PAH/PSS)4-PAH and (PAH/PSS)6-PAH displayed a high degree of CD133/CD44 expression and the percentage increased with the culture period. Furthermore, a commonly used anti-cancer drug, doxorubicin, was used to measure drug sensitivity of the selected cells on different substrates. It was demonstrated that cells selected from (PAH/PSS)4-PAH and (PAH/PSS)6-PAH displayed the low dependence for drug with the drug concentration. However, cell cycle assay revealed that most of the selected cells were arrested in the S-phase, suggesting the proliferation of the liver CSC. In conclusion, a series of microenvironments was constructed by PEM films which can select and purify CSCs. Besides, the surface characteristics of these PEM films were accounted for the relation between microenvironment and liver cancer stem cell. This system could be used for the drug screening and may provide the new strategy on developing the liver cancer therapy.
Tsai, Fu-Kun, and 蔡富焜. "The Study of Configure flash off label Equipment Selection Assessment-Taking a Large Department Stores as an Example." Thesis, 2014. http://ndltd.ncl.edu.tw/handle/skz8v4.
Full text中華科技大學
土木防災與管理碩士班
102
The topic of finding refuge is the most important work of a large department stores in the safety plan of fire protection. When fire disaster occurs in a large department stores, both user in how to use label equipment or avoid disaster information provided in the structural interior and user is to be accepted the information of finding refuge are the most important study topic in this article. The principal objective is to do a case study used a large department stores. Expert visiting, questionnaire, and analytical hierarchy process (AHP) are used to be inducted the configuration flash off label equipment and the paying attention to items in a large department stores. The present study results indicate that one is using flash and sound to offer the information of finding refuge. Another is to help the weak and the timid to leave the fire place in the most short time through the path of finding refuge. Furthermore, based on the results of the AHP to the data of questionnaire, the order of selection alternative of flash off label equipment is functionality. distinguishability, quality, construction method and cost. Based on the consideration of inducting finding refuge, the rank of adopting flash off label is R-type and P-type. Results of the present study may be provided the reference of selection for the flash off label equipment in the future.
Losik, Tatiana. "Your inner garden: children´s book project on the artificial selection of labels." Master's thesis, 2021. http://hdl.handle.net/10400.26/38774.
Full textThis master’s project aims to contribute to a global effort to change certain patterns of thinking, with an object of visual culture that is interesting for children and makes them aware that all the ideas about ourselves that we believe to be true can be challenged. Throughout life, people collect labels, and those labels influence how a person thinks about themselves and behaves based on that knowledge. But the way a person sees another person has nothing to do with who that person actually is. Because other people's ideas only exist in their perception of reality, based on their way of seeing the world and other factors. It is possible to rethink negative labels that have been attached to us in the past; to stop reproducing them in our minds; to think positive thoughts with kindness and appreciation for ourselves and others. The project is an illustrated picture book that interactively demonstrates research-based content in a way that a child can understand. Since we are part of an increasingly digital reality, you can access a mobile app inside the book where you can collect your accomplishments, even the smallest ones, to build positive self-esteem. The choice of content to be presented in the book and the way it is treated came from this pressing issue that is present in the daily lives of most people, with the age group of 7 years and older being the appropriate group. When I finished the book "Inner Garden,'' my colleague Joana Sofia dos Santos Guerreiro used it for her Master's thesis and presented it to children in the Portuguese primary school system to find out the children's reactions and opinions about the book.
WANG, XIAO-DONG, and 王曉棟. "Robust and Fast Feature Selection Methods for High-dimensional Data with Limited Labels." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/6ques9.
Full text朝陽科技大學
資訊管理系
107
Feature selection is one of the most representative techniques in the area of pattern recognition, which aims at filtering data attributes, removing redundant features and improving the performance of the follow-up classification or clustering tasks. In recent years, with increasingly powerful multimedia and computer technologies, high-dimensional data have been rapidly generated. As it is extremely expensive to collect sufficient labels for such a large amount of data, a growing number of data with few labels are presented, which presents a great challenge to existing feature selection methods. Therefore, how to design reasonable and effective feature selection model becomes more and more important for data with limited label information. Under such a circumstance, to meet the requirements of different scale of data with limited labels, this dissertation designs several semi-supervised and unsupervised feature selection algorithms and combines them with various kinds of applications, such as multi-label learning, multi-task learning, and clustering. Firstly, to handle small-scale high-dimensional data, we propose a semi-supervised feature learning model, where the Laplacian matrix construction is constraint by the -norm and is robust to outliers by removing redundant connects among nodes. Secondly, a semi-supervised feature selection model based on multi-task learning is proposed for the large-scale data. Such a model is independent on the graph construction and is able to explore the shared information among tasks by a low-rank regularization. Transferring the relevant information among tasks, it can properly preserve the most important features. Finally, for the large-scale high-dimensional data without labels, we propose a flexible objective function to adaptively perform feature learning with clustering, which is suitable for data with different kinds of distributions. Experimental results show that the proposed three models can efficiently select the most representative features with high accuracy over other classic algorithms in the limited-label scenario. What is more, the proposed models are general and can be extended to other applications.
Tiun, Ting-Kng, and 張呈光. "A Sequeacial Feature Selecting Strategy Based on Relevance Between Data Label and Principle Component." Thesis, 2017. http://ndltd.ncl.edu.tw/handle/bw8d5v.
Full text國立臺灣科技大學
資訊管理系
105
Binary classification method predicts the class of an object based on the associated feature vector. Traditional classification methods usually suffer from the high dimensionality of the feature vector, resulting in the need for decreasing feature vectors. There exist two major approaches to reducing the number of features. One is to select a subset of indigenous features which maintains the original meaning of each feature. The relevance among original features makes it difficult to find a proper subset of significant features from a large number of features, resorting to the need for random optimization algorithms. Another approach first transforms the original attributes to uncorrelated integrated features by the principal component analysis (PCA) and then sequentially search for the subset of significant integrated features. The second approach removes the relevance among integrated features, making the sequential search for the subset of significant integrated features feasible, while losing the interpret ability of significant features. In this study, we first transform the original features to uncorrelated integrated features by PCA and then rank the integrated features according to associated variances. To find the subset of significant integrated features, starting with the integrated features according to the corresponding ranks. For each subset of integrated features, a test score which is a linear combination of the integrated features is generated for classification. The coefficient on each integrated feature in the linear combination is determined such that the area under the Receiver Operating Characteristic(ROC) cure corresponding to the test score is maximized using the Genetic Algorithm(GA). Beside the self-developed classifier, we applied two other commonly used classifiers for comparison. Using the training data, the classification accuracy for each subset is evaluated and the subset with the largest classification accuracy is the final subset of significant integrated features used for classification. In addition to ranking the integrated features by the corresponding variances, we can also rank the integrated features by the corresponding Fisher Information, $R^2$ and AUC and then sequentially inflate the subset of integrated features according to the resulting ranks. Experimental results show that using Fisher Information has chances to get a better subset than merely PCA with variance. However, using PCA has a much consistant result. Using PCA can preduce a more consistance performance and more economy for calculating power. We assume that there are more to investigate further for the situation of using Fisher Information or other correlation methods as selection measurement to get a better classification performance than PCA variance.
Bouhlal, Yasser. "A Retrospective and Prospective Analysis of the Demand for Cheese Varieties in the United States." Thesis, 2012. http://hdl.handle.net/1969.1/ETD-TAMU-2012-05-10745.
Full textDasgupta, Sanjoy, Adam Tauman Kalai, and Claire Monteleoni. "Analysis of Perceptron-Based Active Learning." 2005. http://hdl.handle.net/1721.1/30585.
Full textChiu, Wan-Yu, and 邱婉瑜. "Developing Fuzzy DSS for Selecting Principal of Senior High School Using the Operation of 2-Tuples Fuzzy Linguistic Label." Thesis, 2002. http://ndltd.ncl.edu.tw/handle/14620714732693151630.
Full text國立雲林科技大學
資訊管理系碩士班
90
Abstract Currently, the selecting principal of senior high school has changed from assignment to selection. In this paper, our aims and contributions are: (1)Choice the criteria of selecting principal by surveying referenced literature. (2)Adopt questionnaire of fuzzy linguistic label to visit principals, teachers and education experts, and compare the three groups to get the difference between each other. (3)Use a new operation of 2-tuples fuzzy language label to calculate the weights of criteria and sub-criteria for principal candidates, and establish the algorithm of selecting principal. (4)In verification of selecting principal, by newly operation process, every school select 5 candidates in first stage exam; second stage take oral test by 15 experts. This paper illustrates an example from one senior high school to verify our proposed method. (5)In software system development, we use 2-tuples fuzzy linguistic label to develop fuzzy decision support system for selecting principal of senior high school. The developed DSS can support education institute and as a reference for selecting principal of senior high school.
Lording, William James. "A deeper understanding of the Diels–Alder reaction." Phd thesis, 2010. http://hdl.handle.net/1885/11776.
Full text