Dissertations / Theses on the topic 'Data representation'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Data representation.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Chintala, Venkatram Reddy. "Digital image data representation." Ohio : Ohio University, 1986. http://www.ohiolink.edu/etd/view.cgi?ohiou1183128563.
Full textLansley, Guy David. "Big data : geodemographics and representation." Thesis, University College London (University of London), 2018. http://discovery.ucl.ac.uk/10045119/.
Full textDos, Santos Ludovic. "Representation learning for relational data." Thesis, Paris 6, 2017. http://www.theses.fr/2017PA066480/document.
Full textThe increasing use of social and sensor networks generates a large quantity of data that can be represented as complex graphs. There are many tasks from information analysis, to prediction and retrieval one can imagine on those data where relation between graph nodes should be informative. In this thesis, we proposed different models for three different tasks: - Graph node classification - Relational time series forecasting - Collaborative filtering. All the proposed models use the representation learning framework in its deterministic or Gaussian variant. First, we proposed two algorithms for the heterogeneous graph labeling task, one using deterministic representations and the other one Gaussian representations. Contrary to other state of the art models, our solution is able to learn edge weights when learning simultaneously the representations and the classifiers. Second, we proposed an algorithm for relational time series forecasting where the observations are not only correlated inside each series, but also across the different series. We use Gaussian representations in this contribution. This was an opportunity to see in which way using Gaussian representations instead of deterministic ones was profitable. At last, we apply the Gaussian representation learning approach to the collaborative filtering task. This is a preliminary work to see if the properties of Gaussian representations found on the two previous tasks were also verified for the ranking one. The goal of this work was to then generalize the approach to more relational data and not only bipartite graphs between users and items
Dos, Santos Ludovic. "Representation learning for relational data." Electronic Thesis or Diss., Paris 6, 2017. http://www.theses.fr/2017PA066480.
Full textThe increasing use of social and sensor networks generates a large quantity of data that can be represented as complex graphs. There are many tasks from information analysis, to prediction and retrieval one can imagine on those data where relation between graph nodes should be informative. In this thesis, we proposed different models for three different tasks: - Graph node classification - Relational time series forecasting - Collaborative filtering. All the proposed models use the representation learning framework in its deterministic or Gaussian variant. First, we proposed two algorithms for the heterogeneous graph labeling task, one using deterministic representations and the other one Gaussian representations. Contrary to other state of the art models, our solution is able to learn edge weights when learning simultaneously the representations and the classifiers. Second, we proposed an algorithm for relational time series forecasting where the observations are not only correlated inside each series, but also across the different series. We use Gaussian representations in this contribution. This was an opportunity to see in which way using Gaussian representations instead of deterministic ones was profitable. At last, we apply the Gaussian representation learning approach to the collaborative filtering task. This is a preliminary work to see if the properties of Gaussian representations found on the two previous tasks were also verified for the ranking one. The goal of this work was to then generalize the approach to more relational data and not only bipartite graphs between users and items
Penton, Dave. "Linguistic data models : presentation and representation /." Connect to thesis, 2006. http://eprints.unimelb.edu.au/archive/00002875.
Full textSanches, Pedro. "Health Data : Representation and (In)visibility." Doctoral thesis, KTH, Programvaruteknik och Datorsystem, SCS, 2015. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-158909.
Full textFör att förstå hälsodata krävs sammanhang. Jag visar hur detta kan erhållas, genom två fallstudier: en om självövervakning, med fokus på representation av kroppsdata, samt en om massövervakning, med fokus på representation av populationer. Jag granskar kritiskt hur informationsteknologi (IT) kan fås att representera såväl individer som populationer och vilka följder det får. Mina bidrag är: (i) utformningen av ett självövervakningssystem för stresshantering, (ii) utformningen av ett massövervakningssystem baserat på data från mobiltelefonanvändning, (iii) en empirisk studie av hur användare av en hälsosensor begriper det data som sensorn genererar, (iv) en diskursiv analys av hur syndromövervakningssystem utformas, (v) en kritisk analys av processer kring att utforma ett massövervakningssystem, samt (vi) en analys av den historiska korrektheten i begrepp och beslutsfattande i samband med utformningen av ett stresshanteringssystem. Jag visar att produktion av hälsodata, liksom tekniska beskrivningar av de algoritmer som används i den processen, beror av faktorer som hänger samman med IT-utformningsprocessen. Dessa faktorer avgör sedan hur data kan fås att representera individer och populationer på sätt som kan rendera delar av en population, hälsodeterminanter, eller individens självuppfattning och förståelse av välmående osynliga. Jag visar också att arbetet med att producera data inte är avslutat i och med det ingenjörsarbete som krävs för att IT-systemen ska byggas: konstant underhåll krävs också.
QC 20150114
Parvathala, Rajeev (Rajeev Krishna). "Representation learning for non-sequential data." Thesis, Massachusetts Institute of Technology, 2018. http://hdl.handle.net/1721.1/119581.
Full textThis electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
Cataloged from student-submitted PDF version of thesis.
Includes bibliographical references (pages 85-90).
In this thesis, we design and implement new models to learn representations for sets and graphs. Typically, data collections in machine learning problems are structured as arrays or sequences, with sequential relationships between successive elements. Sets and graphs both break this common mold of data collections that have been extensively studied in the machine learning community. First, we formulate a new method for performing diverse subset selection using a neural set function approximation method. This method relies on the deep sets idea, which says that any set function s(X) has a universal approximator of the form f([sigma]x[xi]X [phi](x)). Second, we design a new variational autoencoding model for highly structured, sparse graphs, such as chemical molecules. This method uses the graphon, a probabilistic graphical model from mathematics, as inspiration for the decoder. Furthermore, an adversary is employed to force the distribution of vertex encodings to follow a target distribution, so that new graphs can be generated by sampling from this target distribution. Finally, we develop a new framework for performing encoding of graphs in a hierarchical manner. This approach partitions an input graph into multiple connected subgraphs, and creates a new graph where each node represents one such subgraph. This allows the model to learn a higher level representation for graphs, and increases robustness of graphical encoding to varying graph input sizes.
by Rajeev Parvathala.
M. Eng.
Andersson, Elin, and Hanna Bengtsson. "Geovisualisering: En rumslig representation av data." Thesis, Malmö universitet, Fakulteten för teknik och samhälle (TS), 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-43221.
Full textThe Internet of Things gives us the ability to identify, control and monitor objects around the world. In order to get meaning and knowledge from the amount of raw data, it needs to be presented in the right way for people to get insights from it. The study therefore examines whether geovisualization can better meet human cognitive ability in interpretation of information. Geovisualization means that spatial data can be explored on a map through an interactive display and is a link between the human decision-making process, interactive interfaces and data [21]. More research is needed in the area to investigate how geovisualization can take place in systems where large amounts of data needs to be presented and how it can support decision-making processes. The study aims to compare geovisualizations with an existing system that provides continuous updating and monitoring of network cameras by performing usability tests and interviews. Geovisualization has been investigated to see if it can contribute an increased understanding and better navigation in a space that mimics the physical world, as well as investigate potential problems to find future improvements. The results proved that navigation and information overload were recurring problems during the tests of the existing system. For the geovisualizations, the results proved the opposite as they instead facilitated the understanding of navigation and information. However, some problems were identified for the developed geovisualizations, such as its limited interaction and misinterpretations of objects. Despite this, it proved to be advantageous to place units in their real environment using geovisualization as it contributed to a better overview and understanding of the system's context.
Friedman, Marc T. "Representation and optimization for data integration /." Thesis, Connect to this title online; UW restricted, 1999. http://hdl.handle.net/1773/6979.
Full textJansson, Erika. "Data-model representation for non-programmers." Thesis, Uppsala universitet, Institutionen för informationsteknologi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-394277.
Full textDineva, A. A. "Non-conventional data representation and control." Doctoral thesis, Università degli Studi di Milano, 2017. http://hdl.handle.net/2434/487393.
Full textKarras, Panagiotis. "Data structures and algorithms for data representation in constrained environments." Thesis, Click to view the E-thesis via HKUTO, 2007. http://sunzi.lib.hku.hk/hkuto/record/B38897647.
Full textKalaiah, Aravind. "Visual data representation using context-aware Samples." College Park, Md. : University of Maryland, 2005. http://hdl.handle.net/1903/2465.
Full textThesis research directed by: Computer Science. Title from t.p. of PDF. Includes bibliographical references. Published by UMI Dissertation Services, Ann Arbor, Mich. Also available in paper.
Khanna, Rajiv. "Image data compression using multiple bases representation." Thesis, This resource online, 1990. http://scholar.lib.vt.edu/theses/available/etd-12302008-063722/.
Full textCottee, Michaela J. "The graphical representation of structured multivariate data." Thesis, Open University, 1996. http://oro.open.ac.uk/57616/.
Full textTodman, Christopher Derek. "The representation of time in data warehouses." Thesis, Open University, 1999. http://oro.open.ac.uk/58004/.
Full textOsborne, William George. "Data representation optimisation for reconfigurable hardware design." Thesis, Imperial College London, 2011. http://hdl.handle.net/10044/1/9044.
Full textNan, Lihao. "Privacy Preserving Representation Learning For Complex Data." Thesis, The University of Sydney, 2019. http://hdl.handle.net/2123/20662.
Full textMorell, Vicente. "Contributions to 3D Data Registration and Representation." Doctoral thesis, Universidad de Alicante, 2014. http://hdl.handle.net/10045/42364.
Full textUgail, Hassan, and Eyad Elyan. "Efficient 3D data representation for biometric applications." IOS Press, 2007. http://hdl.handle.net/10454/2683.
Full textAn important issue in many of today's biometric applications is the development of efficient and accurate techniques for representing related 3D data. Such data is often available through the process of digitization of complex geometric objects which are of importance to biometric applications. For example, in the area of 3D face recognition a digital point cloud of data corresponding to a given face is usually provided by a 3D digital scanner. For efficient data storage and for identification/authentication in a timely fashion such data requires to be represented using a few parameters or variables which are meaningful. Here we show how mathematical techniques based on Partial Differential Equations (PDEs) can be utilized to represent complex 3D data where the data can be parameterized in an efficient way. For example, in the case of a 3D face we show how it can be represented using PDEs whereby a handful of key facial parameters can be identified for efficient storage and verification.
Henning, Gustav. "Visualization of neural data : Dynamic representation and analysis of accumulated experimental data." Thesis, KTH, Skolan för datavetenskap och kommunikation (CSC), 2015. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-166770.
Full textDen vetenskapliga metoden är en integral del av undersökningen och utforskandet av hypoteser. Medan procedurer varierar mellan fält liknar de varandra i stora drag. Idag finns det ingen brist på verktyg som visualiserar data i olika grafiska kontexter. Istället fokuserar denna tes på de typ av verktyg som forskare använder för att undersöka integriteten av hypoteser. När tillräckligt med data samlats finns det olika sätt att presentera denna på ett meningsfullt sätt för att demonstrera mönster och avvikelser som skulle förbli osedda i endast siffror. Hurvida användbar statisk visualisering av data är som grafik till vetenskapliga rapporter gäller nödvändigtvis inte samma sak vid analys på grund av de många kombinationer av visualisering som ofta finns. Mjukvara kommer att introduceras för att demonstrera behovet av dynamisk representation vid analys av ackumulerad data för att påskynda upptäckten av mönster och avvikelser.
Li, Mingfei, and 李明飞. "Sparse representation and fast processing of massive data." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2012. http://hub.hku.hk/bib/B49617977.
Full textpublished_or_final_version
Computer Science
Master
Master of Philosophy
Xie, Hanting. "A generic data representation for predicting player behaviours." Thesis, University of York, 2017. http://etheses.whiterose.ac.uk/20137/.
Full textCheung, Jarvis T. "Representation and extraction of trends from process data." Thesis, Massachusetts Institute of Technology, 1992. http://hdl.handle.net/1721.1/13186.
Full textBaum, Robert Adam. "A tolerance representation scheme for solid models." Thesis, Georgia Institute of Technology, 1989. http://hdl.handle.net/1853/18180.
Full textCorreia, Filipe Laginha Pinto. "Cartographic representation of spatiotemporal phenomena." Master's thesis, Faculdade de Ciências e Tecnologia, 2013. http://hdl.handle.net/10362/11185.
Full textThe field of geovisual analytics focuses on visualization techniques to analyze spatial data by enhancing human cognition. However, spatial data also has a temporal component that is practically disregarded when using conventional geovisual analytic tools. Some proposals have been made for techniques to analyze spatiotemporal data, but most were made for specific use cases, and are hard to abstract for other situations. There was a need to create a method to describe and compare the existing techniques. A catalog that provides a clear description of a set of techniques that deal with spatiotemporal data is proposed. This allows the identification of the most useful techniques depending on the required criteria. The description of a technique in the catalog relies on the two frameworks proposed. The first framework is used for describing spatiotemporal datasets resorting to data scenarios, a class of datasets. Twenty three data scenarios are described using this framework. The second framework is used for describing analytical tasks on spatiotemporal data, nine different tasks are described using this framework. Also, in this document, is the proposal of two new geovisual analytical techniques that can be applied to spatiotemporal data: the attenuation & accumulation map technique and the overlapping spatiotemporal windows technique. A prototype was developed that implements both techniques as a proof of concept.
research project “GIAP - GeoInsight Analytics Platform (LISBOA-01-0202-FEDER- 024822)”, funded by Comissão de Coordenação e Desenvolvimento Regional de Lisboa e Vale do Tejo (PORLisboa), included in Sistema de Incentivos à Investigação e Desenvolvimento Tecnológico (SI I&DT), through a MSc research fellowship from FCT-UNL
Rahman, Md Anisur. "Tabular Representation of Schema Mappings: Semantics and Algorithms." Thèse, Université d'Ottawa / University of Ottawa, 2011. http://hdl.handle.net/10393/20032.
Full textMehta, Nishant A. "On sparse representations and new meta-learning paradigms for representation learning." Diss., Georgia Institute of Technology, 2013. http://hdl.handle.net/1853/52159.
Full textBrisson, Erik. "Representation of d-dimensional geometric objects /." Thesis, Connect to this title online; UW restricted, 1990. http://hdl.handle.net/1773/6903.
Full textGurung, Topraj. "Compact connectivity representation for triangle meshes." Diss., Georgia Institute of Technology, 2013. http://hdl.handle.net/1853/47709.
Full textTorres-Rojas, Francisco Jose. "Efficient time representation in distributed systems." Thesis, Georgia Institute of Technology, 1995. http://hdl.handle.net/1853/8301.
Full textBecek, Kazimierz. "Biomass Representation in Synthetic Aperture Radar Interferometry Data Sets." Doctoral thesis, Saechsische Landesbibliothek- Staats- und Universitaetsbibliothek Dresden, 2011. http://nbn-resolving.de/urn:nbn:de:bsz:14-qucosa-62707.
Full textAlami, Wassim T. (Wassim Tarek). "Multi-scale object representation and localization using range data." Thesis, McGill University, 1994. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=69782.
Full textA hierarchical ranking of these patches is then used to describe individual objects based on geometric information. These geometric descriptors are ranked according to several criteria expressing their estimated stability and utility.
Pose estimation is cast as an optimal matching problem. The geometric pose transformation between two views of a simple curved object is found by matching multi-scale descriptions corresponding to the two views. Different combinations of possible three patch correspondences are found and ranked between the two views and the position transformation (rotation and translation) is computed. The starting patches are constrained to be those with the most stable description. The cost of matching the two sets of representative patches based on the position transformation is computed. The final pose estimate is obtained from the correspondence that produces the best global consistency.
The algorithm's applicability to pose estimation is demonstrated by examples using real range data and its behaviour in the presence of noise is validated. Its use in object recognition is then discussed.
Lustosa, Hermano Lourenço Souza. "Managing numerical simulation data using a multidimensional array representation." Laboratório Nacional de Computação Científica, 2015. https://tede.lncc.br/handle/tede/250.
Full textApproved for entry into archive by Maria Cristina (library@lncc.br) on 2017-04-18T17:56:51Z (GMT) No. of bitstreams: 1 Dissertação (Hermano Lustosa).pdf: 11841214 bytes, checksum: c30da4b19ca9fd69bf262318a593729b (MD5)
Made available in DSpace on 2017-04-18T17:57:04Z (GMT). No. of bitstreams: 1 Dissertação (Hermano Lustosa).pdf: 11841214 bytes, checksum: c30da4b19ca9fd69bf262318a593729b (MD5) Previous issue date: 2015-12-09
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Scientific applications, such as numerical simulations, generate an ever increasing amount of data that needs to be eficiently managed. As most traditional row-store Database Management Systems are not tailored for the analytical workload usually required by such applications, alternative approaches, e. g., columnstore and multidimensional arrays, can offer better querying processing time. In this work, we propose new techniques for managing the data produced by numerical simulations, such as those coming from HeMoLab, by using multidimensional array technologies. We take advantage of multidimensional array that nicely models the dimensions and variables used in numerical simulations. The eficient mapping of the simulation output file onto a multi-dimensional array is not simple. A naive solution may lead to sparse arrays, impacting query response time, specially when the simulation uses irregular meshes to model its physical domain. We propose novel strategies to solve these problems by defining an eficient mapping of coordinate values in numerical simulations to evenly distribute cells in array chunks with the use of equi-depth histograms and space-filling curves. We evaluated our techniques through experiments over real-world data, comparing them with a columnar and a row-store relational systems. The results indicate that multidimensional arrays and column-stores are much faster than a tradivitional row-store system for queries issued over a larger amount of simulation data. Also, the results help to identify the scenarios in which using multidimensional arrays is the most eficient approach, and the ones in which they are outperformed by the relational column-store approach.
Aplicações científicas geram uma crescente massa de dados que precisam ser analisados e gerenciados eficientemente. Uma vez que os tradicionais bancos de dados relacionais não são projetados para a carga de trabalho predominantemente analítica exigida por essas aplicações, abordagens alternativas, tais como, matrizes multidimensionais e bancos de dados colunares, podem oferecer melhores tempos de execução de consultas. Neste trabalho, propomos o uso de novas tecnologias para a gerência de dados produzidos por simulações numéricas, similares às desenvolvidas pelo HeMoLab. O modelo de matrizes multidimensionais permite a modelagem elegante de dimensões e variáveis usadas em simulações numéricas. Entretanto, o mapeamento dos dados de saída de uma simulação em uma matriz multidimensional não é simples. Uma solução ingênua pode levar a criação de matrizes excessivamente esparsas, impactando o tempo de resposta do sistema, especialmente quando a simulação utiliza uma malha irregular para modelar o seu domínio físico. Nós propomos novas estratégias para resolver esses problemas através da definição de um mapeamento eficiente de valores de coordenadas com o uso de histogramas e curvas de preenchimento espacial. Nós avaliamos nossas técnicas através de experimentos feitos com dados reais, comparando-as com bancos de dados relacionais. Os resultados indicam que tanto iv matrizes multidimensionais quanto bancos de dados colunares são muitas vezes mais rápidos que bancos de dados relacionais tradicionais para consultas avaliando uma grande quantidade de dados. Além disso, os resultados auxiliam na identificação de cenários nos quais matrizes multidimensionais são mais eficientes, e nos quais elas são superadas por uma abordagem envolvendo o uso de um banco de dados colunar.
Houé, Maxime. "Clustering of short sentences through representation of text data." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-266121.
Full textNatural Language Processing har utvecklats de senaste åren mycket snabbt.Många nya applikationer uppstod av nya metoder, särskilt involverade i skapandet av det populära word embedding Word2Vec skapat av ett team av Googleforskare. En av dessa nya applikationer är chatbot-tekniken. Målet med dessa konversationsgränssnitt är att kunna kommunicera automatiskt med människor via skriftlig eller röstchatt. Med ett chatbot hoppas ett företag förbättra sina kundrelationer till en lägre kostnad. Tyvärr kan chatbots kompetens variera mycket, men till dess är deras förståelse för människorna ofta ganska dålig. Denna hårda slutsats leder till att undra hur chatbot-utvecklarna kan hjälpas för att hantera stora mängder användarförfrågningar som inte förstås av deras chatbot.Detta examensarbete gjordes i samarbete med en start-up som heter Askhub.Denna uppstart syftar till att hjälpa företagen att utveckla sin chatbot.Syftet med denna detta examensarbete är att föreslå ett klustringssystem för att klassificera data som inte förstås av en chatbot. Till att börja med har en studie av de olika metoderna för word embeddings gjorts, följt av en studie av olika klusteranalyser som är lämpliga för det valda word embedding. Resultaten jämförs sedan med vissa mätvärden och några förslag gjordes för att förbättraklusteranalysresultatet.
Sävhammar, Simon. "Uniform interval normalization : Data representation of sparse and noisy data sets for machine learning." Thesis, Högskolan i Skövde, Institutionen för informationsteknologi, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-19194.
Full textLundgren, Clara. "Female representation and public spending : Investigating female representation as a determinant of local expenditure patterns." Thesis, Uppsala universitet, Nationalekonomiska institutionen, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-435526.
Full textShi, Peiyang. "Faster Unsupervised Object Detection For Symbolic Representation." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-277852.
Full textUnder slutet av 1900-talet har forskning inom symbolisk artificiell intelligens ökat kraftigt. Områdena djupinlärning och djup förstärkningsinlärning har på senare tid gjort stora framsteg inom datorseende och robotapplikationer. Både områdena har gjort imponerande framsteg men finner sig i motsatta ändar av AI-forskningens spektrum. Mainstream djupinlärning bygger på automatisk extraktion utan vidare hänsyn till tolkningsbara symboler medan symbolisk AI ofta fokuserar på handgjorda symboler. I denna studie introducerar vi en djupinlärningsalgoritm för symbolrepresentationsinlärning. Algoritmen baseras på de senaste framstegen inom oövervakad objektdetektering och vi visar att den lätt kan anpassas för symbolisk representation. Vår algoritm, FaSPAIR, är en anpassning av algoritmen för objektdetektering, SPAIR. Vi har gjort flera förändringar för att kunna länka modellen till den symboliska representationen som behövs för förstärkningsinlärning samt för förbättring av träningshastigheten. Våra resultat visar verkan och effektiviteten av att använda objektdetektion för symbolisk representationsinlärning. Vi visar även att FaSPAIR ger stor förbättring i beräkningshastigheten jämfört med den toppmoderna algoritmen SPAIR.
Goebel, Randy. "A logic data model for the machine representation of knowledge." Thesis, University of British Columbia, 1985. http://hdl.handle.net/2429/25799.
Full textScience, Faculty of
Computer Science, Department of
Graduate
Barton, Louis W. G. "Theory of semantic data representation for non-determinate symbol systems." Thesis, University of Oxford, 2011. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.669946.
Full textHerzog, Erik. "An approach to systems engineering tool data representation and exchange." Doctoral thesis, Linköping : Univ, 2004. http://www.ep.liu.se/diss/science_technology/08/67/index.html.
Full textLodolini, Lucia. "The representation of symmetric patterns using the Quadtree data structure /." Online version of thesis, 1988. http://hdl.handle.net/1850/8402.
Full textRosen, Jonathan Adam. "Distortion correction and momentum representation of angle-resolved photoemission data." Thesis, University of British Columbia, 2008. http://hdl.handle.net/2429/5317.
Full textLucke, Helmut. "On the representation of temporal data for connectionist word recognition." Thesis, University of Cambridge, 1991. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.239520.
Full textShen, Yuming. "Deep binary representation learning for single/cross-modal data retrieval." Thesis, University of East Anglia, 2018. https://ueaeprints.uea.ac.uk/67635/.
Full textMohammed, Ayat Mohammed Naguib. "High-dimensional Data in Scientific Visualization: Representation, Fusion and Difference." Diss., Virginia Tech, 2017. http://hdl.handle.net/10919/78343.
Full textPh. D.
Srivastava, Arunima. "Univariate and Multivariate Representation and Modeling of Cancer Biomedical Data." The Ohio State University, 2020. http://rave.ohiolink.edu/etdc/view?acc_num=osu1577717365850367.
Full textLaforgue, Pierre. "Deep kernel representation learning for complex data and reliability issues." Thesis, Institut polytechnique de Paris, 2020. http://www.theses.fr/2020IPPAT006.
Full textThe first part of this thesis aims at exploring deep kernel architectures for complex data. One of the known keys to the success of deep learning algorithms is the ability of neural networks to extract meaningful internal representations. However, the theoretical understanding of why these compositional architectures are so successful remains limited, and deep approaches are almost restricted to vectorial data. On the other hand, kernel methods provide with functional spaces whose geometry are well studied and understood. Their complexity can be easily controlled, by the choice of kernel or penalization. In addition, vector-valued kernel methods can be used to predict kernelized data. It then allows to make predictions in complex structured spaces, as soon as a kernel can be defined on it.The deep kernel architecture we propose consists in replacing the basic neural mappings functions from vector-valued Reproducing Kernel Hilbert Spaces (vv-RKHSs). Although very different at first glance, the two functional spaces are actually very similar, and differ only by the order in which linear/nonlinear functions are applied. Apart from gaining understanding and theoretical control on layers, considering kernel mappings allows for dealing with structured data, both in input and output, broadening the applicability scope of networks. We finally expose works that ensure a finite dimensional parametrization of the model, opening the door to efficient optimization procedures for a wide range of losses.The second part of this thesis investigates alternatives to the sample mean as substitutes to the expectation in the Empirical Risk Minimization (ERM) paradigm. Indeed, ERM implicitly assumes that the empirical mean is a good estimate of the expectation. However, in many practical use cases (e.g. heavy-tailed distribution, presence of outliers, biased training data), this is not the case.The Median-of-Means (MoM) is a robust mean estimator constructed as follows: the original dataset is split into disjoint blocks, empirical means on each block are computed, and the median of these means is finally returned. We propose two extensions of MoM, both to randomized blocks and/or U-statistics, with provable guarantees. By construction, MoM-like estimators exhibit interesting robustness properties. This is further exploited by the design of robust learning strategies. The (randomized) MoM minimizers are shown to be robust to outliers, while MoM tournament procedure are extended to the pairwise setting.We close this thesis by proposing an ERM procedure tailored to the sample bias issue. If training data comes from several biased samples, computing blindly the empirical mean yields a biased estimate of the risk. Alternatively, from the knowledge of the biasing functions, it is possible to reweight observations so as to build an unbiased estimate of the test distribution. We have then derived non-asymptotic guarantees for the minimizers of the debiased risk estimate thus created. The soundness of the approach is also empirically endorsed
Lacerda, Fred W. "Comparative advantages of graphic versus numeric representation of quantitative data." Diss., Virginia Polytechnic Institute and State University, 1986. http://hdl.handle.net/10919/49817.
Full textPech, Palacio Manuel Alfredo. "Spatial data modeling and mining using a graph-based representation." Lyon, INSA, 2005. http://theses.insa-lyon.fr/publication/2005ISAL0118/these.pdf.
Full textWe propose a unique graph-based model to represent spatial data, non-spatial data and the spatial relations among spatial objects. We will generate datasets composed of graphs with a set of these three elements. We consider that by mining a dataset with these characteristics a graph-based mining tool can search patterns involving all these elements at the same time improving the results of the spatial analysis task. A significant characteristic of spatial data is that the attributes of the neighbors of an object may have an influence on the object itself. So, we propose to include in the model three relationship types (topological, orientation, and distance relations). In the model the spatial data (i. E. Spatial objects), non-spatial data (i. E. Non-spatial attributes), and spatial relations are represented as a collection of one or more directed graphs. A directed graph contains a collection of vertices and edges representing all these elements. Vertices represent either spatial objects, spatial relations between two spatial objects (binary relation), or non-spatial attributes describing the spatial objects. Edges represent a link between two vertices of any type. According to the type of vertices that an edge joins, it can represent either an attribute name or a spatial relation name. The attribute name can refer to a spatial object or a non-spatial entity. We use directed edges to represent directional information of relations among elements (i. E. Object x touches object y) and to describe attributes about objects (i. E. Object x has attribute z). We propose to adopt the Subdue system, a general graph-based data mining system developed at the University of Texas at Arlington, as our mining tool. A special feature named overlap has a primary role in the substructures discovery process and consequently a direct impact over the generated results. However, it is currently implemented in an orthodox way: all or nothing. Therefore, we propose a third approach: limited overlap, which gives the user the capability to set over which vertices the overlap will be allowed. We visualize directly three motivations issues to propose the implementation of the new algorithm: search space reduction, processing time reduction, and specialized overlapping pattern oriented search