Dissertations / Theses on the topic 'Statistical graph analysis'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Statistical graph analysis.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Fairbanks, James Paul. "Graph analysis combining numerical, statistical, and streaming techniques." Diss., Georgia Institute of Technology, 2016. http://hdl.handle.net/1853/54972.
Full textSoriani, Nicola. "Topics in Statistical Models for Network Analysis." Doctoral thesis, Università degli studi di Padova, 2012. http://hdl.handle.net/11577/3422100.
Full textLa Network Analysis è un insieme di tecniche statistiche e matematiche per lo studio di dati relazionali per un sistema di entità interconnesse. Molti dei risultati per i dati di rete provengono dalla Social Network Analysis (SNA), incentrata principalmente sullo studio delle relazioni tra un insieme di individui e organizzazioni. La tesi tratta alcuni argomenti riguardanti la modellazione statistica per dati di rete, con particolare attenzione ai modelli utilizzati in SNA. Il nucleo centrale della tesi è rappresentato dai Capitoli 3, 4 e 5. Nel Capitolo 3, viene proposto un approccio alternativo per la stima dei modelli esponenziali per grafi casuali (Exponential Random Graph Models - ERGMs). Nel capitolo 4, l'approccio di modellazione ERGM e quello a Spazio Latente vengono confrontati in termini di bontà di adattamento. Nel capitolo 5, vengono proposti metodi alternativi per la stima della classe di modelli p2.
GRASSI, FRANCESCO. "Statistical and Graph-Based Signal Processing: Fundamental Results and Application to Cardiac Electrophysiology." Doctoral thesis, Politecnico di Torino, 2018. http://hdl.handle.net/11583/2710580.
Full textMeinhardt, Llopis Enric. "Morphological and statistical techniques for the analysis of 3D images." Doctoral thesis, Universitat Pompeu Fabra, 2011. http://hdl.handle.net/10803/22719.
Full textThis thesis proposes a tree data structure to encode the connected components of level sets of 3D images. This data structure is applied as a main tool in several proposed applications: 3D morphological operators, medical image visualization, analysis of color histograms, object tracking in videos and edge detection. Motivated by the problem of edge linking, the thesis contains also an study of anisotropic total variation denoising as a tool for computing anisotropic Cheeger sets. These anisotropic Cheeger sets can be used to find global optima of a class of edge linking functionals. They are also related to some affine invariant descriptors which are used in object recognition, and this relationship is laid out explicitly.
Tavernari, Daniele. "Statistical and network-based methods for the analysis of chromatin accessibility maps in single cells." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2016. http://amslaurea.unibo.it/12297/.
Full textValba, Olga. "Statistical analysis of networks and biophysical systems of complex architecture." Phd thesis, Université Paris Sud - Paris XI, 2013. http://tel.archives-ouvertes.fr/tel-00919606.
Full textKamal, Tariq. "Computational Cost Analysis of Large-Scale Agent-Based Epidemic Simulations." Diss., Virginia Tech, 2016. http://hdl.handle.net/10919/82507.
Full textPh. D.
Jiang, Shan. "Statistical Modeling of Multi-Dimensional Knowledge Diffusion Networks: An ERGM-Based Framework." Diss., The University of Arizona, 2015. http://hdl.handle.net/10150/555946.
Full textLamont, Morné Michael Connell. "Binary classification trees : a comparison with popular classification methods in statistics using different software." Thesis, Stellenbosch : Stellenbosch University, 2002. http://hdl.handle.net/10019.1/52718.
Full textENGLISH ABSTRACT: Consider a data set with a categorical response variable and a set of explanatory variables. The response variable can have two or more categories and the explanatory variables can be numerical or categorical. This is a typical setup for a classification analysis, where we want to model the response based on the explanatory variables. Traditional statistical methods have been developed under certain assumptions such as: the explanatory variables are numeric only and! or the data follow a multivariate normal distribution. hl practice such assumptions are not always met. Different research fields generate data that have a mixed structure (categorical and numeric) and researchers are often interested using all these data in the analysis. hl recent years robust methods such as classification trees have become the substitute for traditional statistical methods when the above assumptions are violated. Classification trees are not only an effective classification method, but offer many other advantages. The aim of this thesis is to highlight the advantages of classification trees. hl the chapters that follow, the theory of and further developments on classification trees are discussed. This forms the foundation for the CART software which is discussed in Chapter 5, as well as other software in which classification tree modeling is possible. We will compare classification trees to parametric-, kernel- and k-nearest-neighbour discriminant analyses. A neural network is also compared to classification trees and finally we draw some conclusions on classification trees and its comparisons with other methods.
AFRIKAANSE OPSOMMING: Beskou 'n datastel met 'n kategoriese respons veranderlike en 'n stel verklarende veranderlikes. Die respons veranderlike kan twee of meer kategorieë hê en die verklarende veranderlikes kan numeries of kategories wees. Hierdie is 'n tipiese opset vir 'n klassifikasie analise, waar ons die respons wil modelleer deur gebruik te maak van die verklarende veranderlikes. Tradisionele statistiese metodes is ontwikkelonder sekere aannames soos: die verklarende veranderlikes is slegs numeries en! of dat die data 'n meerveranderlike normaal verdeling het. In die praktyk word daar nie altyd voldoen aan hierdie aannames nie. Verskillende navorsingsvelde genereer data wat 'n gemengde struktuur het (kategories en numeries) en navorsers wil soms al hierdie data gebruik in die analise. In die afgelope jare het robuuste metodes soos klassifikasie bome die alternatief geword vir tradisionele statistiese metodes as daar nie aan bogenoemde aannames voldoen word nie. Klassifikasie bome is nie net 'n effektiewe klassifikasie metode nie, maar bied baie meer voordele. Die doel van hierdie werkstuk is om die voordele van klassifikasie bome uit te wys. In die hoofstukke wat volg word die teorie en verdere ontwikkelinge van klassifikasie bome bespreek. Hierdie vorm die fondament vir die CART sagteware wat bespreek word in Hoofstuk 5, asook ander sagteware waarin klassifikasie boom modelering moontlik is. Ons sal klassifikasie bome vergelyk met parametriese-, "kernel"- en "k-nearest-neighbour" diskriminant analise. 'n Neurale netwerk word ook vergelyk met klassifikasie bome en ten slote word daar gevolgtrekkings gemaak oor klassifikasie bome en hoe dit vergelyk met ander metodes.
Noel, Jonathan A. "Extremal combinatorics, graph limits and computational complexity." Thesis, University of Oxford, 2016. https://ora.ox.ac.uk/objects/uuid:8743ff27-b5e9-403a-a52a-3d6299792c7b.
Full textAragonès, Martín Àngels. "Graph theory applied to transmission path problems in vibroacoustics." Doctoral thesis, Universitat Ramon Llull, 2015. http://hdl.handle.net/10803/299378.
Full textA fundamental aspect when solving a vibroacoustic problem in a mechanical system is that of finding out how energy flows from a given source to any part of the system. This would help making decisions to undertake actions for diminishing, for example, the noise or vibration levels at a given system area. The dynamic behavior of a mechanical system can be estimated using different numerical methods, each of them targeting a certain frequency range. Whereas at low frequencies deterministic methods such as the Finite Element Method (FEM) or the Boundary Element Method (BEM) can be applied, statistical methods like Statistical Energy Analysis (SEA) become unavoidable at high frequencies. In addition, a large variety of approaches such as the hybrid FE-SEA, the Energy Distribution (ED) models or the Statistical modal Energy distribution Analysis (SmEdA), among many others, have been recently proposed to tackle with the so-called mid-frequency problem. However, although numerical methods can predict the pointwise or averaged vibroacoustic response of a system, they do not directly provide information on how energy flows throughout the system. Therefore, some kind of post-processing is required to determine energy transmission paths. The energy transmitted through a particular path linking a source subsystem, where external energy is being input, and a target subsystem, can be computed numerically. Yet, identifying which paths dominate the whole energy transmission from source to target usually relies on the engineer's expertise and judgement. Thus, an approach for the automatic identification of those paths would prove very useful. Graph theory provides a way out to this problem, since powerful path algorithms for graphs are available. In this thesis, a link between vibroacoustic models and graph theory is proposed, which allows one to address energy transmission path problems in a straightforward manner. The dissertation starts focusing on SEA models. It is first shown that performing a transmission path analysis (TPA) in SEA makes sense. Then a graph that accurately represents the SEA model is defined. Given that the energy transmission between sources and targets is justified by the contribution of a limited group of dominant paths in many cases of practical interest, an algorithm to find them is presented. Thereafter, an enhanced algorithm is devised to include the stochastic nature of SEA loss factors in the ranking of paths. Next, it is discussed how transmission path analysis can be extended to the mid frequency range. The graph approach for path computation becomes adapted for some ED models, as well as for SmEdA. Finally, we outline another possible application of graph theory to vibroacoustics. A graph cut algorithm strategy is implemented to achieve energy reduction at a target subsystem with the sole modification of a reduced set of loss factors. The set is found by computing cuts in the graph separating source and receiver subsystems.
Un aspecto fundamental a la hora de resolver un problema vibroacústico en un sistema mecánico es el de determinar cómo fluye la energía desde una determinada fuente hasta cualquier parte del sistema. Ello ayudaría a tomar decisiones para emprender acciones destinadas a disminuir, por ejemplo, los niveles de ruido y vibraciones en un área del sistema dada. El comportamiento dinámico de un sistema mecánico se puede estimar utilizando varios métodos numéricos, cada uno de ellos enfocado a un determinado rango de frecuencia. Mientras en las bajas frecuencias se pueden aplicar métodos deterministas como el Método de los Elementos Finitos (FEM) o el método de Elementos de Contorno (BEM), los métodos estadísticos como el Análisis Estadístico Energético son inevitables en las altas frecuencias. Además, se han desarrollado gran variedad de técnicas como el FE-SEA híbrido, los modelos de Distribución de Energía (ED) o el Análisis Estadístico de distribución de Energía modal (SmEdA), entre otras, para tratar el llamado problema de las medias frecuencias. Sin embargo, aunque los métodos numéricos pueden predecir la respuesta vibroacústica puntual o promediada de un sistema mecánico, ellos no proporcionan información sobre como fluye la energía en el sistema. Por lo tanto, hace falta algún tipo de post-procesado para determinar las vías de transmisión de energía. La energía transmitida a través de un determinado camino que conecta un subsistema fuente, donde se introduce la energía, y un subsistema receptor, se puede calcular numéricamente. A pesar de ello, identificar qué caminos dominan la transmisión de energía desde la fuente al receptor normalmente suele recaer en la experiencia o el juicio del ingeniero. Así pues, un método automático para identificar estos caminos resultaría muy útil. La teoría de grafos proporciona una solución a este problema, ya que existen potentes algoritmos de cálculos de caminos en grafos. En esta tesis, se propone un enlace entre los modelos vibroacústicos y la teoría de grafos, que permite abordar los problemas de vías de transmisión de forma directa. La disertación empieza centrándose en los modelos SEA. Primeramente, se muestra que tiene sentido realizar un análisis de vías de transmisión (TPA) en un modelo SEA. Seguidamente, se define un grafo que representa fielmente un modelo SEA. Teniendo en cuenta que en muchos casos de interés práctico, la transmisión de energía entre fuentes y receptores se puede justificar mediante la contribución de un grupo finito de vías de transmisión, se define un algoritmo para encontrarlas. A continuación, se implementa un algoritmo que incluye en el cómputo de caminos la naturaleza estocástica de los factores de pérdidas SEA. Luego, se trata la extensión del análisis de vías de transmisión al rango de media frecuencia. La técnica de teoría de grafos aplicada a cálculo de caminos se adapta para algunos modelos ED y también SmEdA. Finalmente, se presenta otra posible aplicación de la teoría de grafos a la vibroacústica. Se implementa una estrategia basada en algoritmos de cortes en grafos destinada a reducir la energía en un subsistema receptor mediante la simple modificación de un grupo reducido de factores de pérdidas. El grupo se encuentra calculando cortes que separen en el grafo los subsistemas fuentes de los subsistemas receptores.
Maus, Aaron. "Formulation of Hybrid Knowledge-Based/Molecular Mechanics Potentials for Protein Structure Refinement and a Novel Graph Theoretical Protein Structure Comparison and Analysis Technique." ScholarWorks@UNO, 2019. https://scholarworks.uno.edu/td/2673.
Full textMaroušek, Vít. "Vizualizace vícerozměrných statistických dat." Master's thesis, Vysoká škola ekonomická v Praze, 2011. http://www.nusl.cz/ntk/nusl-85161.
Full textHolmgren, Åke J. "Quantitative vulnerability analysis of electric power networks." Doctoral thesis, KTH, Transporter och samhällsekonomi, 2006. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-3969.
Full textQC 20100831
Karlström, Daniel. "Implementation of data-collection tools using NetFlow for statistical analysis at the ISP level." Thesis, Mälardalens högskola, Akademin för innovation, design och teknik, 2012. http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-16140.
Full textAtt försvara sig mot DoS-och DDoS-attacker är svårt att åstadkomma; att hitta och filtrera ut illegitim trafik från det legitima flödet är nästan omöjligt. Att vidta åtgärder när en sådan attack upptäcks kan endast göras när IP-adresserna från angriparna är kända. Detta kan uppnås genom att man övervakar trafikflödet mellan målet för attacken och angriparna och ser vilka som sänder mest data och på så sätt identifierar angriparna.. Detta tillåter företaget eller dess ISP att blockera trafiken ifrån dessa IP-adresser genom att sända trafiken vidare till ingenstans. Detta kallas blackhole-routing eller null-routing. Genom att använda redovisnings- och övervakningsprogrammet pmacct syftar denna uppsats på att undersöka hurvida pmacct-sviten är lämpad för större installationer när det gäller att spåra och förhindra DDoS-attacker, såsom hos en Internetleverantör eller dylikt. Potentialla problem som kan uppstå är att mängden trafik som måste analyserar blir för stor och för krävande. Denna avhandling går även igenom pmacct-verktyget i sig. Slutsatserna är lovande, vilket indikerar att den har potential av att kunna hantera sådana stora miljöer med noggrann planering.
Phadnis, Miti. "Statistical Analysis of Linear Analog Circuits Using Gaussian Message Passing in Factor Graphs." DigitalCommons@USU, 2009. https://digitalcommons.usu.edu/etd/504.
Full textVohra, Neeru Rani. "Three dimensional statistical graphs, visual cues and clustering." Thesis, National Library of Canada = Bibliothèque nationale du Canada, 2000. http://www.collectionscanada.ca/obj/s4/f2/dsk2/ftp03/MQ56213.pdf.
Full textZoffoli, Violetta <1992>. "Multiple Graph Structure Learning: a comparative analysis." Doctoral thesis, Alma Mater Studiorum - Università di Bologna, 2020. http://amsdottorato.unibo.it/9400/1/tesi_finale.pdf.
Full textAlbà, Xènia. "Automated cardiac MR image analysis for population imaging." Doctoral thesis, Universitat Pompeu Fabra, 2017. http://hdl.handle.net/10803/403063.
Full textClinical practice and research are routinely generating large amounts of medical records, including medical images. However, valuable knowledge that could impact healthcare delivery remains currently frozen in these population cohorts. New tools are therefore necessary to process and exploit such large-scale data, taking into account in particular the unprecedented variability in anatomy and pathophysiology. In this thesis, we present new approaches for the automatic and robust processing of large-scale medical image data, focusing on the challenging segmentation of cardiac magnetic resonance images (MRI) studies. The main contributions of this thesis allow automatic segmentation (i) across multiple MRI sequences without the need for sequence-specific parameter tuning, (ii) across highly variable cases without a priori knowledge of the involved pathology, and (iii) incorporating automatic detection and quality control without the need for any user interaction. All of these techniques are demonstrated over multiple large-scale cohorts from different clinical centers and public databases.
Wang, Kaijun. "Graph-based Modern Nonparametrics For High-dimensional Data." Diss., Temple University Libraries, 2019. http://cdm16002.contentdm.oclc.org/cdm/ref/collection/p245801coll10/id/578840.
Full textPh.D.
Developing nonparametric statistical methods and inference procedures for high-dimensional large data have been a challenging frontier problem of statistics. To attack this problem, in recent years, a clear rising trend has been observed with a radically different viewpoint--``Graph-based Nonparametrics," which is the main research focus of this dissertation. The basic idea consists of two steps: (i) representation step: code the given data using graphs, (ii) analysis step: apply statistical methods on the graph-transformed problem to systematically tackle various types of data structures. Under this general framework, this dissertation develops two major research directions. Chapter 2—based on Mukhopadhyay and Wang (2019a)—introduces a new nonparametric method for high-dimensional k-sample comparison problem that is distribution-free, robust, and continues to work even when the dimension of the data is larger than the sample size. The proposed theory is based on modern LP-nonparametrics tools and unexplored connections with spectral graph theory. The key is to construct a specially-designed weighted graph from the data and to reformulate the k-sample problem into a community detection problem. The procedure is shown to possess various desirable properties along with a characteristic exploratory flavor that has practical consequences. The numerical examples show surprisingly well performance of our method under a broad range of realistic situations. Chapter 3—based on Mukhopadhyay and Wang (2019b)—revisits some foundational questions about network modeling that are still unsolved. In particular, we present unified statistical theory of the fundamental spectral graph methods (e.g., Laplacian, Modularity, Diffusion map, regularized Laplacian, Google PageRank model), which are often viewed as spectral heuristic-based empirical mystery facts. Despite half a century of research, this question has been one of the most formidable open issues, if not the core problem in modern network science. Our approach integrates modern nonparametric statistics, mathematical approximation theory (of integral equations), and computational harmonic analysis in a novel way to develop a theory that unifies and generalizes the existing paradigm. From a practical standpoint, it is shown that this perspective can provide adequate guidance for designing next-generation computational tools for large-scale problems. As an example, we have described the high-dimensional change-point detection problem. Chapter 4 discusses some further extensions and application of our methodologies to regularized spectral clustering and spatial graph regression problems. The dissertation concludes with the a discussion of two important areas of future studies.
Temple University--Theses
Blignaut, Rennette Julia. "Discriminant analysis : a review of its application to the classificationof grape cultivars." Master's thesis, University of Cape Town, 1989. http://hdl.handle.net/11427/14298.
Full textMei, Jonathan B. "Principal Network Analysis." Research Showcase @ CMU, 2018. http://repository.cmu.edu/dissertations/1175.
Full textYoung, Stephen J. "Random dot product graphs a flexible model for complex networks." Diss., Atlanta, Ga. : Georgia Institute of Technology, 2008. http://hdl.handle.net/1853/26548.
Full textCommittee Chair: Mihail, Milena; Committee Member: Lu, Linyuan; Committee Member: Sokol, Joel; Committee Member: Tetali, Prasad; Committee Member: Trotter, Tom; Committee Member: Yu, Xingxing. Part of the SMARTech Electronic Thesis and Dissertation Collection.
ARTARIA, ANDREA. "Objective Bayesian Analysis for Differential Gaussian Directed Acyclic Graphs." Doctoral thesis, Università degli Studi di Milano-Bicocca, 2014. http://hdl.handle.net/10281/55327.
Full textFadrný, Tomáš. "Statistické zhodnocení dat." Master's thesis, Vysoké učení technické v Brně. Fakulta strojního inženýrství, 2009. http://www.nusl.cz/ntk/nusl-228740.
Full textKim, Sungmin. "Community Detection in Directed Networks and its Application to Analysis of Social Networks." The Ohio State University, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=osu1397571499.
Full textPsorakis, Ioannis. "Probabilistic inference in ecological networks : graph discovery, community detection and modelling dynamic sociality." Thesis, University of Oxford, 2013. http://ora.ox.ac.uk/objects/uuid:84741d8b-31ea-4eee-ae44-a0b7b5491700.
Full textMaignant, Elodie. "Plongements barycentriques pour l'apprentissage géométrique de variétés : application aux formes et graphes." Electronic Thesis or Diss., Université Côte d'Azur, 2023. http://www.theses.fr/2023COAZ4096.
Full textAn MRI image has over 60,000 pixels. The largest known human protein consists of around 30,000 amino acids. We call such data high-dimensional. In practice, most high-dimensional data is high-dimensional only artificially. For example, of all the images that could be randomly generated by coloring 256 x 256 pixels, only a very small subset would resemble an MRI image of a human brain. This is known as the intrinsic dimension of such data. Therefore, learning high-dimensional data is often synonymous with dimensionality reduction. There are numerous methods for reducing the dimension of a dataset, the most recent of which can be classified according to two approaches.A first approach known as manifold learning or non-linear dimensionality reduction is based on the observation that some of the physical laws behind the data we observe are non-linear. In this case, trying to explain the intrinsic dimension of a dataset with a linear model is sometimes unrealistic. Instead, manifold learning methods assume a locally linear model.Moreover, with the emergence of statistical shape analysis, there has been a growing awareness that many types of data are naturally invariant to certain symmetries (rotations, reparametrizations, permutations...). Such properties are directly mirrored in the intrinsic dimension of such data. These invariances cannot be faithfully transcribed by Euclidean geometry. There is therefore a growing interest in modeling such data using finer structures such as Riemannian manifolds. A second recent approach to dimension reduction consists then in generalizing existing methods to non-Euclidean data. This is known as geometric learning.In order to combine both geometric learning and manifold learning, we investigated the method called locally linear embedding, which has the specificity of being based on the notion of barycenter, a notion a priori defined in Euclidean spaces but which generalizes to Riemannian manifolds. In fact, the method called barycentric subspace analysis, which is one of those generalizing principal component analysis to Riemannian manifolds, is based on this notion as well. Here we rephrase both methods under the new notion of barycentric embeddings. Essentially, barycentric embeddings inherit the structure of most linear and non-linear dimension reduction methods, but rely on a (locally) barycentric -- affine -- model rather than a linear one.The core of our work lies in the analysis of these methods, both on a theoretical and practical level. In particular, we address the application of barycentric embeddings to two important examples in geometric learning: shapes and graphs. In addition to practical implementation issues, each of these examples raises its own theoretical questions, mostly related to the geometry of quotient spaces. In particular, we highlight that compared to standard dimension reduction methods in graph analysis, barycentric embeddings stand out for their better interpretability. In parallel with these examples, we characterize the geometry of locally barycentric embeddings, which generalize the projection computed by locally linear embedding. Finally, algorithms for geometric manifold learning, novel in their approach, complete this work
Araújo, Eduardo Barbosa. "Scientific Collaboration Networks from Lattes Database: Topology, Dynamics and Gender Statistics." reponame:Repositório Institucional da UFC, 2016. http://www.repositorio.ufc.br/handle/riufc/18489.
Full textSubmitted by Edvander Pires (edvanderpires@gmail.com) on 2016-07-19T15:58:54Z No. of bitstreams: 1 2016_tese_ebaraujo.pdf: 3600069 bytes, checksum: a78e83ffda97c499e589b405da4da3c8 (MD5)
Approved for entry into archive by Edvander Pires (edvanderpires@gmail.com) on 2016-07-19T15:59:07Z (GMT) No. of bitstreams: 1 2016_tese_ebaraujo.pdf: 3600069 bytes, checksum: a78e83ffda97c499e589b405da4da3c8 (MD5)
Made available in DSpace on 2016-07-19T15:59:07Z (GMT). No. of bitstreams: 1 2016_tese_ebaraujo.pdf: 3600069 bytes, checksum: a78e83ffda97c499e589b405da4da3c8 (MD5) Previous issue date: 2016
Understanding the dynamics of research production and collaboration may reveal better strategies for scientific careers, academic institutions and funding agencies. Here we propose the use of a large and multidisciplinary database of scientific curricula in Brazil, namely, the Lattes Platform, to study patterns of scientific production and collaboration. Detailed information about publications and researchers is available in this database. Individual curricula are submitted by the researchers themselves so that co-authorship is unambiguous. Researchers can be evaluated by scientific productivity, geographical location and field of expertise. Our results show that the collaboration network is growing exponentially for the last three decades, with a distribution of number of collaborators per researcher that approaches a power-law as the network gets older. Moreover, both the distributions of number of collaborators and production per researcher obey power-law behaviors, regardless of the geographical location or field, suggesting that the same universal mechanism might be responsible for network growth and productivity. We also show that the collaboration network under investigation displays a typical assortative mixing behavior, where teeming researchers (i.e., with high degree) tend to collaborate with others alike. Moreover, we discover that on average men prefer collaborating with other men than with women, while women are more egalitarian. This is consistently observed over all fields and essentially independent on the number of collaborators of the researcher. The solely exception is for engineering, where clearly this gender bias is less pronounced, when the number of collaborators increases. We also find that the distribution of number of collaborators follows a power-law, with a cut-off that is gender dependent. This reflects the fact that on average men produce more papers andhave more collaborators than women. We also find that both genders display the same tendency towards interdisciplinary collaborations, except for Exact and Earth Sciences, where women having many collaborators are more open to interdisciplinary research.
Compreender a dinâmica de produção e colaboração em pesquisa pode revelar melhores estratégias para carreiras científicas, instituições acadêmicas e agências de fomento. Neste trabalho nós propomos o uso de uma grande e multidisciplinar base de currículos científicos brasileira, a Plataforma Lattes, para o estudo de padrões em pesquisa científica e colaborações. Esta base de dados inclui informações detalhadas acerca de publicações e pesquisadores. Currículos individuais são enviados pelos próprios pesquisadores de forma que a identificação de coautoria não é ambígua. Pesquisadores podem ser classificados por produção científica, localização geográfica e áreas de pesquisa. Nossos resultados mostram que a rede de colaborações científicas tem crescido exponencialmente nas últimas três décadas, com a distribuição do número de colaboradores por pesquisador se aproximando de uma lei de potência à medida que a rede evolui. Além disso, ambas a distribuição do número de colaboradores e a produção por pesquisador seguem o comportamento de leis de potência, independentemente da região ou áreas, sugerindo que um mesmo mecanismo universal pode ser responsável pelo crescimento da rede e pela produtividade dos pesquisadores. Também mostramos que as redes de colaboração investigadas apresentam um típico comportamento assortativo, no qual pesquisadores de alto nível (com muitos colaboradores) tendem a colaborador com outros semelhantes. Em seguida, mostramos que homens preferem colaborar com outros homens enquanto mulheres são mais igualitárias ao estabelecer suas colaborações. Isso é consistentemente observado em todas as áreas e é essencialmente independente do número de colaborações do pesquisador. A única exceção sendo a área de Engenharia, na qual este viés é claramente menos pronunciado para pesquisadores com muitas colaborações. Também mostramos que o número de colaborações segue o comportamento de leis de potência, com um cutoff dependente do gênero. Isso se reflete no fato de que em média mulheres produzem menos artigos e têm menos colaborações que homens. Também mostramos que ambos os gêneros exibem a mesma tendência quanto a colaborações interdisciplinares, exceto em Ciências Exatas e da Terra, nas quais mulheres tendo mais colaboradores são mais propensas a pesquisas interdisciplinares.
AraÃjo, Eduardo Barbosa. "Scientific Collaboration Networks from Lattes Database: Topology, Dynamics and Gender Statistics." Universidade Federal do CearÃ, 2016. http://www.teses.ufc.br/tde_busca/arquivo.php?codArquivo=17184.
Full textCompreender a dinÃmica de produÃÃo e colaboraÃÃo em pesquisa pode revelar melhores estratÃgias para carreiras cientÃficas, instituiÃÃes acadÃmicas e agÃncias de fomento. Neste trabalho nÃs propomos o uso de uma grande e multidisciplinar base de currÃculos cientÃficos brasileira, a Plataforma Lattes, para o estudo de padrÃes em pesquisa cientÃfica e colaboraÃÃes. Esta base de dados inclui informaÃÃes detalhadas acerca de publicaÃÃes e pesquisadores. CurrÃculos individuais sÃo enviados pelos prÃprios pesquisadores de forma que a identificaÃÃo de coautoria nÃo à ambÃgua. Pesquisadores podem ser classificados por produÃÃo cientÃfica, localizaÃÃo geogrÃfica e Ãreas de pesquisa. Nossos resultados mostram que a rede de colaboraÃÃes cientÃficas tem crescido exponencialmente nas Ãltimas trÃs dÃcadas, com a distribuiÃÃo do nÃmero de colaboradores por pesquisador se aproximando de uma lei de potÃncia à medida que a rede evolui. AlÃm disso, ambas a distribuiÃÃo do nÃmero de colaboradores e a produÃÃo por pesquisador seguem o comportamento de leis de potÃncia, independentemente da regiÃo ou Ãreas, sugerindo que um mesmo mecanismo universal pode ser responsÃvel pelo crescimento da rede e pela produtividade dos pesquisadores. TambÃm mostramos que as redes de colaboraÃÃo investigadas apresentam um tÃpico comportamento assortativo, no qual pesquisadores de alto nÃvel (com muitos colaboradores) tendem a colaborador com outros semelhantes. Em seguida, mostramos que homens preferem colaborar com outros homens enquanto mulheres sÃo mais igualitÃrias ao estabelecer suas colaboraÃÃes. Isso à consistentemente observado em todas as Ãreas e à essencialmente independente do nÃmero de colaboraÃÃes do pesquisador. A Ãnica exceÃÃo sendo a Ãrea de Engenharia, na qual este viÃs à claramente menos pronunciado para pesquisadores com muitas colaboraÃÃes. TambÃm mostramos que o nÃmero de colaboraÃÃes segue o comportamento de leis de potÃncia, com um cutoff dependente do gÃnero. Isso se reflete no fato de que em mÃdia mulheres produzem menos artigos e tÃm menos colaboraÃÃes que homens. TambÃm mostramos que ambos os gÃneros exibem a mesma tendÃncia quanto a colaboraÃÃes interdisciplinares, exceto em CiÃncias Exatas e da Terra, nas quais mulheres tendo mais colaboradores sÃo mais propensas a pesquisas interdisciplinares.
Understanding the dynamics of research production and collaboration may reveal better strategies for scientific careers, academic institutions and funding agencies. Here we propose the use of a large and multidisciplinary database of scientific curricula in Brazil, namely, the Lattes Platform, to study patterns of scientific production and collaboration. Detailed information about publications and researchers is available in this database. Individual curricula are submitted by the researchers themselves so that co-authorship is unambiguous. Researchers can be evaluated by scientific productivity, geographical location and field of expertise. Our results show that the collaboration network is growing exponentially for the last three decades, with a distribution of number of collaborators per researcher that approaches a power-law as the network gets older. Moreover, both the distributions of number of collaborators and production per researcher obey power-law behaviors, regardless of the geographical location or field, suggesting that the same universal mechanism might be responsible for network growth and productivity. We also show that the collaboration network under investigation displays a typical assortative mixing behavior, where teeming researchers (i.e., with high degree) tend to collaborate with others alike. Moreover, we discover that on average men prefer collaborating with other men than with women, while women are more egalitarian. This is consistently observed over all fields and essentially independent on the number of collaborators of the researcher. The solely exception is for engineering, where clearly this gender bias is less pronounced, when the number of collaborators increases. We also find that the distribution of number of collaborators follows a power-law, with a cut-off that is gender dependent. This reflects the fact that on average men produce more papers andhave more collaborators than women. We also find that both genders display the same tendency towards interdisciplinary collaborations, except for Exact and Earth Sciences, where women having many collaborators are more open to interdisciplinary research.
Li, Xiaohu. "Security Analysis on Network Systems Based on Some Stochastic Models." ScholarWorks@UNO, 2014. http://scholarworks.uno.edu/td/1931.
Full textHerman, Joseph L. "Multiple sequence analysis in the presence of alignment uncertainty." Thesis, University of Oxford, 2014. http://ora.ox.ac.uk/objects/uuid:88a56d9f-a96e-48e3-b8dc-a73f3efc8472.
Full textFockstedt, Jonas, and Ema Krcic. "Unsupervised anomaly detection for structured data - Finding similarities between retail products." Thesis, Högskolan i Halmstad, Akademin för informationsteknologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-44756.
Full textKoskinen, Johan. "Essays on Bayesian Inference for Social Networks." Doctoral thesis, Stockholm : Department of Statistics [Statistiska institutionen], Univ, 2004. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-128.
Full textMartinet, Lucie. "Réseaux dynamiques de terrain : caractérisation et propriétés de diffusion en milieu hospitalier." Thesis, Lyon, École normale supérieure, 2015. http://www.theses.fr/2015ENSL1010/document.
Full textIn this thesis, we focus on tools whose aim is to extract structural and temporal properties of dynamic networks as well as diffusion characteristics which can occur on these networks. We work on specific data, from the European MOSAR project, including the network of individuals proximity from time to time during 6 months at the Brek-sur-Mer Hospital. The studied network is notable because of its three dimensions constitution : the structural one induced by the distribution of individuals into distinct services, the functional dimension due to the partition of individual into groups of socio-professional categories and the temporal dimension.For each dimension, we used tools well known from the areas of statistical physics as well as graphs theory in order to extract information which enable to describe the network properties. These methods underline the specific structure of the contacts distribution which follows the individuals distribution into services. We also highlight strong links within specific socio-professional categories. Regarding the temporal part, we extract circadian and weekly patterns and quantify the similarities of these activities. We also notice distinct behaviour within patients and staff evolution. In addition, we present tools to compare the network activity within two given periods. To finish, we use simulations techniques to extract diffusion properties of the network to find some clues in order to establish a prevention policy
Lumbreras, Alberto. "Automatic role detection in online forums." Thesis, Lyon, 2016. http://www.theses.fr/2016LYSE2111/document.
Full textThis thesis addresses the problem of detecting user roles in online discussion forums. A role may be defined as the set of behaviors characteristic of a person or a position. In discussion forums, behaviors are primarily observed through conversations. Hence, we focus our attention on how users discuss. We propose three methods to detect groups of users with similar conversational behaviors.Our first method for the detection of roles is based on conversational structures. Weapply different notions of neighborhood for posts in tree graphs (radius-based, order-based, and time-based) and compare the conversational patterns that they detect as well as the clusters of users with similar conversational patterns.Our second method is based on stochastic models of growth for conversation threads.Building upon these models we propose a method to find groups of users that tend to reply to the same type of posts. We show that, while there are clusters of users with similar replying patterns, there is no strong evidence that these behaviors are predictive of future behaviors |except for some groups of users with extreme behaviors.In out last method, we integrate the type of data used in the two previous methods(feature-based and behavioral or functional-based) and show that we can find clusters using fewer examples. The model exploits the idea that users with similar features have similar behaviors
Zaylaa, Amira. "Analyse et extraction de paramètres de complexité de signaux biomédicaux." Thesis, Tours, 2014. http://www.theses.fr/2014TOUR3315/document.
Full textThe analysis of biomedical time series derived from nonlinear dynamic systems is challenging due to the chaotic nature of these time series. Only few classical parameters can be detected by clinicians to opt the state of patients and fetuses. Though there exist valuable complexity invariants such as multi-fractal parameters, entropies and recurrence plot, they were unsatisfactory in certain cases. To overcome this limitation, we propose in this dissertation new entropy invariants, we contributed to multi-fractal analysis and we developed signal-based (unbiased) recurrence plots based on the dynamic transitions of time series. Principally, we aim to improve the discrimination between healthy and distressed biomedical systems, particularly fetuses by processing the time series using our techniques. These techniques were either validated on Lorenz system, logistic maps or fractional Brownian motions modeling chaotic and random time series. Then the techniques were applied to real fetus heart rate signals recorded in the third trimester of pregnancy. Statistical measures comprising the relative errors, standard deviation, sensitivity, specificity, precision or accuracy were employed to evaluate the performance of detection. Elevated discernment outcomes were realized by the high-order entropy invariants. Multi-fractal analysis using a structure function enhances the detection of medical fetal states. Unbiased cross-determinism invariant amended the discrimination process. The significance of our techniques lies behind their post-processing codes which could build up cutting-edge portable machines offering advanced discrimination and detection of Intrauterine Growth Restriction prior to fetal death. This work was devoted to Fetal Heart Rates but time series generated by alternative nonlinear dynamic systems should be further considered
Bělohlávek, Jiří. "Agent pro kurzové sázení." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2008. http://www.nusl.cz/ntk/nusl-235980.
Full textSORIANI, NICOLA. "Topics in Statistical Models for Network Analysis." Doctoral thesis, 2012. http://hdl.handle.net/11577/2697077.
Full textJalali, Ali 1982. "Dirty statistical models." Thesis, 2012. http://hdl.handle.net/2152/ETD-UT-2012-05-5088.
Full texttext
"Generalized Statistical Tolerance Analysis and Three Dimensional Model for Manufacturing Tolerance Transfer in Manufacturing Process Planning." Doctoral diss., 2011. http://hdl.handle.net/2286/R.I.9125.
Full textDissertation/Thesis
Ph.D. Mechanical Engineering 2011
Krištof, Radim. "Žákovská interpretace grafických výstupů statistických šetření." Master's thesis, 2016. http://www.nusl.cz/ntk/nusl-354104.
Full textChe, Xuan. "Spatial graphical models with discrete and continuous components." Thesis, 2012. http://hdl.handle.net/1957/33644.
Full textGraduation date: 2013
"An application of cox hazard model and CART model in analyzing the mortality data of elderly in Hong Kong." 2002. http://library.cuhk.edu.hk/record=b5891190.
Full textThesis (M.Phil.)--Chinese University of Hong Kong, 2002.
Includes bibliographical references (leaves 85-87).
Abstracts in English and Chinese.
Chapter 1 --- Introduction --- p.1
Chapter 1.1 --- Overview --- p.1
Chapter 1.1.1 --- Survival Analysis --- p.2
Chapter 1.1.2 --- Tree、-structured Statistical Method --- p.2
Chapter 1.1.3 --- Mortality Study --- p.3
Chapter 1.2 --- Motivation --- p.3
Chapter 1.3 --- Background Information --- p.4
Chapter 1.4 --- Data Content --- p.7
Chapter 1.5 --- Thesis Outline --- p.8
Chapter 2 --- Imputation and File Splitting --- p.10
Chapter 2.1 --- Imputation of Missing Values --- p.10
Chapter 2.1.1 --- Purpose of Imputation --- p.10
Chapter 2.1.2 --- Procedure of Hot Deck Imputation --- p.11
Chapter 2.1.3 --- List of Variables for Imputation --- p.12
Chapter 2.2 --- File Splitting --- p.14
Chapter 2.2.1 --- Splitting by Gender --- p.14
Chapter 2.3 --- Splitting for Validation Check --- p.1G
Chapter 3 --- Cox Hazard Model --- p.17
Chapter 3.1 --- Basic Idea --- p.17
Chapter 3.1.1 --- Survival Analysis --- p.17
Chapter 3.1.2 --- Survivor Function --- p.18
Chapter 3.1.3 --- Hazard Function --- p.18
Chapter 3.2 --- The Cox Proportional Hazards Model --- p.19
Chapter 3.2.1 --- Kaplan-Meier Estimate and Log-Rank Test --- p.20
Chapter 3.2.2 --- Hazard Ratio --- p.23
Chapter 3.2.3 --- Partial Likelihood --- p.24
Chapter 3.3 --- Extension of the Cox Proportional Hazards Model for Time-dependent Variables --- p.25
Chapter 3.3.1 --- Modification of the Cox's Model --- p.25
Chapter 3.4 --- Results of Model Fitting --- p.26
Chapter 3.4.1 --- Extract the Significant Covariates from the Models --- p.31
Chapter 3.5 --- Model Interpretation --- p.32
Chapter 4 --- CART --- p.37
Chapter 4.1 --- CART Procedure --- p.38
Chapter 4.2 --- Selection of the Splits --- p.39
Chapter 4.2.1 --- Goodness of Split --- p.39
Chapter 4.2.2 --- Type of Variables --- p.40
Chapter 4.2.3 --- Estimation --- p.40
Chapter 4.3 --- Pruning the Tree --- p.41
Chapter 4.3.1 --- Misclassification Cost --- p.42
Chapter 4.3.2 --- Class Assignment Rule --- p.44
Chapter 4.3.3 --- Minimal Cost Complexity Pruning --- p.44
Chapter 4.4 --- Cross Validation --- p.47
Chapter 4.4.1 --- V-fold Cross-validation --- p.47
Chapter 4.4.2 --- Selecting the right sized tree --- p.49
Chapter 4.5 --- Missing Value --- p.49
Chapter 4.6 --- Results of CART program --- p.51
Chapter 4.7 --- Model Interpretation --- p.53
Chapter 5 --- Model Prediction --- p.58
Chapter 5.1 --- Application to Test Sample --- p.58
Chapter 5.1.1 --- Fitting test sample to Cox's Model --- p.59
Chapter 5.1.2 --- Fitting test sample to CART model --- p.61
Chapter 5.2 --- Comparison of Model Prediction --- p.62
Chapter 5.2.1 --- Misclassification Rate --- p.62
Chapter 5.2.2 --- Misclassification Rate of Cox's model --- p.63
Chapter 5.2.3 --- Misclassification Rate of CART model --- p.64
Chapter 5.2.4 --- Prediction Result --- p.64
Chapter 6 --- Conclusion --- p.67
Chapter 6.1 --- Comparison of Results --- p.67
Chapter 6.2 --- Comparison of the Two Statistical Techniques --- p.68
Chapter 6.3 --- Limitation --- p.70
Appendix A: Coding Description for the Health Factors --- p.72
Appendix B: Log-rank Test --- p.75
Appendix C: Longitudinal Plot of Time Dependent Variables --- p.76
Appendix D: Hypothesis Testing of Suspected Covariates --- p.78
Appendix E: Terminal node report for both gender --- p.81
Appendix F: Calculation of Critical Values --- p.83
Appendix G: Distribution of Missing Value in Learning sample and Test Sample --- p.84
Bibliography --- p.85
KANG, TSAI FU, and 蔡阜鋼. "The study on item analysis – a research on elementary school students’ conceptualization of statistical gragh." Thesis, 2005. http://ndltd.ncl.edu.tw/handle/66413671381554190760.
Full text臺中師範學院
數學教育學系在職進修教學碩士學位班
93
This study is to devise a test item on statistical gragh by researcher, and to analyze the test outcome through the relational structure figure drawn by Item Relational Structure Analysis (IRS), with the expectation to explore the cognition constructing process of elementary school students when they are forming their concepts of statistical gragh. The researcher, taking references to the present math curriculum of elementary school and the domestic and foreign studies, devises a test item based on seven sub-concepts of statistical gragh: (1) basic statistical concepts of data, (2) concepts of symbolic apposition, (3) concepts of graphing a data set, (4) concepts of statistical gragh presentation, (5) concepts of comparison among subclasses on statistical gragh, (6) statistical concepts of statistical gragh of single data set, and (7) statistical concepts of statistical gragh of multiple data set. The study was done on a class of sixth graders of an elementary school in Tai-Chung County. After the students took the test, the outcome was analyzed with IRSP, which is designed based on IRS with the expectation of getting information through the item relational structure analysis figure. According to the structure figure, several findings are concluded: With the paper-and-pencil test and analysis on relational structure figure, the researcher has found the cognition of the tested elementary school students on statistical gragh developing in the following order: (1) basic statistical concepts of data, (2) concepts of symbolic apposition, (3) concepts of graphing a data set, (4) concepts of statistical gragh presentation, (5) concepts of comparison among subclasses on statistical gragh, (6) statistical concepts of statistical gragh of single data set, and (7) statistical concepts of statistical gragh of multiple data set. Through this test, it is also found in this study that the outcome of Bar Chart is better than Line Chart and the outcome of Line Chart is better than Pie Chart. However, sub-concepts of Pie Chart do not have significant hierarchically relation with the sub-concepts of Bar Chart and Line Chart though they do show influence on the students’ cognition development on conceptualizing statistical gragh. With the findings and the conclusion, the researcher has made some suggestions that teachers and future studies can draw references to.
Gupta, Shubham. "Statistical Network Analysis: Community Structure, Fairness Constraints, and Emergent Behavior." Thesis, 2021. https://etd.iisc.ac.in/handle/2005/5513.
Full textHumphries, Peter J. "Combinatorial aspects of leaf-labelled trees : a thesis submitted in partial fulfilment of the requirements for the degree of Doctor of Philosophy in Mathematics, University of Canterbury Department of Mathematics and Statistics /." 2008. http://hdl.handle.net/10092/1801.
Full textWu, Su-Heng, and 吳素亨. "A Content Analysis of the Statistical Graphs Materials in the Elementary Mathematic Textbooks of Taiwan and Finland." Thesis, 2016. http://ndltd.ncl.edu.tw/handle/ps96p5.
Full text國立臺北教育大學
數學暨資訊教育學系(含數學教育碩士班)
104
The study aims to explore the differences between the mathematics textbook of Hanlin version in Taiwan, and WSOY version in Finland in terms of the time and order of statistical charts, and how the teaching materials are presented. Through content analysis method and concept mapping, the study discovers that Hanlin versions in Taiwan and WSOY version in Finland all contain higher proportion of “reading, comparing, and interpreting statistical charts” and “reading application forms,” showing that both countries emphasize the reading, comparing, and understanding of statistical charts and forms. The teaching materials of both countries include materials for first graders to classify and arrange data and turn records into numbers. As for the design of questions, the questions in the teaching materials of both countries are mostly comparison and calculation. Yet for the arrangement of topics on statistical forms, the teaching materials of the 2D forms in Hanlin version is to seek answers through cross corresponding whereas those in WSOY version contains 2D forms in both cross corresponding and logical judgment. In statistical charts, Hanlin version contains bar charts, line charts, and pie charts (Exception: the workbook of the second semester of the fourth graders in the 2003 Hanlin version contains population pyramids) while the statistical charts in WSOY version are diverse, including not only bar charts, line charts, pie charts as well as graphic statistical charts, climate diagrams, population pyramids, and histograms. However, in the drawing of statistical charts, the teaching materials in Finland do not contain the drawing of pie charts, but those include the drawing of range bars. The study suggests the statistical charts textbooks in early grades shall highlight again the importance of establishing classification standards for the development of classification concepts rather than only emphasizing records. In this way, the first step for organizing data can be completed.
Rea, William S. "The application of atheoretical regression trees to problems in time series analysis : a thesis submitted in partial fulfilment of the requirements for the degree of Doctor of Philosophy in Mathematics, Department of Mathematics and Statistics, University of Canterbury /." 2008. http://hdl.handle.net/10092/1715.
Full textKulmatitskiy, Nikolay. "Modeling Dynamic Network with Centrality-based Logistic Regression." Thesis, 2011. http://hdl.handle.net/10012/6290.
Full text