Academic literature on the topic 'Haplotype assembly'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Haplotype assembly.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "Haplotype assembly"

1

Xu, Mengyang, Lidong Guo, Xiao Du, Lei Li, Brock A. Peters, Li Deng, Ou Wang, et al. "Accurate haplotype-resolved assembly reveals the origin of structural variants for human trios." Bioinformatics 37, no. 15 (February 4, 2021): 2095–102. http://dx.doi.org/10.1093/bioinformatics/btab068.

Full text
Abstract:
Abstract Motivation Achieving a near complete understanding of how the genome of an individual affects the phenotypes of that individual requires deciphering the order of variations along homologous chromosomes in species with diploid genomes. However, true diploid assembly of long-range haplotypes remains challenging. Results To address this, we have developed Haplotype-resolved Assembly for Synthetic long reads using a Trio-binning strategy, or HAST, which uses parental information to classify reads into maternal or paternal. Once sorted, these reads are used to independently de novo assemble the parent-specific haplotypes. We applied HAST to cobarcoded second-generation sequencing data from an Asian individual, resulting in a haplotype assembly covering 94.7% of the reference genome with a scaffold N50 longer than 11 Mb. The high haplotyping precision (∼99.7%) and recall (∼95.9%) represents a substantial improvement over the commonly used tool for assembling cobarcoded reads (Supernova), and is comparable to a trio-binning-based third generation long-read-based assembly method (TrioCanu) but with a significantly higher single-base accuracy [up to 99.99997% (Q65)]. This makes HAST a superior tool for accurate haplotyping and future haplotype-based studies. Availability and implementation The code of the analysis is available at https://github.com/BGI-Qingdao/HAST Supplementary information Supplementary data are available at Bioinformatics online.
APA, Harvard, Vancouver, ISO, and other styles
2

Siragusa, Enrico, Niina Haiminen, Richard Finkers, Richard Visser, and Laxmi Parida. "Haplotype assembly of autotetraploid potato using integer linear programing." Bioinformatics 35, no. 18 (January 25, 2019): 3279–86. http://dx.doi.org/10.1093/bioinformatics/btz060.

Full text
Abstract:
Abstract Summary Haplotype assembly of polyploids is an open issue in plant genomics. Recent experimental studies on highly heterozygous autotetraploid potato have shown that available methods do not deliver satisfying results in practice. We propose an optimal method to assemble haplotypes of highly heterozygous polyploids from Illumina short-sequencing reads. Our method is based on a generalization of the existing minimum fragment removal model to the polyploid case and on new integer linear programs to reconstruct optimal haplotypes. We validate our methods experimentally by means of a combined evaluation on simulated and experimental data based on 83 previously sequenced autotetraploid potato cultivars. Results on simulated data show that our methods produce highly accurate haplotype assemblies, while results on experimental data confirm a sensible improvement over the state of the art. Availability and implementation Executables for Linux at http://github.com/Computational Genomics/HaplotypeAssembler. Supplementary information Supplementary data are available at Bioinformatics online.
APA, Harvard, Vancouver, ISO, and other styles
3

Bahcall, Orli. "Single-haplotype genome assembly." Nature Genetics 46, no. 12 (November 24, 2014): 1257. http://dx.doi.org/10.1038/ng.3157.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Rodriguez, Oscar L., Anna Ritz, Andrew J. Sharp, and Ali Bashir. "MsPAC: a tool for haplotype-phased structural variant detection." Bioinformatics 36, no. 3 (August 9, 2019): 922–24. http://dx.doi.org/10.1093/bioinformatics/btz618.

Full text
Abstract:
Abstract Summary While next-generation sequencing (NGS) has dramatically increased the availability of genomic data, phased genome assembly and structural variant (SV) analyses are limited by NGS read lengths. Long-read sequencing from Pacific Biosciences and NGS barcoding from 10x Genomics hold the potential for far more comprehensive views of individual genomes. Here, we present MsPAC, a tool that combines both technologies to partition reads, assemble haplotypes (via existing software) and convert assemblies into high-quality, phased SV predictions. MsPAC represents a framework for haplotype-resolved SV calls that moves one step closer to fully resolved, diploid genomes. Availability and implementation https://github.com/oscarlr/MsPAC. Supplementary information Supplementary data are available at Bioinformatics online.
APA, Harvard, Vancouver, ISO, and other styles
5

Sun, Hequan, Wen-Biao Jiao, Kristin Krause, José A. Campoy, Manish Goel, Kat Folz-Donahue, Christian Kukat, Bruno Huettel, and Korbinian Schneeberger. "Chromosome-scale and haplotype-resolved genome assembly of a tetraploid potato cultivar." Nature Genetics 54, no. 3 (March 2022): 342–48. http://dx.doi.org/10.1038/s41588-022-01015-0.

Full text
Abstract:
AbstractPotato is the most widely produced tuber crop worldwide. However, reconstructing the four haplotypes of its autotetraploid genome remained an unsolved challenge. Here, we report the 3.1 Gb haplotype-resolved (at 99.6% precision), chromosome-scale assembly of the potato cultivar ‘Otava’ based on high-quality long reads, single-cell sequencing of 717 pollen genomes and Hi-C data. Unexpectedly, ~50% of the genome was identical-by-descent due to recent inbreeding, which was contrasted by highly abundant structural rearrangements involving ~20% of the genome. Among 38,214 genes, only 54% were present in all four haplotypes with an average of 3.2 copies per gene. Taking the leaf transcriptome as an example, 11% of the genes were differently expressed in at least one haplotype, where 25% of them were likely regulated through allele-specific DNA methylation. Our work sheds light on the recent breeding history of potato, the functional organization of its tetraploid genome and has the potential to strengthen the future of genomics-assisted breeding.
APA, Harvard, Vancouver, ISO, and other styles
6

Si, Hongbo, Haris Vikalo, and Sriram Vishwanath. "Information-Theoretic Analysis of Haplotype Assembly." IEEE Transactions on Information Theory 63, no. 6 (June 2017): 3468–79. http://dx.doi.org/10.1109/tit.2017.2686884.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Mousavi, Sayyed R. "Improved haplotype assembly using Xor genotypes." Journal of Theoretical Biology 298 (April 2012): 122–30. http://dx.doi.org/10.1016/j.jtbi.2012.01.003.

Full text
APA, Harvard, Vancouver, ISO, and other styles
8

Chu, Wai Keung, Peter Edge, Ho Suk Lee, Vikas Bansal, Vineet Bafna, Xiaohua Huang, and Kun Zhang. "Ultraaccurate genome sequencing and haplotyping of single human cells." Proceedings of the National Academy of Sciences 114, no. 47 (October 24, 2017): 12512–17. http://dx.doi.org/10.1073/pnas.1707609114.

Full text
Abstract:
Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10−8 and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs.
APA, Harvard, Vancouver, ISO, and other styles
9

Newman, Chris, Ming-shan Tsai, Christina D. Buesching, Peter W. H. Holland, and David W. Macdonald. "The genome sequence of the European badger, Meles meles (Linnaeus, 1758)." Wellcome Open Research 7 (September 23, 2022): 239. http://dx.doi.org/10.12688/wellcomeopenres.18230.1.

Full text
Abstract:
We present a haplotype resolved, diploid genome assembly from a male Meles meles (European badger; Chordata; Mammalia; Carnivora; Mustelidae) using the trio binning approach. The genome sequence is 2,739 megabases in span. The majority of the assembly (95.16%) is scaffolded into 23 chromosomal pseudomolecules with the X and Y sex chromosomes assembled. The complete mitochondrial genome was also assembled and is 16.4 kilobases in length.
APA, Harvard, Vancouver, ISO, and other styles
10

Mohades, M. M., M. H. Kahaei, and H. Mohades. "Haplotype assembly using Riemannian trust-region method." Digital Signal Processing 112 (May 2021): 102999. http://dx.doi.org/10.1016/j.dsp.2021.102999.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Dissertations / Theses on the topic "Haplotype assembly"

1

Moeinzadeh, Mohammadhossein [Verfasser]. "De novo and haplotype assembly of polyploid genomes / Mohammadhossein Moeinzadeh." Berlin : Freie Universität Berlin, 2019. http://d-nb.info/1189660237/34.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

ZACCARIA, SIMONE. "Inferring Genomic Variants and their Evolution: Combinatorial Optimization for Haplotype Assembly and Quantification of Intra-Tumor Heterogeneity." Doctoral thesis, Università degli Studi di Milano-Bicocca, 2017. http://hdl.handle.net/10281/151631.

Full text
Abstract:
Il genoma di ogni individuo nella popolazione umana è caratterizzato univocamente da un insieme di varianti genomiche che distinguono il suo DNA da ogni altro. Perciò, studiare la relazione tra queste varianti e le differenze osservabili negli individui è fondamentale per molte applicazioni mediche. Per identificare queste varianti, le tecnologie di sequenziamento producono un’enorme quantità di frammenti di DNA. Tuttavia, questi frammenti danno solo una visione approssimativa del DNA di un individuo, perché sono molto più corti, contengono diversi errori e sono ottenuti considerando milioni di cellule contemporaneamente. Di conseguenza, l’inferenza di varianti genomiche da questi frammenti è uno dei problemi principali in biologia computazionale, una branca dell’informatica che progetta algoritmi per affrontare problemi derivanti da dati biologici. Il mio lavoro di ricerca in questa tesi si colloca in questo contesto e si focalizza su due problemi che riguardano, difatti, l’inferenza di varianti genomiche da dati di sequenziamento. Per questi problemi ho adottato un approccio basato su ottimizzazione combinatoria per il quale ho introdotto nuove formulazioni combinatorie e relativi algoritmi, sfruttato caratteristiche specifiche del contesto. Il primo problema riguarda la ricostruzione dei due aplotipi di un individuo, ognuno dei quali corrisponde all’insieme di sequenze di DNA ereditate da ognuno dei due genitori. La loro ricostruzione è fondamentale per caratterizzare il genoma umano, visto che i due aplotipi sono distinti da diverse varianti genomiche. La forma piu’ frequente di varianti genomiche negli aplotipi sono i Single-Nucleotice Polymorphisms (SNPs). L’assemblaggio di aplotipi è l’approccio che cerca di ricostruire gli aplotipi e i loro SNPs a partire da frammenti di sequenziamento. Anzitutto, in questa tesi mi sono occupato di studiare la complessità parametrica e l’approssimazione di tradizionali formulazioni combinatorie di questo problema, tra cui la correzione del minimo numero di errori (MEC). Infatti, metodi basati su questi approcci hanno mostrato ottimi risultati preliminari su dati reali. Inoltre, sfruttando le caratteristiche dei frammenti prodotti dalle nuove tecnologie, quali la lunghezza aumentata e la distribuzione uniforme degli errori, ho introdotto una nuova formulazione combinatoria del problema. Per questa, ho progettato un nuovo algoritmo, chiamato HapCol, che si è rivelato in grado di migliorare i risultati ottenuti da metodi tradizionali sia su dati simulati che reali. Il secondo problema riguarda l’identificazione e quantificazione dei cloni di un singolo tumore. Infatti, un tumore deriva da un processo evolutivo in cui nuove mutazioni vengono accumulate durante la vita di un individuo in diverse cellule, risultando in distinte sottopopolazioni di cellule, chiamate cloni, ognuna delle quali è caratterizzata da una combinazione unica di varianti genomiche. L’identificazione di questi cloni, delle loro proporzioni e della loro storia evolutiva si è rivelato di fondamentale importanza sia per la diagnosi che la prognosi di tumori. In particolare, mi sono focalizzato su mutazioni frequenti nei tumori, chiamate Copy-Number Aberrations (CNAs) che amplificano o eliminano copie di segmenti genomici. Perciò, ho introdotto nuove formulazioni combinatorie che mirano ad identificare i diversi cloni distinti da queste mutazioni e la loro evoluzione a partire da frammenti di sequenziamento ottenuti da più campioni dello stesso tumore e contenenti cellule di cloni diversi. Ho studiato la loro complessita’ computazionale e ho disegnato algoritmi, basati su programmazione lineare intera e su strategie di “discesa delle coordinate”, che si sono rivelati in grado di gestire istanze di dimensioni reali su dati simulati, mentre, su dati derivanti da un tumore alla prostata, hanno offerto una visione ad alta risoluzione dei suoi cloni in termini di CNAs.
The genome of any individual in the human population is characterized by a unique complement of genomic variants distinguishing its DNA from any other. As such, studying the relationship between genomic variants and observable traits in human individuals is important for many applications in medicine. To identify these variants, current sequencing technologies produce a huge amount of DNA fragments, called reads. Unfortunately, these reads do not offer a complete and exact view of the DNA sequence since they are orders of magnitude shorter than the source, contain errors, and are obtained by considering millions of different cells together. As such, the inference of genomic variants from sequencing reads is one of the main problems in computational biology, a branch of computer science that designs algorithms for answering biological questions. The work of my thesis belongs to this context and is hence focused on the inference of genomic variants from sequencing reads. I adopt an approach based on combinatorial optimization to introduce new problem formulations and related algorithms by exploiting characteristics specific to the context. The first problem concerns the assembly of the two haplotypes contained in each cell of a human individual. A haplotype corresponds to the set of DNA sequences inherited from each parent. Since the two haplotypes comprise different genomic variants, their reconstruction is crucial for characterizing the genome of an individual: Single-Nucleotide Polymorphisms (SNPs) are the most common form of genomic variants between the two haplotypes. Haplotype Assembly is the approach that aims to reconstruct the two haplotypes and their SNPs from sequencing reads. First, in this thesis I study the parameterized tractability and approximability of traditional combinatorial problems for this approach, especially the Minimum Error Correction (MEC). In fact, methods based of these frameworks revealed preliminary good results on real data. Next, I introduce a new combinatorial formulation of MEC by exploiting the characteristics of reads produced by “future-sequencing” technologies, such as the longer reads and the uniform distribution of sequencing errors. For this problem, I design a dynamic-programming algorithm, HapCol, that outperforms current state-of-the-art methods on both simulated and real data. The second problem concerns the quantification of intra-tumor heterogeneity. Cancer results from an evolutionary process where somatic mutations accumulate in different cells during the lifetime of an individual. As such, a tumor comprises distinct subpopulations of cells, or clones, sharing a unique complement of genomic variants. The identification of these clones, their proportions, and their evolutionary history is crucial to both diagnosis and prognosis of cancer. Here, I focus on Copy-Number Aberrations (CNAs) that are mutations amplifying or deleting the copies of genomic segments. In this thesis, I introduce new combinatorial formulations of problems that aim to quantify the distinct clones in terms of CNAs and to infer their evolution from sequencing reads of multiple heterogeneous samples comprising different cells of different clones. I study the computational complexity of these problems and I design algorithms, based on integer-linear programming and coordinate-descent approach. On simulated data, I show that these algorithms scale on simulated instances of practical size, whereas on a prostate-cancer dataset they offer a high-resolution view of the tumor clones in term of CNAs, higher than previous analyses.
APA, Harvard, Vancouver, ISO, and other styles
3

Price, Jared Calvin. "The Bioluminescence Heterozygous Genome Assembler." BYU ScholarsArchive, 2014. https://scholarsarchive.byu.edu/etd/4346.

Full text
Abstract:
High-throughput DNA sequencing technologies are currently revolutionizing the fields of biology and medicine by elucidating the structure and function of the components of life. Modern DNA sequencing machines typically produce relatively short reads of DNA which are then assembled by software in an attempt to produce a representation of the entire genome. Due to the complex structure of all but the smallest genomes, especially the abundant presence of exact or almost exact repeats, all genome assemblers introduce errors into the final sequence and output a relatively large set of contigs instead of full-length chromosomes (a contig is a DNA sequence built from the overlaps between many reads). These problems are dramatically worse when homologous copies of the same chromosome differ substantially. Currently such genomes are usually avoided as assembly targets and, when they are not avoided, they generally produce assemblies of relatively low quality. An improved algorithm for the assembly of such data would dramatically improve our understanding of the genetics of a large class of organisms. We present a unique algorithm for the assembly of diploid genomes which have a high degree of variation between homologous chromosomes. The approach uses coverage, graph patterns and machine-learning classification to identify haplotype-specific sequences in the input reads. It then uses these haplotype-specific markers to guide an improved assembly. We validate the approach with a large experiment that isolates and elucidates the effect of single nucleotide polymorphisms (SNPs) on genome assembly more clearly than any previous study. The experiment conclusively demonstrates that the Bioluminescence heterozygous genome assembler produces dramatically longer contigs with fewer haplotype-switch errors than competing algorithms under conditions of high heterozygosity.
APA, Harvard, Vancouver, ISO, and other styles
4

TANGHERLONI, ANDREA. "High-Performance Computing to tackle complex problems in life sciences." Doctoral thesis, Università degli Studi di Milano-Bicocca, 2019. http://hdl.handle.net/10281/241217.

Full text
Abstract:
Nuovi ed efficienti metodi computazionali sono attualmente necessari per elaborare la ingente mole di dati generata dalle più recenti tecnologie sviluppate in svariati settori delle scienze della vita, tra cui la biologia computazionale e l’imaging medicale. In altre discipline, come la biologia dei sistemi in cui si modellano matematicamente le reti biochimiche, è necessario affrontare problemi relativi alla mancanza di dati quantitativi, e allo stesso tempo simulare efficacemente le dinamiche emergenti di queste reti. In questi contesti applicativi, le infrastrutture di calcolo ad elevate prestazioni si stanno rivelando uno strumento fondamentale per affrontare e risolvere i problemi che insorgono, in quanto permettono sia di elaborare in tempo reale ingenti quantità di dati sia di eseguire simulazioni in modo efficace ed efficiente. Durante gli ultimi anni si sta sempre di più radicando l’uso di dispositivi general-purpose caratterizzati da decine, centinaia o migliaia di core di calcolo, come ad esempio i coprocessori Many Integrated Cores e le Graphics Processing Units (GPU). L’uso delle GPU è motivato sia dalla efficienza computazionale che possono raggiungere (nell’ordine dei teraflop) grazie alle migliaia di core a disposizione sia dall’efficienza energetica che le contraddistingue. Oltre al calcolo ad elevate prestazioni, in questa tesi si sono sfruttate tecniche di intelligenza computazionale per affrontare problemi di ottimizzazione, come ad esempio la stima di parametri nella biologia dei sistemi, l’inferenza degli aplotipi nella bioinformatica, l’enhancement e la segmentazione di immagini medicali caratterizzate da istogrammi bimodali dei livelli di grigio che costituiscono le immagini stesse. La stima di parametri è stata affrontata sfruttando approcci di computazione evolutiva e di swarm intelligence insieme a nuovi simulatori accelerati su GPU - sviluppati appositamente per eseguire in parallelo sia molte simulazioni corrispondenti a diverse parametrizzazione dei modelli matematici che una singola simulazione di reti biochimiche a larga scala - permettendo di ridurre drasticamente il tempo di calcolo richiesto per calcolare le funzioni di fitness di questi approcci. Grazie alla loro efficacia nel risolvere i problemi combinatori, gli Algoritmi Genetici sono stati utilizzati per risolvere i problemi relativi alla ricostruzione degli aplotipi e l’enhancement delle immagini medicali. I due metodi proposti sono stati sviluppati sfruttando il paradigma Master-Slave che permette di distribuire il gravoso carico computazionale richiesto per risolvere questi problemi, riducendo notevolmente i tempi di calcolo. I risultati ottenuti in questa tesi mostrano come l’utilizzo del calcolo ad elevate prestazioni, unito alle tecniche di intelligenza computazionale, rappresenti una strategia efficace per la risoluzione di questi problemi, permettendo di effettuare analisi computazionali complesse richieste nelle scienze della vita.
Recent advances in several research fields of Life Sciences, such as Bioinformatics, Computational Biology and Medical Imaging, are generating huge amounts of data that require effective computational tools to be analyzed, while other disciplines, like Systems Biology, typically deal with mathematical models of biochemical networks, where issues related to the lack of quantitative parameters and the efficient description of the emergent dynamics must be faced. In these contexts, High-Performance Computing (HPC) infrastructures represent a fundamental means to tackle these problems, allowing for both real-time processing of data and fast simulations. In the latest years, the use of general-purpose many-core devices, such as Many Integrated Core coprocessors and Graphics Processing Units (GPUs), gained ground. The second ones, which are pervasive, relatively cheap and extremely efficient parallel many-core coprocessors capable of achieving tera-scale performance on common workstations, have been extensively exploited in the work presented in this thesis. Moreover, some of the problems described here require the application of Computational Intelligence (CI) methods. As a matter fact, the Parameter Estimation problem in Systems Biology, the Haplotype Assembly problem in Genome Analysis as well as the enhancement and segmentation of medical images characterized by a bimodal gray level intensity histogram can be viewed as optimization problems, which can be effectively addressed by relying on CI approaches. In the case of the Parameter Estimation problem, Evolutionary and Swarm Intelligence techniques were exploited and coupled with novel GPU-powered simulators-designed and developed in this thesis to execute both coarse-grained and fine-grained simulations-which were used to perform in a parallel fashion the biochemical simulations underlying the fitness functions required by these population-based approaches. The Haplotype Assembly and the enhancement of medical images problems were both addressed by means of Genetic Algorithms (GAs), which were shown to be very effective in solving combinatorial problems. Since the proposed approaches based on GAs are computationally demanding, a Master-Slave paradigm was exploited to distribute the workload, reducing the required running time. The overall results show that coupling HPC and CI techniques is advantageous to address these problems and speed up the computational analyses in these research fields.
APA, Harvard, Vancouver, ISO, and other styles
5

Yang, Ei Wen, and 楊翊文. "The Extension of Algorithms for Single Nucleotide Polymorphism Haplotype Assembly Problems." Thesis, 2005. http://ndltd.ncl.edu.tw/handle/87857613803901359159.

Full text
Abstract:
碩士
國立清華大學
資訊工程學系
93
Single nucleotide polymorphisms (SNPs) is one of the most considered topics. This phenomenon of genetic polymorphism is the most frequent human genetic variation and corresponding to numerous applications such as medical diagnosis, drug design and phylogenis. It is also helpful for tracking disease genes. The complete sequence of SNP varieties from a single copy of chromosomes is called a haplotype. To determine haplotypes for a single individual, one alternative method proposed in [1, 2] is based on the DNA fragments and the methodology of Shotgun Sequencing Assembly. Every DNA fragment contains several SNPs information. After an appropriate assembly of the fragments, we can get the haplotypes for a single individuals. But it is difficult to get error-free fragments in the begining, how to remove errors to obtain valid assembly of all corrected fragments becomes the first problem. Since different error types are considered, two version of the problem, Minimal Fragments Removal(MFR) and Minimal SNPs Removal(MSR), were introduced in [1]. In this paper, we revised the original algorithm for MSR on fragments at most k holes. Although the original one was claimed to run in O(mn2k+2) [3], with more careful analysis, we found that it should be an O(mn2 + n2k+2) algorithm. Moreover, the existing algorithms for MFR only use the fragments from diploid genomes as input data. We extended the algorithm such that it also works robustly on the gapless fragments from polyploid genomes.
APA, Harvard, Vancouver, ISO, and other styles
6

Chuang, Shengyu, and 莊盛宇. "Simultaneous Haplotype Assembly and Structural Variations Detection Using Next Generation Sequencing." Thesis, 2011. http://ndltd.ncl.edu.tw/handle/53850063627106944834.

Full text
Abstract:
碩士
國立中正大學
資訊工程研究所
100
The genomes of most species in the biosphere is a diploid genome composed of two haplotypes. However, existing short-read assemblers for next-generation sequencing (NGS) platforms only reconstruct one consensus sequence which is a mosaic of the two haplotypes. In addition, the differences between the two haplotypes range from Single Nucleotide Polymorphisms (SNPs) to large-scale structure variations (SVs). Therefore, de novo haplotype assembly of a diploid genome is a still challenging task using NGS platforms. In this thesis, we design and implement a new framework called HapSVAssembler for de novo assembly of a diploid genome using short paired-end reads. HapSVAssembler uses a hybrid assembly approach to build a consensus sequence, identify heterozygous SNPs and SV loci, and simultaneously reconstruct the SNP/SV haplotypes via reads spanning two or more SNPs/SVs. A new optimization problem is formulated and solved by Genetic Algorithm (GA). The experimental results indicated that the assembly accuracies and continuity of HapSVAssembler is much higher than previous methods. With the ability of assembling haplotypes containing multiple types of genomic variations, HapSVAssembler is very useful for studying linkage disequilibrium across different variations.
APA, Harvard, Vancouver, ISO, and other styles

Book chapters on the topic "Haplotype assembly"

1

Aldinucci, Marco, Andrea Bracciali, Tobias Marschall, Murray Patterson, Nadia Pisanti, and Massimo Torquati. "High-Performance Haplotype Assembly." In Computational Intelligence Methods for Bioinformatics and Biostatistics, 245–58. Cham: Springer International Publishing, 2015. http://dx.doi.org/10.1007/978-3-319-24462-4_21.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Patterson, Murray, Tobias Marschall, Nadia Pisanti, Leo van Iersel, Leen Stougie, Gunnar W. Klau, and Alexander Schönhuth. "WhatsHap: Haplotype Assembly for Future-Generation Sequencing Reads." In Lecture Notes in Computer Science, 237–49. Cham: Springer International Publishing, 2014. http://dx.doi.org/10.1007/978-3-319-05269-4_19.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Sun, Hequan, José A. Campoy, and Korbinian Schneeberger. "Gamete Binning to Achieve Haplotype-Resolved Genome Assembly." In Methods in Molecular Biology, 201–18. New York, NY: Springer US, 2022. http://dx.doi.org/10.1007/978-1-0716-2819-5_13.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

HALLDÓRSSON, BJARNI V., DEREK AGUIAR, and SORIN ISTRAIL. "HAPLOTYPE PHASING BY MULTI-ASSEMBLY OF SHARED HAPLOTYPES: PHASE-DEPENDENT INTERACTIONS BETWEEN RARE VARIANTS." In Biocomputing 2011, 88–99. WORLD SCIENTIFIC, 2010. http://dx.doi.org/10.1142/9789814335058_0010.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Conference papers on the topic "Haplotype assembly"

1

Si, Hongbo, Haris Vikalo, and Sriram Vishwanath. "Haplotype assembly: An information theoretic view." In 2014 IEEE Information Theory Workshop (ITW). IEEE, 2014. http://dx.doi.org/10.1109/itw.2014.6970817.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Das, Shreepriya, and Haris Vikalo. "Optimal haplotype assembly with statistical pruning." In 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP). IEEE, 2014. http://dx.doi.org/10.1109/globalsip.2014.7032339.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

AGUIAR, DEREK, WENDY S. W. WONG, and SORIN ISTRAIL. "TUMOR HAPLOTYPE ASSEMBLY ALGORITHMS FOR CANCER GENOMICS." In Proceedings of the Pacific Symposium. WORLD SCIENTIFIC, 2013. http://dx.doi.org/10.1142/9789814583220_0002.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Zhao, Yuying, and Jinshan Li. "Label Propagation Algorithm for Haplotype Assembly Problem." In 2015 4th International Conference on Mechatronics, Materials, Chemistry and Computer Engineering. Paris, France: Atlantis Press, 2015. http://dx.doi.org/10.2991/icmmcce-15.2015.385.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Puljiz, Zrinka, and Haris Vikalo. "A message passing algorithm for haplotype assembly." In 2013 Asilomar Conference on Signals, Systems and Computers. IEEE, 2013. http://dx.doi.org/10.1109/acssc.2013.6810596.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Moeinzadeh, M.-Hossein, Ehsan Asgariarn, Morteza Mohammad Noori, Mehdi Sadeghi, and Sara Sharifian-R. "F.C.A: Designing a fuzzy clustering algorithm for haplotype assembly." In 2009 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). IEEE, 2009. http://dx.doi.org/10.1109/fuzzy.2009.5277349.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Kamath, Govinda M., Eren Sasoglu, and David Tse. "Optimal haplotype assembly from high-throughput mate-pair reads." In 2015 IEEE International Symposium on Information Theory (ISIT). IEEE, 2015. http://dx.doi.org/10.1109/isit.2015.7282588.

Full text
APA, Harvard, Vancouver, ISO, and other styles
8

Wu, Jingli, Jianxin Wang, and Jian'er Chen. "A Genetic Algorithm for Single Individual SNP Haplotype Assembly." In 2008 9th International Conference for Young Computer Scientists (ICYCS). IEEE, 2008. http://dx.doi.org/10.1109/icycs.2008.95.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Yang, Yang, and Minzhu Xie. "A parameterized enumeration algorithm for the haplotype assembly problem." In 2013 6th International Conference on Biomedical Engineering and Informatics (BMEI). IEEE, 2013. http://dx.doi.org/10.1109/bmei.2013.6746986.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

Xu, Xinshun, Jun Ma, and Jiahai Wang. "A Hopfield-Type Neural Network for Haplotype Assembly Problem." In 2008 Fourth International Conference on Natural Computation. IEEE, 2008. http://dx.doi.org/10.1109/icnc.2008.570.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Reports on the topic "Haplotype assembly"

1

Weng, Ziqing, Dorian J. Garrick, Mahdi Saatchi, Robert Schnabel, and Jeremy Taylor. Impact of Pedigree Information and Genome Assembly Errors on Inference of SNP Haplotypes in Cattle. Ames (Iowa): Iowa State University, January 2013. http://dx.doi.org/10.31274/ans_air-180814-632.

Full text
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography