Pan-genome Analysis, Visualization and Exploration

DSpace Repositorium (Manakin basiert)

Zur Kurzanzeige

dc.contributor.advisor Neher, Richard (Dr.)
dc.contributor.author Ding, Wei
dc.date.accessioned 2018-02-02T07:36:01Z
dc.date.available 2018-02-02T07:36:01Z
dc.date.issued 2018
dc.identifier.other 498077837 de_DE
dc.identifier.uri http://hdl.handle.net/10900/80098
dc.identifier.uri http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-800985 de_DE
dc.identifier.uri http://dx.doi.org/10.15496/publikation-21493
dc.description.abstract The dynamics of prokaryotic genomes are driven by the intricate interplay of different evolutionary forces such as gene duplication, gene loss and horizontal transfer. Even closely related strains can exhibit remarkable genetic diversity and substantial gene presence/absence variation. The pan-genome, namely the complete inventory of genes in a collection of strains, can be several times larger than the genome of any single strain. Although several tools for pan-genome analysis have been published, there is still much room for algorithmic improvement, as well as needs for applications that better interactively visualize and explore pan-genomes. Therefore, we have developed panX, an automated computational pipeline for efficient identification of orthologous gene clusters in the pan-genome. PanX identifies homologous relationships among genes using DIAMOND and MCL and then harnesses phylogeny-based post- processing to separate orthologs from paralogs. Furthermore, we take advantage of a divide-and-conquer strategy to achieve an approximately linear runtime on large datasets. The analysis result can be visualized by the accompanying software, an easy-to-use and powerful web-based visualization application for interactive exploration of the pan-genome. The visualization dashboard encompasses a variety of connected components that allow rapid searching, filtering and sorting of genes and flexible investigation of evolutionary relationships among strains and their genes. PanX seamlessly interlinks gene clusters with their alignments and gene phylogenies, maps mutations on the branches of gene tree and highlights gene gain and loss events on the core-genome phylogeny that can also be colored by metadata associated with strains. By using 120 simulated pan-genome datasets for benchmarking and comparing clustering results on real dataset between different tools, panX exhibits overall good performance across a large range of diversities. PanX is available at pangenome.de, with a wide range of microbial pan-genomes established. Besides, user-provided pan-genomes can be visualized either via a web server or by running panX locally as a web-based application. en
dc.language.iso en de_DE
dc.publisher Universität Tübingen de_DE
dc.rights ubt-podok de_DE
dc.rights.uri http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de de_DE
dc.rights.uri http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en en
dc.subject.classification Bioinformatik , Datenanalyse , Visualisierung de_DE
dc.subject.ddc 570 de_DE
dc.subject.other Pan-genome analysis en
dc.subject.other Pan-genome visualization en
dc.title Pan-genome Analysis, Visualization and Exploration en
dc.type PhDThesis de_DE
dcterms.dateAccepted 2018-01-29
utue.publikation.fachbereich Biologie de_DE
utue.publikation.fakultaet 7 Mathematisch-Naturwissenschaftliche Fakultät de_DE

Dateien:

Das Dokument erscheint in:

Zur Kurzanzeige