Pan-genome Analysis, Visualization and Exploration

Ding, Wei

Publikationsdienste
→
TOBIAS-lib - Publikationen und Dissertationen
→
7 Mathematisch-Naturwissenschaftliche Fakultät
→
Dokumentanzeige

dc.contributor.advisor	Neher, Richard (Dr.)
dc.contributor.author	Ding, Wei
dc.date.accessioned	2018-02-02T07:36:01Z
dc.date.available	2018-02-02T07:36:01Z
dc.date.issued	2018
dc.identifier.other	498077837	de_DE
dc.identifier.uri	http://hdl.handle.net/10900/80098
dc.identifier.uri	http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-800985	de_DE
dc.identifier.uri	http://dx.doi.org/10.15496/publikation-21493
dc.identifier.uri	http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-800985	de_DE
dc.identifier.uri	http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-800980	de_DE
dc.description.abstract	The dynamics of prokaryotic genomes are driven by the intricate interplay of different evolutionary forces such as gene duplication, gene loss and horizontal transfer. Even closely related strains can exhibit remarkable genetic diversity and substantial gene presence/absence variation. The pan-genome, namely the complete inventory of genes in a collection of strains, can be several times larger than the genome of any single strain. Although several tools for pan-genome analysis have been published, there is still much room for algorithmic improvement, as well as needs for applications that better interactively visualize and explore pan-genomes. Therefore, we have developed panX, an automated computational pipeline for efficient identification of orthologous gene clusters in the pan-genome. PanX identifies homologous relationships among genes using DIAMOND and MCL and then harnesses phylogeny-based post- processing to separate orthologs from paralogs. Furthermore, we take advantage of a divide-and-conquer strategy to achieve an approximately linear runtime on large datasets. The analysis result can be visualized by the accompanying software, an easy-to-use and powerful web-based visualization application for interactive exploration of the pan-genome. The visualization dashboard encompasses a variety of connected components that allow rapid searching, filtering and sorting of genes and flexible investigation of evolutionary relationships among strains and their genes. PanX seamlessly interlinks gene clusters with their alignments and gene phylogenies, maps mutations on the branches of gene tree and highlights gene gain and loss events on the core-genome phylogeny that can also be colored by metadata associated with strains. By using 120 simulated pan-genome datasets for benchmarking and comparing clustering results on real dataset between different tools, panX exhibits overall good performance across a large range of diversities. PanX is available at pangenome.de, with a wide range of microbial pan-genomes established. Besides, user-provided pan-genomes can be visualized either via a web server or by running panX locally as a web-based application.	en
dc.language.iso	en	de_DE
dc.publisher	Universität Tübingen	de_DE
dc.rights	ubt-podok	de_DE
dc.rights.uri	http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de	de_DE
dc.rights.uri	http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en	en
dc.subject.classification	Bioinformatik , Datenanalyse , Visualisierung	de_DE
dc.subject.ddc	570	de_DE
dc.subject.other	Pan-genome analysis	en
dc.subject.other	Pan-genome visualization	en
dc.title	Pan-genome Analysis, Visualization and Exploration	en
dc.type	PhDThesis	de_DE
dcterms.dateAccepted	2018-01-29
utue.publikation.fachbereich	Biologie	de_DE
utue.publikation.fakultaet	7 Mathematisch-Naturwissenschaftliche Fakultät	de_DE

Dateien:	WeiDing-PhDThesis-final.pdf 3.72 MB PDF

Das Dokument erscheint in:

7 Mathematisch-Naturwissenschaftliche Fakultät [5348]

Zur Kurzanzeige

Veröffentlichen

Stöbern

Gesamter Bestand
Diese Sammlung

Mein Benutzerkonto

Einloggen

Pan-genome Analysis, Visualization and Exploration

DSpace Repositorium (Manakin basiert)

Das Dokument erscheint in:

Stöbern

Gesamter Bestand

Diese Sammlung

Mein Benutzerkonto