Browse by author
Lookup NU author(s): Professor Hans-Peter Klenk
Full text for this publication is not currently held within this repository. Alternative links are provided below where available.
Genome Blast Distance Phylogeny (GBDP) infers distances and phylogenetic relationships between organisms from completely or partially sequenced genomes. It is well suited for parallelization as pairwise distances are calculated independently. As exemplar data for a high-performance cluster implementation that executes many pairwise genome comparisons in parallel, we here used sequences from the Genomic Encyclopedia of Bacteria and Archaea project. Phylogenies were inferred from genome-scale nucleotide and amino acid data with all variants of GBDP, including novel adaptations to amino acid sequences and approaches yielding trees with branch support. The dependency of phylogenetic accuracy, average branch support as well as performance indicators such as running time and disk space consumption on details of genome comparison, distance calculation, and phylogenetic inference was examined in detail. If combined with conservative measures for branch support, GBDP appears to infer reasonable phylogenetic relationships of microorganisms with a comparatively low computational cost. Due to the linear speed-up of the cluster, benchmarks reveal an overall computation time of less than 24 h required for the 7750 pairwise genome/proteome comparisons of the Genomic Encyclopedia of Bacteria and Archaea data set that is opposed to an estimated running time of about 30 days for the non-parallelized version.
Author(s): Meier-Kolthoff JP, Auch AF, Klenk HP, Göker M
Publication type: Article
Publication status: Published
Journal: Concurrency and Computation Practice and Experience
Print publication date: 01/07/2014
Online publication date: 07/08/2013
Acceptance date: 19/07/2013
ISSN (print): 1532-0626
ISSN (electronic): 1532-0634
Publisher: John Wiley & Sons, Inc.
Altmetrics provided by Altmetric