Data Descriptor: Next generation sequencing data of a defined microbial mock community

Singer, E; Andreopoulos, B; Bowers, RM; Lee, J; Deshpande, S; Chiniquy, J; Ciobanu, D; Klenk, HP; Zane, M; Daum, C; Clum, A; Cheng, JF; Copeland, A; Woyke, T

doi:10.1038/sdata.2016.81

Data Descriptor: Next generation sequencing data of a defined microbial mock community

Lookup NU author(s): Professor Hans-Peter Klenk

Downloads

Published version [.pdf]

Licence

This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).

Abstract

Generating sequence data of a defined community composed of organisms with complete reference genomes is indispensable for the benchmarking of new genome sequence analysis methods, including assembly and binning tools. Moreover the validation of new sequencing library protocols and platforms to assess critical components such as sequencing errors and biases relies on such datasets. We here report the next generation metagenomic sequence data of a defined mock community (Mock Bacteria ARchaea Community; MBARC-26), composed of 23 bacterial and 3 archaeal strains with finished genomes. These strains span 10 phyla and 14 classes, a range of GC contents, genome sizes, repeat content and encompass a diverse abundance profile. Short read Illumina and long-read PacBio SMRT sequences of this mock community are described. These data represent a valuable resource for the scientific community, enabling extensive benchmarking and comparative evaluation of bioinformatics tools without the need to simulate data. As such, these data can aid in improving our current sequence data analysis toolkit and spur interest in the development of new tools.

Publication metadata

Author(s): Singer E, Andreopoulos B, Bowers RM, Lee J, Deshpande S, Chiniquy J, Ciobanu D, Klenk HP, Zane M, Daum C, Clum A, Cheng JF, Copeland A, Woyke T

Publication type: Article

Publication status: Published

Journal: Scientific Data

Year: 2016

Volume: 3

Online publication date: 27/09/2016

Acceptance date: 04/08/2016

Date deposited: 14/02/2017

ISSN (electronic): 2052-4463

Publisher: Nature Publishing Group

URL: http://dx.doi.org/10.1038/sdata.2016.81

DOI: 10.1038/sdata.2016.81

Altmetrics

Altmetrics provided by Altmetric

Funding

Funder reference	Funder name
DE-AC02-05CH11231	U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility

ePrints

Data Descriptor: Next generation sequencing data of a defined microbial mock community

Downloads

Licence

Abstract

Publication metadata

Altmetrics

Funding

Share