Toggle Main Menu Toggle Search

Open Access padlockePrints

Exploring microbial genome sequences to identify protein families on the grid

Lookup NU author(s): Professor Anil Wipat, Dr Matthew Pocock, Professor Pete Lee, Dr Keith Flanagan, James Worthington



The analysis of microbial genome sequences can identify protein families that provide potential drug targets for new antibiotics. With the rapid accumulation of newly sequenced genomes, this analysis has become a computationally intensive and data-intensive problem. This paper describes the development of a Web-service-enabled, component-based, architecture to support the large-scale comparative analysis of complete microbial genome sequences and the subsequent identification of orthologues and protein families (Microbase). The system is coordinated through the use of Web-service-based notifications and integrates distributed computing resources together with genomic databases to realize all-against-all comparisons for a large volume of genome sequences and to present the data in a computationally amenable format through a Web service interface. We demonstrate the use of the system in searching for orthologues and candidate protein families, which ultimately could lead to the identification of potential therapeutic targets. © 2007 IEEE.

Publication metadata

Author(s): Sun Y, Wipat A, Pocock M, Lee PA, Flanagan K, Worthington JT

Publication type: Article

Publication status: Published

Journal: IEEE Transactions on Information Technology in Biomedicine

Year: 2007

Volume: 11

Issue: 4

Pages: 435-442

Date deposited: 26/05/2010

ISSN (print): 1089-7771

ISSN (electronic): 1558-0032

Publisher: IEEE


DOI: 10.1109/TITB.2007.892913

PubMed id: 17674626


Altmetrics provided by Altmetric