Toggle Main Menu Toggle Search

Open Access padlockePrints

DistillFlow: Removing redundancy in scientific workflows

Lookup NU author(s): Professor Paolo MissierORCiD


Full text for this publication is not currently held within this repository. Alternative links are provided below where available.


Scientific workflows management systems are increasingly used by scientists to specify complex data processing pipelines. Workflows are represented using a graph structure, where nodes represent tasks and links represent the dataflow. However, the complexity of workflow structures is increasing over time, reducing the rate of scientific workflows reuse. Here, we introduce DistillFlow, a tool based on effective methods for workflow design, with a focus on the Taverna model. DistillFlow is able to detect "anti-patterns" in the structure of workflows (idiomatic forms that lead to over-complicated design) and replace them with different patterns to reduce the workflow's overall structural complexity. Rewriting workflows in this way is beneficial both in terms of user experience and workflow maintenance. © Copyright 2014 ACM 978-1-4503-2722-0/14/06⋯$15.00.

Publication metadata

Author(s): Chen J, Cohen-Boulakia S, Froidevaux C, Goble C, Missier P, Williams AR

Publication type: Conference Proceedings (inc. Abstract)

Publication status: Published

Conference Name: Proceedings of the 26th International Conference on Scientific and Statistical Database Management (SSDBM 14)

Year of Conference: 2014

Pages: 46

Online publication date: 30/06/2014

Acceptance date: 01/01/1900

Publisher: ACM


DOI: 10.1145/2618243.2618287

Library holdings: Search Newcastle University Library for this item

ISBN: 9781450327220