Automated Optimization Methods for Scientific Workflows in e-Science Infrastructures

Author : Sonja Holl
Publisher : Forschungszentrum Jülich
Total Pages : 207
Release : 2014
ISBN-10 : 389336949X
ISBN-13 : 9783893369492

Book Synopsis Automated Optimization Methods for Scientific Workflows in e-Science Infrastructures by : Sonja Holl

Automated Optimization Methods for Scientific Workflows in e-Science Infrastructures was written by Sonja Holl and published by Forschungszentrum Jülich in 2014. It has 207 pages. Book excerpt: Scientific workflows have emerged as a key technology that assists scientists with the design, management, execution, sharing and reuse of in silico experiments. Workflow management systems simplify the management of scientific workflows by providing graphical interfaces for their development, monitoring and analysis. Nowadays, e-Science combines such workflow management systems with large-scale data and computing resources into complex research infrastructures. For instance, e-Science supports the spread of best-practice research in collaborations by providing workflow repositories, which facilitate the sharing and reuse of scientific workflows. However, scientists still face limitations when reusing workflows. One of the most common challenges is selecting appropriate applications and their individual execution parameters. If scientists do not want to rely on default or experience-based parameters, the best-effort option is to test different workflow set-ups using either trial and error or parameter sweeps. Trial and error may be inefficient and parameter sweeps time consuming, especially when tuning a large number of parameters. Therefore, scientists require an effective and efficient mechanism that automatically tests different workflow set-ups in an intelligent way and helps them improve their scientific results. This thesis addresses the limitation described above by defining and implementing an approach for the optimization of scientific workflows. In the course of this work, scientists’ needs are investigated and requirements are formulated, resulting in an appropriate optimization concept.
In the next step, this concept is prototypically implemented by extending a workflow management system with an optimization framework that includes the general mechanisms required to conduct workflow optimization. As optimization is an ongoing research topic, different algorithms are provided by pluggable extensions (plugins) that can be loosely coupled with the framework, resulting in a generic and quickly extendable system. In this thesis, an exemplary plugin is introduced which applies a Genetic Algorithm for parameter optimization. To accelerate workflow optimization, and thereby make it feasible at all, e-Science infrastructures are used for the parallel execution of scientific workflows. This is enabled by additional extensions that allow applications and workflows to run on distributed computing resources. The implementation, and with it the general approach to workflow optimization, is experimentally verified by four use cases in the life science domain. All workflows were significantly improved, which demonstrates the advantage of the proposed workflow optimization. Finally, a new collaboration-based approach is introduced that harnesses optimization provenance to make optimization faster and more robust in the future.
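
To make the idea in the excerpt concrete, here is a minimal sketch of genetic-algorithm parameter optimization with parallel evaluation of candidate set-ups. It is an illustration under stated assumptions, not the thesis's framework or plugin: the parameter names and ranges in `BOUNDS`, the stand-in `run_workflow` scoring function, and all settings (population size, mutation rate, elitism, thread pool size) are hypothetical. A local thread pool stands in for distributed computing resources.

```python
import random
from concurrent.futures import ThreadPoolExecutor

# Hypothetical parameter ranges for an imagined workflow (not taken from the thesis).
BOUNDS = {"threshold": (0.0, 1.0), "window": (1.0, 50.0)}


def run_workflow(params):
    """Stand-in for one workflow execution; returns a score (higher is better)."""
    return -((params["threshold"] - 0.3) ** 2) - ((params["window"] - 20.0) / 50.0) ** 2


def random_individual(rng):
    # One candidate workflow set-up: a value for every parameter within its range.
    return {k: rng.uniform(lo, hi) for k, (lo, hi) in BOUNDS.items()}


def crossover(a, b, rng):
    # Uniform crossover: each parameter is inherited from one of the two parents.
    return {k: (a[k] if rng.random() < 0.5 else b[k]) for k in BOUNDS}


def mutate(ind, rng, rate=0.2):
    # Gaussian mutation, clamped so values stay inside the allowed range.
    out = dict(ind)
    for k, (lo, hi) in BOUNDS.items():
        if rng.random() < rate:
            out[k] = min(hi, max(lo, out[k] + rng.gauss(0.0, 0.1 * (hi - lo))))
    return out


def optimize(generations=30, pop_size=20, elite=2, seed=0):
    rng = random.Random(seed)
    population = [random_individual(rng) for _ in range(pop_size)]
    with ThreadPoolExecutor(max_workers=4) as pool:
        for _ in range(generations):
            # Evaluate all candidates of this generation in parallel.
            scores = list(pool.map(run_workflow, population))
            ranked = [p for _, p in sorted(zip(scores, population),
                                           key=lambda t: t[0], reverse=True)]
            parents = ranked[: pop_size // 2]
            children = [
                mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
                for _ in range(pop_size - elite)
            ]
            # Elitism: the best candidates survive unchanged.
            population = ranked[:elite] + children
        scores = list(pool.map(run_workflow, population))
    best_score, best = max(zip(scores, population), key=lambda t: t[0])
    return best, best_score


if __name__ == "__main__":
    best, score = optimize()
    print(best, score)
```

Because the best candidates are carried over unchanged, the best score never decreases from one generation to the next. In a real setting `run_workflow` would submit a workflow run and return a domain-specific quality measure, which is where most of the cost lies and why parallel evaluation matters.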


Related Books

Scientific and Statistical Database Management
Language: en
Pages: 659
Authors: Marianne Winslett
Categories: Business & Economics
Type: BOOK - Published: 2009-05-22 - Publisher: Springer Science & Business Media

This book constitutes the refereed proceedings of the 21st International Conference on Scientific and Statistical Database Management, SSDBM 2009, held in New O
Guide to e-Science
Language: en
Pages: 554
Authors: Xiaoyu Yang
Categories: Computers
Type: BOOK - Published: 2011-05-26 - Publisher: Springer Science & Business Media

This guidebook on e-science presents real-world examples of practices and applications, demonstrating how a range of computational technologies and tools can be
Workflows for e-Science
Language: en
Pages: 532
Authors: Ian J. Taylor
Categories: Computers
Type: BOOK - Published: 2007-12-31 - Publisher: Springer Science & Business Media

This is a timely book presenting an overview of the current state-of-the-art within established projects, presenting many different aspects of workflow from use
Data Management in Grid and Peer-to-Peer Systems
Language: en
Pages: 158
Authors: Abdelkader Hameurlain
Categories: Computers
Type: BOOK - Published: 2009-08-24 - Publisher: Springer

The synergy and convergence of research on grid computing and peer-to-peer (P2P) computing have materialized in the meeting of the two research communities: par