
Custom Execution Environments with Containers in Pegasus-enabled Scientific Workflows (1905.08204v1)

Published 20 May 2019 in cs.DC

Abstract: Science reproducibility is a cornerstone feature in scientific workflows. In most cases, this has been implemented as a way to exactly reproduce the computational steps taken to reach the final results. While these steps are often completely described, including the input parameters, datasets, and codes, the environment in which these steps are executed is only described at a higher level with endpoints and operating system name and versions. Though this may be sufficient for reproducibility in the short term, systems evolve and are replaced over time, breaking the underlying workflow reproducibility. A natural solution to this problem is containers, as they are well defined, have a lifetime independent of the underlying system, and can be user-controlled so that they can provide custom environments if needed. This paper highlights some unique challenges that may arise when using containers in distributed scientific workflows. Further, this paper explores how the Pegasus Workflow Management System implements container support to address such challenges.

Authors (10)
  1. Karan Vahi (14 papers)
  2. Mats Rynge (9 papers)
  3. George Papadimitriou (15 papers)
  4. Duncan A. Brown (51 papers)
  5. Rajiv Mayani (3 papers)
  6. Rafael Ferreira da Silva (31 papers)
  7. Ewa Deelman (37 papers)
  8. Anirban Mandal (12 papers)
  9. Eric Lyons (3 papers)
  10. Michael Zink (12 papers)
Citations (8)
