Skip to main content
Publication

Globus automation services: Research process automation across the spacetime continuum

Authors

Chard, Ryan; Pruyne, Jim; McKee, Kurt; Bryan, Josh; Raumann, Brigitte; Ananthakrishnan, Rachana; Chard, Kyle; Foster, Ian

Abstract

Research process automation-the reliable, efficient, and reproducible execution of linked sets of actions on scientific instruments, computers, data stores, and other resources-has emerged as an essential element of modern science. We report here on new services within the Globus research data management platform that enable the specification of diverse research processes as reusable sets of actions, flows, and the execution of such flows in heterogeneous research environments. To support flows with broad spatial extent (e.g., from scientific instrument to remote data center) and temporal extent (from seconds to weeks), these Globus automation services feature: (1) cloud hosting for reliable execution of even long-lived flows despite sporadic failures; (2) a simple specification and extensible asynchronous action provider API, for defining and executing a wide variety of actions and flows involving heterogeneous resources; (3) an event-driven execution model for automating execution of flows in response to arbitrary events; and (4) a rich security model enabling authorization delegation mechanisms for secure execution of long-running actions across distributed resources. These services permit researchers to outsource and automate the management of a broad range of research tasks to a reliable, scalable, and secure cloud platform. We present use cases for Globus automation services, describe their design and implementation, present microbenchmark studies, and review experiences applying the services in a range of applications.(c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://​cre​ativecom​mons​.org/​l​i​c​e​n​s​e​s​/​b​y​-​n​c​-​n​d​/4.0/).