The ATLAS PanDA Pilot in Operation

P Nilsson(The University of Texas at Arlington), José Manuel Rodríguez Caballero(Brookhaven National Laboratory), K. De(The University of Texas at Arlington), T. Maeno(Brookhaven National Laboratory), A. R. Stradling(The University of Texas at Arlington), T. Wenaus(Brookhaven National Laboratory)
Journal of Physics Conference Series
December 23, 2011
Cited by 24Open Access
Full Text

Abstract

The ATLAS Production and Distributed Analysis system (PanDA) was designed to meet ATLAS requirements for a data-driven workload management system capable of operating at LHC data processing scale. Submitted jobs are executed on worker nodes by pilot jobs sent to the grid sites by pilot factories. This poster provides an overview of the PanDA pilot system and presents major features added in light of recent operational experience, including multi-job processing, advanced job recovery for jobs with output storage failures, gLExec based identity switching from the generic pilot to the actual user, and other security measures. The PanDA system serves all ATLAS distributed processing and is the primary system for distributed analysis; it is currently used at over 100 sites world-wide. We analyze the performance of the pilot system in processing real LHC data on the OSG, EGI and Nordugrid infrastructures used by ATLAS, and describe plans for its evolution.


Related Papers

No related papers found

Powered by citation graph analysis