
May 8 – 12, 2023
Norfolk Waterside Marriott
US/Eastern timezone

Accounting and monitoring tools enhancements for Run 3 in ATLAS distributed computing

Not scheduled
1h
Hampton Roads Ballroom and Foyer Area (Norfolk Waterside Marriott)

235 East Main Street Norfolk, VA 23510
Poster Session

Speaker

Chudoba, Jiri

Description

The ATLAS experiment at the LHC relies on complex, multicomponent distributed systems for processing data (PanDA WMS) and managing it (Rucio). The complexity of the relationships between components, the volume of data being processed, and the continuous development of new functionality in these critical systems are the main challenges in building monitoring and accounting tools that can adapt quickly to a dynamic environment. To overcome these challenges, ATLAS has used the unified monitoring infrastructure (UMA) provided by CERN-IT since 2018. UMA accumulates information from distributed data sources and makes it available to the different ATLAS distributed computing user groups. The information is displayed in Grafana dashboards, which can be grouped by the information they provide: “data transfers”, “site accounting”, “jobs accounting” and so on. ATLAS users rely on these monitoring tools daily to spot and fix issues. In addition, LHC Run 3 required significant changes to the monitoring and accounting infrastructure in order to collect and process the data recorded by ATLAS during the run. Recent enhancements to the UMA-based monitoring and accounting dashboards will be described.

Consider for long presentation: No

Primary authors

Barberis, Dario (Università e INFN Genova)
Svatos, Michal (Prague AS)

Co-author

Alekseev, Aleksandr (UT Arlington)
