Please visit Jefferson Lab Event Policies and Guidance before planning your next event: https://www.jlab.org/conference_planning.

Indico is being upgraded to version 3.3.4 on October 15, 2024. There are no impacts to events expected. There are no major feature updates – only minor feature improvements and bugfixes. See the news link for more information.

May 8 – 12, 2023
Norfolk Waterside Marriott
US/Eastern timezone

Coffea-Casa: Building composable analysis facilities for the HL-LHC

May 9, 2023, 10:00 AM
30m
Norfolk Ballroom III-V (Norfolk Waterside Marriott)

Norfolk Ballroom III-V

Norfolk Waterside Marriott

235 East Main Street Norfolk, VA 23510
Plenary Track 7 - Facilities and Virtualization Plenary Session

Speaker

Shadura, Oksana (University Nebraska-Lincoln (US))

Description

The large data volumes expected from the High Luminosity LHC (HL-LHC) present challenges to existing paradigms and facilities for end-user data analysis. Modern cyberinfrastructure tools provide a diverse set of services that can be composed into a system that provides physicists with powerful tools that give them straightforward access to large computing resources, with low barriers to entry. The coffea-casa analysis facility provides an environment for end users enabling the execution of increasingly complex analyses such as those demonstrated by the Analysis Grand Challenge (AGC) and capturing the features that physicists will need for the HL-LHC.
We describe the development progress of the coffea-casa facility featuring its modularity while demonstrating the ability to port and customize the facility software stack to other locations. The facility also facilitates the support of different backends to other batch systems while staying Kubernetes-native.
We present evolved architecture of the facility, such as the integration of advanced data delivery services (e.g. ServiceX) and making data caching services (e.g. XCache) available to end users of the facility.
We also highlight the composability of modern cyberinfrastructure tools. To enable machine learning pipelines at coffee-casa analysis facilities, a set of industry ML solutions adopted for HEP columnar analysis were integrated on top of existing facility services. These services also feature transparent access for user workflows to GPUs available at a facility via inference servers while using Kubernetes as enabling technology.

Consider for long presentation Yes

Primary authors

Shadura, Oksana (University Nebraska-Lincoln (US)) Bloom, Kenneth (University of Nebraska-Lincoln) Bockelman, Brian (Morgridge Institute for Research) Lundstedt, Carl (University Nebraska-Lincoln) Wightman, Andrew (University Nebraska-Lincoln) Attebury, Garhan (University Nebraska-Lincoln) Thiltges, John (University Nebraska-Lincoln) Albin, Sam (University Nebraska-Lincoln (US))

Presentation materials

Peer reviewing

Paper