In today's Nuclear Physics (NP), the exploration of the origin, evolution, and structure of the universe's matter is pursued through a broad research program at various collaborative scales, ranging from small groups to large experiments comparable in size to those in high-energy physics (HEP). Consequently, software and computing efforts vary from DIY approaches among a few researchers to...
Next generation High-Energy Physics (HEP) experiments are presented with significant computational challenges, both in terms of data volume and processing power. Using compute accelerators, such as GPUs, is one of the promising ways to provide the necessary computational power to meet the challenge. The current programming models for compute accelerators often involve using...
The dCache project provides open-source software deployed internationally to satisfy
ever more demanding storage requirements. Its multifaceted approach provides an integrated
way of supporting different use-cases with the same storage, from high-throughput data
ingest, data sharing over wide area networks and efficient access from HPC clusters, to
long-term data persistence on a tertiary...
Machine learning has become one of the important tools for High Energy Physics analysis. As the size of the dataset increases at the Large Hadron Collider (LHC), and the search spaces grow ever larger in order to exploit the full physics potential, more and more computing resources are required for processing these machine learning tasks. In addition, complex advanced...
Since 1984 the Italian groups of the Istituto Nazionale di Fisica Nucleare (INFN) and Italian Universities, collaborating with the
DOE laboratory of Fermilab (US), have been running a two-month summer training program for Italian university students. While
in the first year the program involved only four physics students of the University of Pisa, in the following years it was extended
to...
The Julia programming language was created 10 years ago and is now a mature and stable language with a large ecosystem including more than 8,000 third-party packages. It was designed for scientific programming to be a high-level and dynamic language, as Python is, while achieving runtime performance comparable to C/C++ or even faster. With this, we ask ourselves if the Julia language and its...
The reconstruction of particle trajectories is a key challenge of particle physics experiments, as it directly impacts particle identification and physics performance while also representing one of the main CPU consumers of many high energy physics experiments. As the luminosity of particle colliders increases, this reconstruction will become more challenging and resource intensive. New...
Hadronization is an important step in Monte Carlo event generators, where quarks and gluons are bound into physically observable hadrons. Today’s generators rely on finely-tuned empirical models, such as the Lund string model; while these models have been quite successful overall, there remain phenomenological areas where they do not match data well. In this talk, we present MLHad, a...
Managing a secure software environment is essential to a trustworthy cyberinfrastructure. Software supply chain attacks may be a top concern for IT departments, but they are also an aspect of scientific computing. The threat to scientific reputation caused by problematic software can be just as dangerous as an environment contaminated with malware. The issue of managing environments affects...
We present a new implementation of simulation-based inference using data collected by the ATLAS experiment at the LHC. The method relies on large ensembles of deep neural networks to approximate the exact likelihood. Additional neural networks are introduced to model systematic uncertainties in the measurement. Training of the large number of deep neural networks is automated using a...
Providing computing training to the next generation of physicists is the
principal driver for a biannual multi-day workshop hosted by the DUNE
Computing Consortium. Materials are cast in the Software Carpentries
templates, and to date topics have included storage space, data
management, LArSoft, grid job submission and monitoring. Moreover,
experts provide extended breakout sessions to...
XRootD implemented a client-side erasure coding (EC) algorithm utilizing the Intel Intelligent Storage Acceleration Library. At SLAC, a prototype of XRootD EC storage was set up for evaluation. The architecture and configuration of the prototype are almost identical to those of a traditional non-EC XRootD storage behind a firewall: a backend XRootD storage cluster in its simplest form, and an...
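For intuition, the sketch below shows client-side striping with a toy (k+1, k) XOR parity code that can survive the loss of any single stripe; the prototype described above relies on the Intel ISA-L library (Reed-Solomon based) rather than simple parity, so this is an illustration of the idea only.

```python
# Toy client-side erasure coding: split a block into k data stripes plus one
# XOR parity stripe, and rebuild a single lost stripe from the survivors.
from functools import reduce

def encode(block: bytes, k: int = 4):
    """Split a block into k equal data stripes plus one XOR parity stripe."""
    stripe_len = -(-len(block) // k)              # ceiling division
    padded = block.ljust(stripe_len * k, b"\0")   # pad so all stripes are equal
    stripes = [padded[i * stripe_len:(i + 1) * stripe_len] for i in range(k)]
    parity = reduce(lambda a, b: bytes(x ^ y for x, y in zip(a, b)), stripes)
    return stripes, parity

def recover(stripes, parity, lost: int):
    """Rebuild one lost data stripe by XOR-ing the surviving stripes and parity."""
    survivors = [s for i, s in enumerate(stripes) if i != lost] + [parity]
    return reduce(lambda a, b: bytes(x ^ y for x, y in zip(a, b)), survivors)

data = b"event record payload"
stripes, parity = encode(data)
assert recover(stripes, parity, lost=2) == stripes[2]
```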
INFN has been running a distributed infrastructure (the Tier-1 at Bologna-CNAF and 9 Tier-2 centers) for more than 20 years, which currently offers about 140,000 CPU cores, 120 PB of enterprise-level disk space and 100 PB of tape storage, serving more than 40 international scientific collaborations.
This Grid-based infrastructure was augmented in 2019 with the INFN Cloud: a production quality...
With the growing datasets of current and next-generation High-Energy and Nuclear Physics (HEP/NP) experiments, statistical analysis has become more computationally demanding. These increasing demands elicit improvements and modernizations in existing statistical analysis software. One way to address these issues is to improve parameter estimation performance and numeric stability using...
New particle/nuclear physics experiments require a massive amount of computing power that is only achieved by using high performance clusters directly connected to the data acquisition systems and integrated into the online systems of the experiments. However, integrating an HPC cluster into the online system of an experiment means managing and synchronizing thousands of processes that handle...
The evaluation of new computing languages for a large community, like HEP, involves comparison of many aspects of the languages' behaviour, ecosystem and interactions with other languages. In this paper we compare a number of languages using a common, yet non-trivial, HEP algorithm: the tiled $N^2$ clustering algorithm used for jet finding. We compare specifically the algorithm implemented in...
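For reference, a plain (untiled) Python sketch of the sequential-recombination clustering that the benchmark is built around is shown below; the tiled variant adds a spatial binning optimisation on top of the same pairwise-distance logic, and the recombination scheme here is deliberately simplified (pt-weighted averages, ignoring the phi wrap-around).

```python
# Naive O(N^2) anti-kt-style clustering on pseudojets given as (pt, y, phi).
import math

def antikt_cluster(pseudojets, R=0.4):
    jets, active = [], [list(pj) for pj in pseudojets]          # [pt, y, phi]
    while active:
        # Find the smallest of all beam distances d_iB and pair distances d_ij.
        best = ("beam", 0, None, active[0][0] ** -2)
        for i, (pti, yi, phii) in enumerate(active):
            diB = pti ** -2
            if diB < best[3]:
                best = ("beam", i, None, diB)
            for j in range(i + 1, len(active)):
                ptj, yj, phij = active[j]
                dphi = abs(phii - phij)
                dphi = min(dphi, 2 * math.pi - dphi)
                dij = min(pti ** -2, ptj ** -2) * ((yi - yj) ** 2 + dphi ** 2) / R ** 2
                if dij < best[3]:
                    best = ("pair", i, j, dij)
        kind, i, j, _ = best
        if kind == "beam":
            jets.append(active.pop(i))            # promote pseudojet to a final jet
        else:                                     # recombine i and j (simplified scheme)
            pti, yi, phii = active[i]
            ptj, yj, phij = active[j]
            pt = pti + ptj
            merged = [pt, (pti * yi + ptj * yj) / pt, (pti * phii + ptj * phij) / pt]
            for k in sorted((i, j), reverse=True):
                active.pop(k)
            active.append(merged)
    return jets
```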
The calculation of particle interaction squared amplitudes is a key step in the calculation of cross sections in high-energy physics. These lengthy calculations are currently done using domain-specific symbolic algebra tools, where the time required for the calculations grows rapidly with the number of final state particles involved. While machine learning has proven to be highly successful in...
The common form of inter-institute particle physics experiment collaborations generates unique needs for member management, including paper authorship, shift assignments, subscription to mailing lists and access to third-party applications such as GitHub and Slack. For smaller collaborations, typically no facility for centralized member management is available and these needs are usually manually...
PUNCH4NFDI, funded by the German Research Foundation initially for five years, is a diverse consortium of particle, astro-, astroparticle, hadron and nuclear physics embedded in the National Research Data Infrastructure initiative.
In order to provide seamless and federated access to the huge variety of compute and storage systems provided by the participating communities covering their...
Predicting the performance of various infrastructure design options in complex federated infrastructures with computing sites distributed over a wide area that support a plethora of users and workflows, such as the Worldwide LHC Computing Grid (WLCG), is not trivial. Due to the complexity and size of these infrastructures, it is not feasible to deploy experimental test-beds at large scales...
The recent advances in Machine Learning and high-dimensional gradient-based optimization have led to increased interest in the question of whether we can use such methods to optimize the design of future detectors for high-level physics objectives. However, this program faces a fundamental obstacle: the quality of a detector design must be judged on the physics inference it enables, but both...
RED-SEA (https://redsea-project.eu/) is a European project funded in the framework of the H2020-JTI-EuroHPC-2019-1 call that started in April 2021. The goal of the project is to evaluate the architectural design of the main elements of the interconnection networks for the next generation of HPC systems supporting hundreds of thousands of computing nodes enabling the Exa-scale for HPC, HPDA and...
INFN-CNAF is one of the Worldwide LHC Computing Grid (WLCG) Tier-1 data centers, providing support in terms of computing, networking, storage resources and services also to a wide variety of scientific collaborations, ranging from physics to bioinformatics and industrial engineering.
Recently, several collaborations working with our data center have developed computing and data management...
With an increased dataset obtained during the Run-3 of the LHC at CERN and the even larger expected increase of the dataset by more than one order of magnitude for the HL-LHC, the ATLAS experiment is reaching the limits of the current data processing model in terms of traditional CPU resources based on x86_64 architectures and an extensive program for software upgrades towards the HL-LHC has...
Despite recent advances in optimising the track reconstruction problem for high particle multiplicities in high energy physics experiments, it remains one of the most demanding reconstruction steps with regard to complexity and computing resources. Several attempts have been made in the past to deploy suitable algorithms for track reconstruction on hardware accelerators, often by tailoring the...
The Storage Group in the CERN IT Department operates several Ceph storage clusters with an overall capacity exceeding 100 PB. Ceph is a crucial component of the infrastructure delivering IT services to all the users of the Organization as it provides: i) Block storage for the OpenStack infrastructure, ii) CephFS used as persistent storage by containers (OpenShift and Kubernetes) and as shared...
The high luminosity expected from the LHC during Run 3 and, especially, during HL-LHC data taking introduces significant challenges in the CMS event reconstruction chain. The additional computational resources needed to treat this increased quantity of data surpass the expected increase in processing power for the coming years. In order to fit the projected resource envelope, CMS is...
ICSC is one of the five Italian National Centres created in the framework of the Next Generation EU funding by the European Commission. The aim of ICSC, designed and approved through 2022 and eventually started in September 2022, is to create the national digital infrastructure for research and innovation, leveraging existing HPC, HTC and Big Data infrastructures evolving towards a cloud...
We present a Multi-Module framework based on Conditional Variational Autoencoder (CVAE) to detect anomalies in the High Voltage Converter Modulators (HVCMs), which have historically been a cause of major downtime for the Spallation Neutron Source (SNS) facility. Previous studies used machine learning techniques to predict faults ahead of time in the SNS accelerator using a Single...
High Energy Physics software has been a victim of the necessity to choose one implementation language as no really usable multi-language environment existed. Even a co-existence of two languages in the same framework (typically C++ and Python) imposes a heavy burden on the system. The role of different languages was generally limited to well encapsulated domains (like Web applications,...
Nowadays Machine Learning (ML) techniques are successfully used in many areas of High-Energy Physics (HEP) and will play a significant role also in the upcoming High-Luminosity LHC upgrade foreseen at CERN, when a huge amount of data will be produced by LHC and collected by the experiments, facing challenges at the exascale. To favor the usage of ML in HEP analyses, it would be useful to have...
We will discuss the training and on-boarding initiatives currently adopted by a range of High Energy Physics (HEP) experiments. On-boarding refers to the process by which new members of a collaboration gain the knowledge and skills needed to become effective members. Fast and efficient on-boarding is increasingly important for HEP experiments as physics analyses and, as a consequence, the...
Collider physics analyses have historically favored Frequentist statistical methodologies, with some exceptions of Bayesian inference in LHC analyses through use of the Bayesian Analysis Toolkit (BAT). We demonstrate work towards an approach for performing Bayesian inference for LHC physics analyses that builds upon the existing APIs and model building technology of the pyhf and PyMC Python...
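As a minimal, hedged illustration of the Bayesian workflow (not the actual pyhf/PyMC bridge described above), a single-bin counting experiment can be written directly in PyMC; all yields and priors below are made up.

```python
# Toy Bayesian inference for a single-bin counting experiment with PyMC.
import pymc as pm

n_obs, s, b, sigma_b = 53, 10.0, 45.0, 5.0            # hypothetical yields

with pm.Model() as model:
    mu = pm.Uniform("mu", lower=0.0, upper=5.0)        # prior on the signal strength
    bkg = pm.TruncatedNormal("bkg", mu=b, sigma=sigma_b, lower=0.0)  # constrained background
    pm.Poisson("n", mu=mu * s + bkg, observed=n_obs)   # single-bin Poisson likelihood
    trace = pm.sample(2000, tune=1000, chains=2)

print(float(trace.posterior["mu"].mean()))             # posterior mean of mu
```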
Building successful multi-national collaborations is challenging. The scientific communities in a range of physical sciences have been learning how to build collaborations that build upon regional capabilities and interests over decades, iteratively with each new generation of large scientific facilities required to advance their scientific knowledge. Much of this effort has naturally focused...
The OSG-operated Open Science Pool is an HTCondor-based virtual cluster that aggregates resources from compute clusters provided by several organizations. A user can submit batch jobs to the OSG-maintained scheduler, and they will eventually run on a combination of supported compute clusters without any further user action. Most of the resources are not owned by, or even dedicated to OSG, so...
Data access at the UK Tier-1 facility at RAL is provided through its ECHO storage, serving the requirements of the WLCG and an increasing number of other HEP- and astronomy-related communities.
ECHO is a Ceph-backed erasure-coded object store, currently providing in excess of 40PB of usable space, with frontend access to data provided via XRootD or gridFTP, using the libradosstriper library of...
The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope located at the geographic South Pole. To accurately and promptly reconstruct the arrival direction of candidate neutrino events for Multi-Messenger Astrophysics use cases, IceCube employs Skymap Scanner workflows managed by the SkyDriver service. The Skymap Scanner performs maximum-likelihood tests on individual pixels...
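As a rough illustration of the per-pixel scan described above, the sketch below evaluates a hypothetical log-likelihood on a HEALPix grid with healpy; the real Skymap Scanner uses the full IceCube event likelihood and distributes these pixel fits across many workers via SkyDriver.

```python
# Toy per-pixel likelihood scan over a HEALPix sky grid.
import numpy as np
import healpy as hp

def scan_sky(log_likelihood, nside=64):
    """log_likelihood(ra, dec) is a stand-in for the per-pixel reconstruction fit."""
    npix = hp.nside2npix(nside)
    llh_map = np.empty(npix)
    for ipix in range(npix):
        theta, phi = hp.pix2ang(nside, ipix)   # colatitude, longitude in radians
        ra, dec = phi, np.pi / 2 - theta
        llh_map[ipix] = log_likelihood(ra, dec)
    best = int(np.argmax(llh_map))
    return llh_map, hp.pix2ang(nside, best)    # map and best-fit pixel direction
```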
The upcoming exascale computers in the United States and elsewhere will have diverse node architectures, with or without compute accelerators, making it a challenge to maintain a code base that is performance portable across different systems. As part of the US Exascale Computing Project (ECP), the USQCD collaboration has embarked on a collaborative effort to prepare the lattice QCD software...
Significant advances in utilizing deep learning for anomaly detection have been made in recent years. However, these methods largely assume the existence of a normal training set (i.e., uncontaminated by anomalies), or even a completely labeled training set. In many complex engineering systems, such as particle accelerators, labels are sparse and expensive; in order to perform anomaly...
Software and computing are an integral part of our research. According to the survey for the “Future Trends in Nuclear Physics Computing” workshop in September 2020, students and postdocs spent 80% of their time on the software and computing aspects of their research. For the Electron-Ion Collider, we are looking for ways to make software (and computing) "easier" to use. All scientists of all...
Many current analyses in nuclear and particle physics search for signals that are encompassed by irreducible background events. These background events, entirely surrounding a signal of interest, would lead to inaccurate results when extracting physical observables from the data, due to the inability to reduce the signal-to-background ratio using any type of selection criteria. By...
EOS has been the main storage system at CERN for more than a decade, continuously improving in order to meet the ever evolving requirements of the LHC experiments and the whole physics user community. In order to satisfy the demands of LHC Run-3, in terms of storage performance and tradeoff between cost and capacity, EOS was enhanced with a set of new functionalities and features that we will...
The production of simulated datasets for use by physics analyses consumes a large fraction of ATLAS computing resources, a problem that will only get worse as increases in the instantaneous luminosity provided by the LHC lead to more collisions per bunch crossing (pile-up). One of the more resource-intensive steps in the Monte Carlo production is reconstructing the tracks in the ATLAS Inner...
The fast algorithms for data reconstruction and analysis of the FLES (First Level Event Selection) package of the CBM (FAIR/GSI) experiment were successfully adapted to work on the High Level Trigger (HLT) of the STAR (BNL) experiment online. For this purpose, a so-called express data stream was created on the HLT, which enabled full processing and analysis of the experimental data in real...
The MoEDAL experiment at CERN (https://home.cern/science/experiments/moedal-mapp) carries out searches for highly ionising exotic particles such as magnetic monopoles. One of the technologies deployed in this task is the Nuclear Track Detector (NTD). In the form of plastic films, these are passive detectors that are low cost and easy to handle. After exposure to the LHC collision environment...
Opticks is an open source project that accelerates optical photon simulation by
integrating NVIDIA GPU ray tracing, accessed via the NVIDIA OptiX 7 API, with
Geant4 toolkit based simulations. A single NVIDIA Turing architecture GPU has
been measured to provide optical photon simulation speedup factors exceeding
1500 times single threaded Geant4 with a full JUNO analytic GPU...
The Italian WLCG Tier-1 located in Bologna and managed by INFN-CNAF has a long tradition in supporting several research communities in the fields of High-Energy Physics, Astroparticle Physics, Gravitational Waves, Nuclear Physics and others, to which it provides computing resources in the form of batch computing (HPC, HTC and Cloud) and storage. Although the LHC experiments at CERN represent...
The HSF/IRIS-HEP Software Training group provides training in software skills to new researchers in High Energy Physics (HEP) and related communities. These skills are essential to produce the high-quality and sustainable software needed to do the research. Given the thousands of users in the community, sustainability, though challenging, is the centerpiece of its approach. The training modules are...
The Large Hadron Collider (LHC) experiments distribute data by leveraging a diverse array of National Research and Education Networks (NRENs), where experiment data management systems treat networks as a “blackbox” resource. After the High Luminosity upgrade, the Compact Muon Solenoid (CMS) experiment alone will produce roughly 0.5 exabytes of data per year. NREN Networks are a critical part...
CaTS is a Geant4 advanced example that has been part of Geant4 [1] since version 11.0. It demonstrates the use of Opticks [2] to offload the simulation of optical photons to GPUs. Opticks interfaces with the Geant4 toolkit to collect all the necessary information to generate and trace optical photons, re-implements the optical physics processes to be run on the GPU, and automatically translates the...
The CMS collaboration has chosen a novel high granularity calorimeter (HGCAL) for the endcap regions as part of its planned upgrade for the high luminosity LHC. The calorimeter will have fine segmentation in both the transverse and longitudinal directions and will be the first such calorimeter specifically optimised for particle flow reconstruction to operate at a colliding-beam experiment....
We present results on Deep Learning applied to Amplitude and Partial Wave Analysis (PWA) for spectroscopic analyses. Experiments in spectroscopy often aim to observe strongly-interacting, short-lived particles that decay to multi-particle final states. These particle decays have angular distributions that our deep learning model has been trained to identify. Working with TensorFlow...
IDEA (Innovative Detector for an Electron-positron Accelerator) is an innovative general-purpose detector concept, designed to study electron-positron collisions at future e$^+$e$^-$ circular colliders (FCC-ee and CEPC).
The detector will be equipped with a dual read-out calorimeter able to measure separately the hadronic component and the electromagnetic component of the showers initiated...
The Vera C. Rubin observatory is preparing for the execution of the most ambitious astronomical survey ever attempted, the Legacy Survey of Space and Time (LSST). Currently, in its final phase of construction in the Andes mountains in Chile and due to start operations in late 2024 for 10 years, its 8.4-meter telescope will nightly scan the southern sky and collect images of the entire visible...
The Jiangmen Underground Neutrino Observatory (JUNO) is a multipurpose neutrino experiment and the determination of the neutrino mass hierarchy is its primary physics goal. JUNO is going to take data in 2024, with 2 PB of raw data each year, and use distributed computing infrastructure for simulation, reconstruction and analysis tasks. The JUNO distributed computing system has been built up based on...
Machine learning (ML) has become an integral component of high energy physics data analyses and is likely to continue to grow in prevalence. Physicists are incorporating ML into many aspects of analysis, from using boosted decision trees to classify particle jets to using unsupervised learning to search for physics beyond the Standard Model. Since ML methods have become so widespread in...
We present an NDN-based Open Storage System (OSS) plugin for XRootD instrumented with an accelerated packet forwarder, built for data access in the CMS and other experiments at the LHC, together with its current status, performance as compared to other tools and applications, and plans for ongoing developments.
Named Data Networking (NDN) is a leading Future Internet Architecture where data...
Track reconstruction, also known as tracking, is a vital part of the HEP event reconstruction process, and one of the largest consumers of computing resources. The upcoming HL-LHC upgrade will exacerbate the need for efficient software able to make good use of the underlying heterogeneous hardware. However, this evolution should not imply the production of code unintelligible to most of its...
Fast, efficient and accurate triggers are a critical requirement for modern high energy physics experiments given the increasingly large quantities of data that they produce. The CEBAF Large Acceptance Spectrometer (CLAS12) employs a highly efficient Level 3 electron trigger to filter the amount of data recorded by requiring at least one electron in each event, at the cost of a low purity in...
One common issue in vastly different fields of research and industry is the ever-increasing need for more data storage. With experiments taking more complex data at higher rates, the data recorded is quickly outgrowing the storage capabilities. This issue is very prominent in LHC experiments such as ATLAS where in five years the resources needed are expected to be many times larger than the...
Madgraph5_aMC@NLO is one of the workhorses for Monte Carlo event generation in the LHC experiments and an important consumer of compute resources. The software has been reengineered to maintain the overall look-and-feel of the user interface while achieving very large overall speedups. The computationally intensive part (the calculation of "matrix elements") is offloaded to new implementations...
The ML_INFN initiative (“Machine Learning at INFN”) is an effort to foster Machine Learning activities at the Italian National Institute for Nuclear Physics (INFN).
In recent years, AI inspired activities have flourished bottom-up in many efforts in Physics, both at the experimental and theoretical level.
Many researchers have procured desktop-level devices, with consumer oriented GPUs,...
PARSIFAL (PARametrized SImulation) is a software tool that can reproduce the complete response of both triple-GEM and micro-RWELL based trackers. It takes into account the physical processes involved through simple parametrizations, and is therefore very fast. Existing software such as GARFIELD++ is robust and reliable, but very CPU-time consuming. The implementation of PARSIFAL was driven by the...
ROOT's TTree data structure has been highly successful and useful for HEP; nevertheless, alternative file formats now exist which may offer broader software tool support and more-stable in-memory interfacing. We present a data serialization library that produces a similar data structure within the HDF5 data format; supporting C++ standard collections, user-defined data types, and schema...
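A hedged sketch of the general idea, written with h5py and a NumPy structured dtype rather than the library's C++ serialization of standard collections and user-defined types described above; the dataset layout and attribute are placeholders.

```python
# Write a small table of event records into HDF5 with an extendable dataset.
import h5py
import numpy as np

event_dtype = np.dtype([("run", "u4"), ("event", "u8"),
                        ("n_tracks", "u2"), ("met", "f4")])
events = np.array([(1, 1001, 12, 34.5), (1, 1002, 7, 18.2)], dtype=event_dtype)

with h5py.File("events.h5", "w") as f:
    dset = f.create_dataset("events", data=events, maxshape=(None,),
                            chunks=True, compression="gzip")
    dset.attrs["schema_version"] = 1   # crude stand-in for schema evolution metadata
```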
The increasingly larger data volumes that the LHC experiments will accumulate in the coming years, especially in the High-Luminosity LHC era, call for a paradigm shift in the way experimental datasets are accessed and analyzed. The current model, based on data reduction on the Grid infrastructure, followed by interactive data analysis of manageable size samples on the physicists’ individual...
The LHCb software stack is developed in C++ and uses the Gaudi framework for event processing and DD4hep for the detector description. Numerical computations are done either directly in the C++ code or by an evaluator used to process the expressions embedded in the XML describing the detector geometry.
The current system relies on conventions for the physical units used (identical to what...
An important area of HEP studies at the LHC currently concerns the need for more extensive and precise comparison data. Important tools in this realm are event reweighting and the evaluation of more precise next-to-leading order (NLO) physics processes via Monte Carlo (MC) event generators, especially in the context of the upcoming High Luminosity LHC phase. Current event generators need to...
There is increasing demand for the efficiency and flexibility of data transport systems supporting data-intensive sciences. With growing data volume, it is essential that the transport system of a data-intensive science project fully utilize all available transport resources (e.g., network bandwidth); to achieve statistical multiplexing gain, there is an increasing trend that multiple projects...
Long-lived particles (LLPs) are very challenging to search for with current detectors and computing requirements, due to their very displaced vertices. This study evaluates the ability of the trigger algorithms used in the Large Hadron Collider beauty (LHCb) experiment to detect long-lived particles and attempts to adapt them to enhance the sensitivity of this experiment to undiscovered...
The Super Tau Charm Facility (STCF) proposed in China is a new-generation electron–positron collider with center-of-mass energies covering 2-7 GeV. In STCF, the discrimination of high momentum hadrons is a challenging and critical task for various physics studies. In recent years, machine learning methods have gradually become one of the mainstream methods in the PID field of high energy...
Over the last few years, Cloud Sync&Share platforms have become go-to services for collaboration in scientific, academic and research environments, providing users with coherent and simple ways to access their data assets. Collaboration within those platforms, between local users on local applications, has been demonstrated in various settings, with visible improvements in the research...
AtlFast3 is the new, high-precision fast simulation in ATLAS that was deployed by the collaboration to replace AtlFastII, the fast simulation tool that was successfully used for most of Run 2. AtlFast3 combines a parametrization-based Fast Calorimeter Simulation and a new machine-learning-based Fast Calorimeter Simulation based on Generative Adversarial Networks (GANs). The new fast simulation...
Prior to the start of the LHC Run 3, the US ATLAS Software and Computing operations program established three shared Tier 3 Analysis Facilities (AFs). The newest AF was established at the University of Chicago in the past year, joining the existing AFs at Brookhaven National Lab and SLAC National Accelerator Lab. In this paper, we will describe both the common and unique aspects of these three...
The LIGO, VIRGO and KAGRA Gravitational-wave (GW) observatories are getting ready for their fourth observational period, O4, scheduled to begin in March 2023, with improved sensitivities and thus higher event rates.
GW-related computing has both large commonalities with HEP computing, particularly in the domain of offline data processing and analysis, and important differences, for example in...
Zenodo has over the past 10 years grown from a proof of concept to being the world's largest general-purpose research repository, cementing CERN’s image as a pioneer and leader in Open Science. We will review key challenges faced over the past 10 years and how we overcame them, from getting off the ground, over building trust to securing funding.
Growing Zenodo was an enriching and...
In 2029 the LHC will start the High-Luminosity LHC (HL-LHC) program, with a boost in the integrated luminosity resulting in an unprecedented amount of experimental and simulated data samples to be transferred, processed and stored in disk and tape systems across the Worldwide LHC Computing Grid (WLCG). Content delivery network (CDN) solutions are being explored with the purposes of improving...
After using ROOT TTree for over two decades and storing more than an exabyte of compressed data, advances in technology have motivated a complete redesign, RNTuple, that breaks backward-compatibility to take better advantage of these storage options. The RNTuple I/O subsystem has been designed to address performance bottlenecks and shortcomings of ROOT's current state of the art TTree I/O...
The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope
located at the Geographic South Pole. For every observed neutrino event,
there are over 10^6 background events caused by cosmic-ray air shower
muons. In order to properly separate signal from background, it is
necessary to produce Monte Carlo simulations of these air showers.
Although to date, IceCube has...
The main focus of the ALICE experiment, quark-gluon plasma measurements, requires
accurate particle identification (PID). The ALICE detectors allow identifying particles over a broad momentum interval ranging from about 100 MeV/c up to 20 GeV/c.
However, hand-crafted selections and the Bayesian method do not perform well in the
regions where the particle signals overlap. Moreover, an ML...
Applying graph-based techniques, and graph neural networks (GNNs) in particular, has been shown to be a promising solution to the high-occupancy track reconstruction problems posed by the upcoming HL-LHC era. Simulations of this environment present noisy, heterogeneous and ambiguous data, which previous GNN-based algorithms for ATLAS ITk track reconstruction could not handle natively. We...
The Large Field Low-energy X-ray Polarization Detector (LPD) is a gas photoelectric effect polarization detector designed for the detailed study of X-ray transient sources in high-energy astrophysics. Previous studies have shown that the polarization degree of gamma-ray bursts (GRBs) is generally low or unpolarized. Considering the spatial background and other interference, we need high...
The HL-LHC run is anticipated to start at the end of this decade and will pose a significant challenge for the scale of the HEP software and computing infrastructure. The mission of the U.S. CMS Software & Computing Operations Program is to develop and operate the software and computing resources necessary to process CMS data expeditiously and to enable U.S. physicists to fully participate in...
Effective analysis computing requires rapid turnaround times in order to enable frequent iteration, adjustment, and exploration, leading to discovery. An informal target of reducing 10 TB of experimental data in about ten minutes using campus-scale computing infrastructure is achievable, just considering raw hardware capability. However, compared to production computing, which seeks to...
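For a rough sanity check of that raw-hardware claim, assuming a hypothetical 200-core campus slice (the core count is illustrative, not from the abstract):

```python
# Back-of-the-envelope check of the "10 TB in ~10 minutes" target.
data_tb, minutes, cores = 10, 10, 200
aggregate_gb_per_s = data_tb * 1e12 / (minutes * 60) / 1e9   # ~16.7 GB/s in total
per_core_mb_per_s = aggregate_gb_per_s * 1e3 / cores         # ~83 MB/s per core
print(f"{aggregate_gb_per_s:.1f} GB/s aggregate, {per_core_mb_per_s:.0f} MB/s per core")
```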
Celeritas is a new Monte Carlo detector simulation code designed for computationally intensive applications (specifically, HL-LHC simulation) on high-performance heterogeneous architectures. In the past two years Celeritas has advanced from prototyping a simple, GPU-based, single-physics-model infinite medium to implementing a full set of electromagnetic physics processes in complex...
The INFN Cloud project was launched at the beginning of 2020, aiming to build a distributed Cloud infrastructure and provide advanced services for the INFN scientific communities. A Platform as a Service (PaaS) was created inside INFN Cloud that allows the experiments to develop and access resources as a Software as a Service (SaaS), and CYGNO is the beta-tester of this system. The aim of the...
Analyses in HEP experiments often rely on large MC simulated datasets. These datasets are usually produced with full-simulation approaches based on Geant4 or exploiting parametric “fast” simulations introducing approximations and reducing the computational cost. With our work we created a prototype version of a new “fast” simulation that we named “flashsim” targeting analysis level data tiers...
The High-Energy Physics (HEP) and Worldwide LHC Computing Grid (WLCG) communities have faced significant challenges in understanding their global network flows across the world’s research and education (R&E) networks. When critical links, such as transatlantic or transpacific connections, experience high traffic or saturation, it is very challenging to clearly identify which collaborations...
The "A Large Ion Collider Experiment" (ALICE), one of the four large experiments at the European Organization for Nuclear Research (CERN), is responsible for studying the physics of strongly interacting matter and the quark-gluon plasma.
In order to ensure the full success of ALICE operation and data taking during the Large Hadron Collider Runs 3 and 4, a list of tasks identified as Service...
Detailed detector simulation is the major consumer of CPU resources at LHCb, having used more than 80% of the total computing budget during Run 2 of the Large Hadron Collider at CERN. As data is collected by the upgraded LHCb detector during Run 3 of the LHC, larger requests for simulated data samples are necessary, and will far exceed the pledged resources of the experiment, even with...
RDataFrame is ROOT's high-level interface for Python and C++ data analysis. Since it first became available, RDataFrame adoption has grown steadily and it is now poised to be a major component of analysis software pipelines for LHC Run 3 and beyond. Thanks to its design inspired by declarative programming principles, RDataFrame enables the development of high-performance, highly parallel...
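A minimal example of the declarative style RDataFrame enables; the file, tree and column names below are hypothetical, and the event loop only runs when a result is requested.

```python
# Declarative dimuon analysis sketch with ROOT's RDataFrame (Python interface).
import ROOT

df = ROOT.RDataFrame("Events", "data.root")              # lazily attach to a TTree
hist = (df.Filter("nMuon == 2", "exactly two muons")
          .Define("m_mumu", "InvariantMass(Muon_pt, Muon_eta, Muon_phi, Muon_mass)")
          .Histo1D(("m_mumu", ";m_{#mu#mu} [GeV];Events", 100, 70, 110), "m_mumu"))
hist.Draw()                                               # triggers the parallel event loop
```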
Development of the EIC project detector "ePIC" is now well underway and this includes the "single software stack" used for simulation and reconstruction. The stack combines several non-experiment-specific packages including ACTS, DD4hep, JANA2, and PODIO. The software stack aims to be forward looking in the era of AI/ML and heterogeneous hardware. A formal decision making process was...
AtlFast3 is the new ATLAS fast simulation that exploits a wide range of ML techniques to achieve high-precision fast simulation. The latest version of AtlFast3, used in Run 3, deploys FastCaloGANV2, which consists of 500 Generative Adversarial Networks used to simulate the showers of all particles in the ATLAS calorimeter system. The Muon Punch Through tool has also been completely rewritten...
In this talk, we discuss the evolution of the computing model of the ATLAS experiment at the LHC. After LHC Run 1, it became obvious that the available computing resources at the WLCG were fully used. The processing queue could reach millions of jobs during peak loads, for example before major scientific conferences and during large scale data processing. The unprecedented performance of the...
The capture and curation of all primary instrument data is a potentially valuable source of added insight into experiments or diagnostics in laboratory experiments. The data can, when properly curated, enable analysis beyond the current practice that uses just a subset of the as-measured data. Complete curated data can also be input for machine learning and other data exploration tools....
The current and future programs for accelerator-based neutrino imaging detectors feature the use of Liquid Argon Time Projection Chambers (LArTPC) as the fundamental detection technology. These detectors combine high-resolution imaging and precision calorimetry to enable the study of neutrino interactions with unparalleled capabilities. However, the volume of data from LArTPCs will exceed 25...
In November 2022, the HEP Software Foundation (HSF) and the Institute for Research and Innovation for Software in High-Energy Physics (IRIS-HEP) organized a workshop on the topic of “Software Citation and Recognition in HEP”. The goal of the workshop was to bring together different types of stakeholders whose roles relate to software citation and the associated credit it provides, in order to...
Instead of focusing on the concrete challenges of incremental changes to HEP driven by AI/ML, it is perhaps a useful exercise to think through more radical, speculative changes. What might be enabled if we embraced a dramatically different approach? What would we lose? How would those changes impact the computational, organizational, and epistemological nature of the field?
Simulation is a critical component of high energy physics research, with a corresponding computing footprint. Generative AI has emerged as a promising complement to intensive full simulation with relatively high accuracy compared to existing classical fast simulation alternatives. Such algorithms are naturally suited to acceleration on coprocessors, potentially running fast enough to match the...
The EU-funded ESCAPE project has brought together the ESFRI and other world class Research Infrastructures in High Energy and Nuclear Physics, Astro-Particle Physics, and Astronomy. In the 3 years of the project many synergistic and collaborative aspects have been highlighted and explored, from pure technical collaboration on common solutions for data management, AAI, and workflows, through...
The large data volumes expected from the High Luminosity LHC (HL-LHC) present challenges to existing paradigms and facilities for end-user data analysis. Modern cyberinfrastructure tools provide a diverse set of services that can be composed into a system that provides physicists with powerful tools that give them straightforward access to large computing resources, with low barriers to entry....
Future e+e- colliders are crucial to extend the search for new phenomena possibly related to the open questions that the Standard Model presently does not explain. Among the major physics programs, the flavor physics program requires particle identification (PID) performance well beyond that of most detectors designed for the current generation. Cluster counting, which measures the number...
The EPIC collaboration at the Electron-Ion Collider recently laid the groundwork for its software infrastructure. Large parts of the software ecosystem for EPIC mirror the setup from the Key4hep project, for example DD4hep for geometry description, and EDM4hep/PODIO for the data model. However, other parts of the EPIC software ecosystem diverge from Key4hep, for example for the event...
The usage of Deep Neural Networks (DNNs) as multi-classifiers is widespread in modern HEP analyses. In standard categorisation methods, the high-dimensional output of the DNN is often reduced to a one-dimensional distribution by exclusively passing the information about the highest class score to the statistical inference method. Correlations to other classes are hereby omitted.
Moreover, in...
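A toy numerical sketch of the contrast drawn above, assuming softmax outputs from some classifier; the numbers are made up.

```python
# Standard 1D reduction (winning class and its score) versus keeping the
# full softmax vector per event for the downstream statistical inference.
import numpy as np

scores = np.array([[0.70, 0.20, 0.10],      # per-event softmax outputs
                   [0.40, 0.35, 0.25]])

category = scores.argmax(axis=1)            # category assignment: [0, 0]
max_score = scores.max(axis=1)              # 1D input to the fit; correlations lost

full_features = scores                      # alternative: keep the whole vector,
                                            # shape (n_events, n_classes)
```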
Computational science, data management and analysis have been key factors in the success of Brookhaven National Laboratory's scientific programs at the Relativistic Heavy Ion Collider (RHIC), the National Synchrotron Light Source (NSLS-II), the Center for Functional Nanomaterials (CFN), and in biological, atmospheric, and energy systems science, Lattice Quantum Chromodynamics (LQCD) and...
CERN hosts more than 1200 websites essential for the mission of the Organization, internal and external collaboration and communication, as well as public outreach. The complexity and scale of CERN’s online presence is very diverse, with some websites, like https://home.cern/
, accommodating more than one million unique visitors in a day.
However, regardless of their diversity, all...
High energy physics is facing serious challenges in the coming decades due to the projected shortfall of CPU and storage resources compared to our anticipated budgets. In the past, HEP has not made extensive use of HPCs, however the U.S. has had a long term investment in HPCs and it is the platform of choice for many simulation workloads, and more recently, data processing for projects such as...
Synergies between MAchine learning, Real-Time analysis and Hybrid architectures for efficient Event Processing and decision making (SMARTHEP) is a European Training Network with the aim of training a new generation of Early Stage Researchers to advance real-time decision-making, effectively leading to data-collection and analysis becoming synonymous.
SMARTHEP...
We present a collection of tools and processes that facilitate onboarding a new science collaboration onto the OSG Fabric of Services. Such collaborations typically rely on computational workflows for simulations and analysis that are ideal for executing on OSG's distributed High Throughput Computing environment (dHTC). The produced output can be accumulated and aggregated at available...
The Xrootd S3 Gateway is a universal high performance proxy service that can be used to access S3 portals using existing HEP credentials (e.g. JSON Web Tokens and x509). This eliminates one of the biggest roadblocks to using public cloud storage resources. This paper describes how the S3 Gateway leverages existing HEP software (e.g. Davix and XRootD) to provide a familiar scalable service that...
The search for the dimuon decay of the Standard Model (SM) Higgs boson looks for a tiny peak on top of a smoothly falling SM background in the dimuon invariant mass spectrum $m(\mu\mu)$. Due to the very small signal-to-background ratio, which is at the level of 0.2% in the region $m(\mu\mu)$ = 120–130 GeV for an inclusive selection, an accurate determination of the background is of paramount importance....
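As a toy illustration of the sideband idea behind such background determinations, one can fit a smoothly falling shape outside the signal window and extrapolate it underneath; the functional form, binning and yields below are illustrative only, not the analysis model.

```python
# Fit a falling exponential to the m(mumu) sidebands and extrapolate into 120-130 GeV.
import numpy as np
from scipy.optimize import curve_fit

rng = np.random.default_rng(1)
m = rng.exponential(scale=25.0, size=200_000) + 110.0        # toy background spectrum
counts, edges = np.histogram(m, bins=80, range=(110, 150))
centres = 0.5 * (edges[:-1] + edges[1:])

def falling(x, n, k):
    return n * np.exp(-k * (x - 110.0))

sideband = (centres < 120) | (centres > 130)                 # blind the signal window
popt, _ = curve_fit(falling, centres[sideband], counts[sideband], p0=(counts[0], 0.04))
expected_in_window = falling(centres[~sideband], *popt).sum()
```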
The INFN-CNAF Tier-1 located in Bologna (Italy) is a center of the WLCG e-Infrastructure, providing computing power to the four major LHC collaborations; it also supports the computing needs of about fifty more groups, also from non-HEP research domains. The CNAF Tier-1 center has historically been very active in the integration of computing resources, proposing and prototyping...
There is no lack of approaches for managing the deployment of distributed services – in the last 15 years of running distributed infrastructure, the OSG Consortium has seen many of them. One persistent problem has been each physical site has its style of configuration management and service operations, leading to a partitioning of the staff knowledge and inflexibility in migrating services...
Moving towards Net-Zero requires robust information to enable good decision making at all levels: covering hardware procurement, workload management and operations, as well as higher level aspects encompassing grant funding processes and policy framework development.
The IRISCAST project is a proof-of-concept study funded as part of the UKRI Net-Zero Scoping Project. We have performed an...
The Jiangmen Underground Neutrino Observatory (JUNO), under construction in South China, primarily aims to determine the neutrino mass hierarchy and to precisely measure oscillation parameters. Data taking is expected to start in 2024 and to continue for more than 20 years. The development of JUNO offline software (JUNOSW) started in 2012, and it is quite challenging to maintain the JUNOSW...
The reconstruction of charged particles’ trajectories is one of the most complex and CPU-consuming event processing chains in high energy physics (HEP) experiments. Meanwhile, the precision of track reconstruction has a direct and significant impact on vertex reconstruction, physics flavour tagging and particle identification, and eventually on physics precision, in particular for HEP experiments...
There has been a significant increase in data volume from various large scientific projects, including the Large Hadron Collider (LHC) experiment. The High Energy Physics (HEP) community requires increased data volume on the network, as the community expects to produce almost thirty times the annual data volume between 2018 and 2028 [1]. To mitigate the repetitive data access issue and network...
Recent inroads in Computer Vision (CV), enabled by Machine Learning (ML), have motivated a new approach to the analysis of particle imaging detector data. Unlike previous efforts which tackled isolated CV tasks, this paper introduces an end-to-end, ML-based data reconstruction chain for Liquid Argon Time Projection Chambers (LArTPCs), the state-of-the-art in precision imaging at the intensity...
In this contribution we describe the 2022 reboot of the ScienceBox project, the containerised SWAN/CERNBox/EOS demonstrator package for CERN storage and analysis services. We evolved the original implementation to make use of Helm charts across the entire dependency stack. Charts have become the de-facto standard for application distribution and deployment in managed clusters (e.g.,...
In the frame of the German NFDI (National Research Data Infrastructure), by now 27 consortia across all domains of science have been setup in order to enhance the FAIR usage and re-usage of scientific data. The consortium PUNCH4NFDI, composed of the German particle, astroparticle, hadron&nuclear, and astrophysics communities, has been approved for initially 5 years of significant...
The CMS Online Monitoring System (OMS) aggregates and integrates different sources of information into a central place and allows users to view, compare and correlate information. It displays real-time and historical information.
The tool is heavily used by run coordinators, trigger experts and shift crews, to achieve optimal trigger and efficient data taking. It provides aggregated...
I will introduce a new neural algorithm -- HyperTrack, designed for exponentially demanding combinatorial inverse problems of high energy physics final state reconstruction and high-level analysis at the LHC and beyond. Many of these problems can be formulated as clustering on a graph resulting in a hypergraph. The algorithm is based on a machine learned geometric-dynamical input graph...
The ATLAS Continuous Integration (CI) System is the major component of the ATLAS software development infrastructure, synchronizing efforts of several hundred software developers working around the world and around the clock. Powered by 700 fast processors, it is based on the ATLAS GitLab code management service and Jenkins CI server and performs daily up to 100 ATLAS software builds probing...
Current and future distributed HENP data analysis infrastructures rely increasingly on object stores in addition to regular remote file systems. Such file-less storage systems are popular as a means to escape the inherent scalability limits of the POSIX file system API. Cloud storage is already dominated by S3-like object stores, and HPC sites are starting to take advantage of object stores...
LUX-ZEPLIN (LZ) is a direct detection dark matter experiment currently operating at the Sanford Underground Research Facility (SURF) in Lead, South Dakota. The core component is a liquid xenon time projection chamber with an active mass of 7 tonnes.
To meet the performance, availability, and security requirements for the LZ DAQ, Online, Slow Control and data transfer systems located at SURF,...
We present New Physics Learning Machine (NPLM), a machine learning-based strategy to detect data departures from a Reference model, with no prior bias on the source of discrepancy. The main idea behind the method is to approximate the optimal log-likelihood-ratio hypothesis test parametrising the data distribution with a universal approximating function, and solving its maximum-likelihood fit...
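A compact sketch of the NPLM-style fit, assuming a small PyTorch network as the universal approximator; the published method adds terms for systematic uncertainties and careful training and regularisation choices that are omitted here, and the toy reference and data samples below are invented.

```python
# Minimise L[f] = sum_ref w * (exp(f) - 1) - sum_data f ; t = -2 * min L.
import torch

def nplm_loss(f, x_ref, w_ref, x_data):
    return (w_ref * (torch.exp(f(x_ref)).squeeze(-1) - 1.0)).sum() - f(x_data).sum()

f = torch.nn.Sequential(torch.nn.Linear(1, 16), torch.nn.Sigmoid(),
                        torch.nn.Linear(16, 1))
x_ref = torch.randn(10000, 1)                      # toy Reference sample
w_ref = torch.full((10000,), 0.05)                 # reference weights
x_data = torch.randn(500, 1) + 0.3                 # toy "data" with a small shift

opt = torch.optim.Adam(f.parameters(), lr=1e-3)
for _ in range(200):
    opt.zero_grad()
    loss = nplm_loss(f, x_ref, w_ref, x_data)
    loss.backward()
    opt.step()

t_obs = -2.0 * nplm_loss(f, x_ref, w_ref, x_data).item()   # observed test statistic
```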
The computing and storage requirements of the energy and intensity frontiers will grow significantly during Runs 4 & 5 and the HL-LHC era. Similarly, in the intensity frontier, with larger trigger readouts during supernova explosions, the Deep Underground Neutrino Experiment (DUNE) will have unique computing challenges that could be addressed by the use of parallel and accelerated...
The increasing computational demand in High Energy Physics (HEP) as well as increasing concerns about energy efficiency in high performance/throughput computing are driving forces in the search for more efficient ways to utilize available resources. Since avoiding idle resources is key in achieving high efficiency, an appropriate measure is sharing of idle resources of under-utilized sites...
Data-driven methods are widely used to overcome shortcomings of Monte Carlo (MC) simulations (lack of statistics, mismodeling of processes, etc.) in experimental High Energy Physics. A precise description of background processes is crucial to reach the optimal sensitivity for a measurement. However, the selection of the control region used to describe the background process in a region of...
Planned EOSC-CZ projects will significantly improve data management in many scientific fields in the Czech Republic. Several calls for projects are under preparation according to the implementation architecture document created in 2021. Emerging National data infrastructure will build basic infrastructure with significant storage capacity for long term archive of scientific data and their...
Recent years have seen an increasing interest in the environmental impact, especially the carbon footprint, generated by the often large scale computing facilities used by the communities represented at CHEP. As this is a fairly new requirement, this information is not always readily available, especially at universities and similar institutions which do not necessarily see large scale...
Random number generation is key to many applications in a wide variety of disciplines. Depending on the application, the quality of the random numbers from a particular generator can directly impact both computational performance and critically the outcome of the calculation.
High-energy physics applications use Monte Carlo simulations and machine learning widely, which both require...
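As a small illustration of the kind of generator properties at stake, the sketch below spawns reproducible, statistically independent streams with NumPy's SeedSequence machinery and a counter-based Philox generator; it is a usage example, not the comparison study itself.

```python
# Independent, reproducible random streams for parallel workers.
import numpy as np

root = np.random.SeedSequence(20230508)
children = root.spawn(4)                               # one child seed per worker
rngs = [np.random.Generator(np.random.Philox(s)) for s in children]
samples = [rng.standard_normal(5) for rng in rngs]     # statistically independent draws
```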
Modern neutrino experiments employ hundreds to tens of thousands of photon detectors to detect scintillation photons produced from the energy deposition of charged particles. A traditional approach of modeling individual photon propagation as a look-up table requires high computational resources, and therefore it is not scalable for future experiments with multi-kiloton target volume.
We...
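For context, a minimal sketch of the traditional look-up-table approach that the abstract contrasts with its scalable alternative: tabulate photon visibility on a coarse spatial grid and interpolate at query points. The grid, units and visibility function below are placeholders.

```python
# Toy photon-visibility look-up table with trilinear interpolation.
import numpy as np
from scipy.interpolate import RegularGridInterpolator

axis = np.linspace(-5.0, 5.0, 21)                      # detector coordinates (m)
X, Y, Z = np.meshgrid(axis, axis, axis, indexing="ij")
visibility = np.exp(-np.sqrt(X**2 + Y**2 + Z**2))      # placeholder visibility model

lookup = RegularGridInterpolator((axis, axis, axis), visibility)
print(lookup([[0.5, -1.2, 3.0]]))                      # interpolated visibility at a point
```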
The ALICE experiment at CERN uses a cluster consisting of virtual and bare-metal machines to build and test proposed changes to the ALICE Online-Offline (O2) software in addition to building and publishing regular software releases.
Nomad is a free and open-source job scheduler for containerised and non-containerised applications developed by Hashicorp. It is integrated into an...
For Run 3, ATLAS redesigned its offline software, Athena, so that the
main workflows run completely multithreaded. The resulting substantial
reduction in the overall memory requirements allows for better use
of machines with many cores. This talk will discuss the performance
achieved by the multithreaded reconstruction as well as the process
of migrating the large ATLAS code base and...
At Brookhaven National Lab, the dCache storage management system is used as a disk cache for large high-energy physics (HEP) datasets primarily from the ATLAS experiment[1]. Storage space on dCache is considerably smaller than the full ATLAS data collection. Therefore, a policy is needed to determine what data files to keep in the cache and what files to evict. A good policy is to keep...
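As an illustration of what such a policy can look like, here is a minimal least-recently-used (LRU) eviction sketch; it is a generic textbook example, not the policy evaluated for the BNL dCache instance.

```python
# Byte-aware LRU cache: evict the least recently accessed files first.
from collections import OrderedDict

class LRUCache:
    def __init__(self, capacity_bytes):
        self.capacity, self.used, self.files = capacity_bytes, 0, OrderedDict()

    def access(self, name, size):
        if name in self.files:                           # cache hit: refresh recency
            self.files.move_to_end(name)
            return True
        while self.used + size > self.capacity and self.files:
            _, evicted_size = self.files.popitem(last=False)   # evict oldest entry
            self.used -= evicted_size
        self.files[name] = size                          # cache miss: admit the file
        self.used += size
        return False
```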
In this talk, we present a novel data format design that obviates the need for data tiers by storing individual event data products in column objects. The objects are stored and retrieved through Ceph S3 technology, and a companion metadata system handles tracking of the object lifecycle. Performance benchmarks of data storage and retrieval will be presented, along with scaling tests of the...
Many theories of Beyond Standard Model (BSM) physics feature multiple BSM particles. Generally, these theories live in higher dimensional phase spaces that are spanned by multiple independent BSM parameters such as BSM particle masses, widths, and coupling constants. Fully probing these phase spaces to extract comprehensive exclusion regions in the high dimensional space is challenging....
High Energy Physics experiments at the Large Hadron Collider generate petabytes of data that go through multiple transformations before final analysis and paper publication. Recording the provenance of these data is therefore crucial to maintain the quality of the final results. While the tools are in place within LHCb to keep this information for the common experiment-wide transforms, analysts...
The JIRIAF project aims to combine geographically diverse computing facilities into an integrated science infrastructure. This project starts by dynamically evaluating temporarily unallocated or idled compute resources from multiple providers. These resources are integrated to handle additional workloads without affecting local running jobs. This paper describes our approach to launch...
During the long shutdown between LHC Run 2 and 3, a reprocessing of 2017 and 2018 CMS data with higher granularity data quality monitoring (DQM) harvesting was done. The time granularity of DQM histograms in this dataset is increased by 3 orders of magnitude. In anticipation of deploying this higher granularity DQM harvesting in the ongoing Run 3 data taking, this dataset is used to study the...
The INFN Tier1 data center is currently located in the premises of the Physics Department of the University of Bologna, where CNAF is also located. During 2023 it will be moved to the “Tecnopolo”, the new facility for research, innovation, and technological development in the same city area; the same location is also hosting Leonardo, the pre-exascale supercomputing machine managed by CINECA,...
The Deep Underground Neutrino Experiment (DUNE) will operate four large-scale Liquid-Argon Time-Projection Chambers (LArTPCs) at the far site in South Dakota, producing high-resolution images of neutrino interactions.
LArTPCs represent a step-change in neutrino interaction imaging and the resultant images can be highly detailed and complex. Extracting the maximum value from LArTPC hardware...
GitLab has been running at CERN since 2012. It is a self-service code hosting application based on Git that provides collaboration and code review features, becoming one of the key infrastructures at CERN. It is being widely used at CERN, with more than 17 000 active users, hosting more than 120 000 projects and triggering more than 5 000 jobs per hour.
On its initial stage, a custom-made...
Large-scale high-energy physics experiments generate petabytes or even exabytes of scientific data, and high-performance data IO is required during their processing. However, computing and storage devices are often separated in large computing centers, and large-scale data transmission has become a bottleneck for some data-intensive computing tasks, such as data encoding and decoding,...
Queen Mary University of London (QMUL), as part of the refurbishment of one of its data centres, has installed water-to-water heat pumps to use the heat produced by the computing servers to provide heat for the university via a district heating system. This will enable us to reduce the use of high-carbon-intensity natural gas heating boilers, replacing them with electricity which has a lower...
The matrix element method (MEM) is a powerful technique that can be used for the analysis of particle collider data utilizing an ab initio calculation of the approximate probability density function for a collision event to be due to a physics process of interest. The most serious difficulty with the ME method, which has limited its applicability to searches for beyond-the-SM physics and...
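For orientation, the quantity at the heart of the MEM is the per-event probability density under a process hypothesis. In a generic form (our notation; the precise treatment of parton densities and transfer functions varies between implementations):

```latex
P(x \mid \alpha) \;=\; \frac{1}{\sigma_{\alpha}}
\int \mathrm{d}\Phi(y)\, f_{1}(y)\, f_{2}(y)\,
\bigl|\mathcal{M}_{\alpha}(y)\bigr|^{2}\, W(x \mid y)
```

Here $\mathcal{M}_{\alpha}$ is the matrix element of hypothesis $\alpha$, $f_{1,2}$ are parton distribution functions, $W(x \mid y)$ is the transfer function relating the parton-level configuration $y$ to the reconstructed observables $x$, and $\sigma_{\alpha}$ normalises the density; the multi-dimensional phase-space integral per event is what makes the method computationally expensive.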
The IceCube experiment has substantial simulation needs and is in continuous search for the most cost-effective ways to satisfy them. The most CPU-intensive part relies on CORSIKA, a cosmic ray air shower simulation. Historically, IceCube relied exclusively on x86-based CPUs, like Intel Xeon and AMD EPYC, but recently server-class ARM-based CPUs are also becoming available, both on-prem and in...
Rucio is a software framework that provides scientific collaborations with the ability to organise, manage and access large volumes of data using customisable policies. The data can be spread across globally distributed locations and across heterogeneous data centres, uniting different storage and network technologies as a single federated entity. Rucio offers advanced features such as...
The Exa.TrkX team has developed a Graph Neural Network (GNN) for reconstruction of liquid argon time projection chamber (LArTPC) data. We discuss the network architecture, a multi-head attention message passing network that classifies detector hits according to the particle type that produced them. By utilizing a heterogeneous graph structure with independent subgraphs for each 2D plane’s hits...
The increasingly pervasive and dominant role of machine learning (ML) and deep learning (DL) techniques in High Energy Physics is posing challenging requirements to effective computing infrastructures on which AI workflows are executed, as well as demanding requests in terms of training and upskilling new users and/or future developers of such technologies.
In particular, a growth in the...
The Worldwide LHC Computing Grid (WLCG) is a large-scale collaboration which gathers the computing resources of around 170 computing centres from more than 40 countries. The grid paradigm, unique to the realm of high energy physics, has successfully supported a broad variety of scientific achievements. To fulfil the requirements of new applications and to improve the long-term sustainability...
The goal of the “HTTP REST API for Tape” project is to provide a simple, minimalistic and uniform interface to manage data transfers between Storage Endpoints (SEs) where the source file is on tape. The project is a collaboration between the developers of WLCG storage systems (EOS+CTA, dCache, StoRM) and data transfer clients (gfal2, FTS). For some years, HTTP has been growing in popularity as...
Significant progress has been made in applying graph neural networks (GNNs) and other geometric ML ideas to the track reconstruction problem. State-of-the-art results are obtained using approaches such as the Exa.TrkX pipeline, which currently applies separate edge construction, classification and segmentation stages. One can also treat the problem as an object condensation task, and cluster...
HEPscore is a CPU benchmark, based on HEP applications, that the HEPiX Working Group is proposing as a replacement of the HEPSpec06 benchmark (HS06), which is currently used by the WLCG for procurement, computing resource requests and pledges, accounting and performance studies. At the CHEP 2019 conference, we presented the reasons for building a benchmark for the HEP community that is based...
The ATLAS experiment involves almost 6000 members from approximately 300 institutes spread all over the globe and more than 100 papers published every year. This dynamic environment brings some challenges such as how to ensure publication deadlines, communication between the groups involved, and the continuity of workflows. The solution found for those challenges was automation, which was...
The Large Hadron Collider (LHC) will be upgraded to the High-Luminosity LHC, increasing the number of simultaneous proton-proton collisions (pile-up, PU) several-fold. The harsher PU conditions lead to exponentially increasing combinatorics in charged-particle tracking, placing a large demand on computing resources. The projection of required computing resources exceeds the computing...
Among liquid argon time projection chamber (LArTPC) experiments MicroBooNE is the one that continually took physics data for the longest time (2015-2021), and represents the state of the art for reconstruction and analysis with this detector. Recently published analyses include oscillation physics results, searches for anomalies and other BSM signatures, and cross section measurements. LArTPC...
The CMS experiment at CERN accelerates several stages of its online reconstruction by making use of GPU resources at its High Level Trigger (HLT) farm for LHC Run 3. Additionally, during the past years, computing resources available to the experiment for performing offline reconstruction, such as Tier-1 and Tier-2 sites, have also started to integrate accelerators into their systems. In order...
Data taking at the Large Hadron Collider (LHC) at CERN restarted in 2022. The CMS experiment relies on a distributed computing infrastructure based on WLCG (Worldwide LHC Computing Grid) to support the LHC Run 3 physics program. The CMS computing infrastructure is highly heterogeneous and relies on a set of centrally provided services, such as distributed workload management and data...
Recent work has demonstrated that graph neural networks (GNNs) trained for charged particle tracking can match the performance of traditional algorithms while improving scalability. Most approaches are based on the edge classification paradigm, wherein tracker hits are connected by edges, and a GNN is trained to prune edges, resulting in a collection of connected components representing...
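As an illustration of the final step of the edge-classification paradigm (toy data and helper names of our own, not any experiment's pipeline), track candidates can be formed as connected components of the hit graph once a GNN has scored every edge:

```python
import numpy as np
from scipy.sparse import coo_matrix
from scipy.sparse.csgraph import connected_components

def build_track_candidates(edge_index, edge_scores, n_hits, threshold=0.5):
    """Form track candidates as connected components of the hit graph,
    keeping only edges whose GNN score exceeds `threshold`.

    edge_index : (2, n_edges) integer array of hit indices (sender, receiver)
    edge_scores: (n_edges,) array of GNN edge scores in [0, 1]
    """
    keep = edge_scores > threshold
    senders, receivers = edge_index[0, keep], edge_index[1, keep]
    adjacency = coo_matrix(
        (np.ones(keep.sum()), (senders, receivers)), shape=(n_hits, n_hits)
    )
    # Undirected connected components: each label is one track candidate.
    n_tracks, labels = connected_components(adjacency, directed=False)
    return n_tracks, labels

# Toy example: 5 hits; edges 0-1 and 1-2 pass the score cut, 3-4 does not.
edges = np.array([[0, 1, 3], [1, 2, 4]])
scores = np.array([0.9, 0.8, 0.2])
print(build_track_candidates(edges, scores, n_hits=5))
```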
The ATLAS Open Data project aims to deliver open-access resources for education and outreach in High Energy Physics using real data recorded by the ATLAS detector. The project has so far released a substantial amount of data from 8 TeV and 13 TeV collisions in an easily-accessible format, supported by dedicated software and documentation to allow its fruitful use...
Monitoring services play a crucial role in the day-to-day operation of distributed computing systems. The ATLAS experiment at LHC uses the production and distributed analysis workload management system (PanDA WMS), which allows a million computational jobs to run daily at over 170 computing centers of the WLCG and other opportunistic resources, utilizing 600k cores simultaneously on average....
CDS (Custodial Disk Storage), a disk-based custodial storage powered by CERN EOS storage system, has been operating for the ALICE experiment at the KISTI Tier-1 Centre since November 2021. The CDS replaced existing tape storage operated for almost a decade, after its stable demonstration in the WLCG Tape Challenges in October 2021. We tried to challenge the economy of tape storage in the...
The LHCb experiment is currently taking data with a completely renewed DAQ system, capable for the first time of performing a full real-time reconstruction of all collision events occurring at LHC point 8.
The Collaboration is now pursuing a further upgrade (LHCb "Upgrade-II"), to enable the experiment to retain the same capability at luminosities an order of magnitude larger than the maximum...
As the largest particle physics laboratory in the world, CERN has more than 17000 collaborators spread around the globe. ATLAS, one of CERN’s experiments, has around 6000 active members and 300 associate institutes, all of which must go through the standard registration and updating procedures within CERN’s HR (Foundation) database. Simultaneously, the ATLAS Glance project, among other...
Jiangmen Underground Neutrino Observatory (JUNO), under construction in southern China, is a multi-purpose neutrino experiment designed to determine the neutrino mass hierarchy and precisely measure oscillation parameters. Equipped with a 20-kton liquid scintillator central detector viewed by 17,612 20-inch and 25,600 3-inch photomultiplier tubes, JUNO...
The LHCb experiment uses a triggerless readout system where its first stage (HLT1) is implemented on GPU cards. The full LHC event rate of 30 MHz is reduced to 1 MHz using efficient parallelisation techniques in order to meet throughput requirements. The GPU cards are hosted in the same servers as the FPGA cards receiving the detector data, which reduces the network to a minimum. In this talk,...
Among the biggest computational challenges for High Energy Physics (HEP) experiments are the increasingly large datasets being collected, which often require correspondingly complex data analyses. In particular, the PDFs used for modeling the experimental data can have hundreds of free parameters. The optimization of such models involves a significant computational effort and a...
The BaBar experiment collected electron-positron collisions at the SLAC National Accelerator Laboratory from 1999 to 2008. Although data taking stopped 15 years ago, the collaboration is still actively performing data analyses, publishing results, and giving presentations at international conferences. Special considerations were needed to carry out analyses using a computing environment that was...
Track reconstruction is one of the most important and challenging tasks in the offline data processing of collider experiments. For the BESIII detector working in the tau-charm energy region, considerable effort was previously devoted to improving the tracking performance with traditional methods such as template matching and the Hough transform. However, for difficult tracking tasks, such as the...
To accurately describe data, tuning the parameters of MC event Generators is essential. At first, experts performed tunings manually based on their sense of physics and goodness of fit. The software, Professor, made tuning more objective by employing polynomial surrogate functions to model the relationship between generator parameters and experimental observables (inner-loop optimization),...
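A minimal sketch of this surrogate-based inner/outer loop with one toy parameter and three toy observable bins (the real tool fits higher-order polynomials per bin to many full generator runs):

```python
import numpy as np
from scipy.optimize import minimize

# Toy setup: one generator parameter p, three observable bins.
# "MC runs" at sampled parameter points (in practice, full generator runs).
p_samples = np.linspace(0.0, 2.0, 11)

def toy_generator(p):
    return np.array([1.0 + 0.5 * p, 2.0 - 0.3 * p ** 2, 0.5 + p])

mc_values = np.array([toy_generator(p) for p in p_samples])

# Inner loop: fit a quadratic surrogate per observable bin.
surrogates = [np.polynomial.Polynomial.fit(p_samples, mc_values[:, b], deg=2)
              for b in range(mc_values.shape[1])]

# Outer loop: minimise a chi-square between surrogate predictions and "data".
data = np.array([1.6, 1.9, 1.7])
errors = np.array([0.1, 0.1, 0.1])

def chi2(p):
    pred = np.array([s(p[0]) for s in surrogates])
    return np.sum(((pred - data) / errors) ** 2)

result = minimize(chi2, x0=[1.0], bounds=[(0.0, 2.0)])
print("tuned parameter:", result.x)
```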
The LHCb experiment is one of the 4 LHC experiments at CERN. With more than 1500 members and tens of thousands of assets, the Collaboration requires systems that allow the extraction of data from many databases according to some very specific criteria. In LHCb there are 4 production web applications responsible for managing members and institutes, tracking assets and their current status,...
The former CMS Run 2 High Level Trigger (HLT) farm is one of the largest contributors to CMS compute resources, providing about 30k job slots for offline computing. The role of this farm has been evolving, from an opportunistic resource exploited during inter-fill periods in the LHC Run 2, to a nearly transparent extension of the CMS capacity at CERN during LS2 and into the LHC Run 3 started...
The software-based High Level Trigger (HLT) of CMS reduces the data readout rate from 100 kHz (obtained from the Level-1 trigger) to around 2 kHz. It makes use of all detector subsystems and runs a streamlined version of the CMS reconstruction. Run 1 and Run 2 of the LHC saw the reconstruction algorithm run on a CPU farm (~30000 CPUs in 2018). But the need for increased computational power as we...
The High-Luminosity LHC (HL-LHC) will provide an order of magnitude increase in integrated luminosity and enhance the discovery reach for new phenomena. The increased pile-up foreseen during the HL-LHC necessitates major upgrades to the ATLAS detector and trigger. The Phase-II trigger will consist of two levels, a hardware-based Level-0 trigger and an Event Filter (EF) with tracking...
The High Luminosity upgrade to the LHC (HL-LHC) is expected to deliver scientific data at the multi-exabyte scale. In order to address this unprecedented data storage challenge, the ATLAS experiment launched the Data Carousel project in 2018. Data Carousel is a tape-driven workflow whereby bulk production campaigns with input data resident on tape are executed by staging and promptly...
The University of Victoria (UVic) operates an Infrastructure-as-a-Service science cloud for Canadian researchers, and a WLCG T2 grid site for the ATLAS experiment at CERN. At first, these were two distinctly separate systems, but over time we have taken steps to migrate the T2 grid services to the cloud. This process has been significantly facilitated by basing our approach on Kubernetes, a...
In this paper we discuss the CMS open data publishing workflows, summarising experience with eight releases of CMS open data on the CERN Open Data portal since its initial launch in 2014. We present the recent enhancements of data curation procedures, including (i) mining information about collision and simulated datasets with accompanying generation parameters and processing configuration...
HammerCloud (HC) is a testing service and framework for continuous functional tests, on-demand large-scale stress tests, and performance benchmarks. It checks the computing resources and various components of distributed systems with realistic full-chain experiment workflows.
The HammerCloud software was initially developed in Python 2. After support for Python 2 was discontinued in 2020,...
The CERN IT Department is responsible for ensuring the integrity and security of data stored in the IT Storage Services. General storage backends such as EOSHOME/PROJECT/MEDIA and CEPHFS are used to store data for a wide range of use cases for all stakeholders at CERN, including experiment project spaces and user home directories.
In recent years a backup system, CBACK, was developed based...
To better understand experimental conditions and performances of the Large Hadron Collider (LHC), CERN experiments execute tens of thousands of loosely-coupled Monte Carlo simulation workflows per hour on hundreds of thousands - small to mid-size - distributed computing resources federated by the Worldwide LHC Computing Grid (WLCG). While this approach has been reliable during the first LHC...
The Super Tau Charm Facility (STCF) proposed in China is a new-generation electron–positron collider with center-of-mass energies covering 2-7 GeV and a peak luminosity of 5*10^34 cm^-2s^-1. The offline software of STCF (OSCAR) is developed to support the offline data processing, including detector simulation, reconstruction, calibration as well as physics analysis. To meet STCF’s specific...
The Glance project is responsible for over 20 systems across three CERN experiments: ALICE, ATLAS and LHCb. Students, engineers, physicists and technicians have been using systems designed and managed by Glance on a daily basis for over 20 years. In order to produce quality products continuously, considering internal stakeholders' ever-evolving requests, there is a need for standardization....
Particle track reconstruction is the most computationally intensive process in nuclear physics experiments. Traditional algorithms use a combinatorial approach that exhaustively tests track measurements (hits) to identify those that form an actual particle trajectory. In this article we describe the development of machine learning models that assist the tracking algorithm by identifying...
Performing a physics analysis of data from simulations of a high energy experiment requires the application of several common procedures, from obtaining and reading the data to producing detailed plots for interpretation. Implementing common procedures in a general analysis framework allows the analyzer to focus on the unique parts of their analysis. Over the past few years, EIC simulations...
The ATLAS experiment at CERN is one of the largest scientific machines built to date and will have ever growing computing needs as the Large Hadron Collider collects an increasingly larger volume of data over the next 20 years. ATLAS is conducting R&D projects on Amazon and Google clouds as complementary resources for distributed computing, focusing on some of the key features of commercial...
Making the large datasets collected at the LHC accessible to the public is a considerable challenge given the complexity and volume of data. Yet to harness the full scientific potential of the facility, it is essential to enable meaningful access to the data by the broadest physics community possible. Here we present an application, the LHCb Ntuple Wizard, which leverages the existing...
Data from the LHC detectors are not easily represented using regular data structures. These detectors are composed of several species of subdetectors and therefore produce heterogeneous data. LHC detectors are granular by design so that nearby particles may be distinguished. As a consequence, LHC data are sparse, in that many detector channels are not active during a given collision event....
Operational analytics is the direction of research related to the analysis of the current state of computing processes and the prediction of the future in order to anticipate imbalances and take timely measures to stabilize a complex system. There are two relevant areas in ATLAS Distributed Computing that are currently in the focus of studies: end-user physics analysis including the forecast...
FastCaloSim is a parameterized simulation of the particle energy response and of the energy distribution in the ATLAS calorimeter. It is a relatively small and self-contained package with massive inherent parallelism and captures the essence of GPU offloading via important operations like data transfer, memory initialization, floating point operations, and reduction. Thus, it was identified as...
The LHCb experiment has recently started a new period of data taking after a major upgrade in both software and hardware. One of the biggest challenges has been the migration of the first part of the trigger system (HLT1) into a parallel GPU architecture framework called Allen, which performs a partial reconstruction of most of the LHCb sub-detectors. In Allen, the reconstruction of the...
The recent major upgrade of the ALICE Experiment at CERN’s Large Hadron Collider has been coupled with the development of a new Online-Offline computing system capable of interacting with a sustained input throughput of 3.5TB/s. To facilitate the control of the experiment, new web applications have been developed and deployed to be used 24 hours a day, 365 days a year in the control room and...
The CERN Tape Archive (CTA) was conceived as the successor to CASTOR and as the tape back-end to EOS, designed for the archival storage of data from LHC Run-3 and other experimental programmes at CERN. In the wider WLCG, the tape software landscape is quite heterogeneous, but we are now entering a period of consolidation. This has led to a number of sites in WLCG (and beyond) reevaluating their...
We have been studying the use of deep neural networks (DNNs) to identify and locate primary vertices (PVs) in proton-proton collisions at the LHC. Earlier work focused on finding primary vertices in simulated LHCb data using a hybrid approach that started with kernel density estimators (KDEs) derived from the ensemble of charged track parameters heuristically and predicted “target histogram”...
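As a rough illustration of the KDE step (toy numbers and SciPy tools of our own choosing, not the actual implementation), one can build a kernel density along the beam axis from track z positions and take its peaks as primary-vertex candidates:

```python
import numpy as np
from scipy.stats import gaussian_kde
from scipy.signal import find_peaks

rng = np.random.default_rng(42)

# Toy track z-positions at closest approach to the beamline:
# two primary vertices at z = -30 mm and z = +15 mm plus diffuse background.
track_z = np.concatenate([
    rng.normal(-30.0, 0.5, 40),
    rng.normal(15.0, 0.5, 25),
    rng.uniform(-100.0, 100.0, 10),
])

# Kernel density estimate along the beam axis.
kde = gaussian_kde(track_z, bw_method=0.05)
z_grid = np.linspace(-100.0, 100.0, 2001)
density = kde(z_grid)

# Primary-vertex candidates appear as peaks in the density.
peaks, _ = find_peaks(density, height=0.2 * density.max())
print("PV candidates near z =", z_grid[peaks])
```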
For LHC Run3 the ALICE experiment software stack has been completely refactored, incorporating support for multicore job execution. The new multicore jobs spawn multiple processes and threads within the payload. Given that some of the deployed processes may be short-lived, accounting for their resource consumption presents a challenge. This article presents the newly developed methodology for...
CERN, as many large organizations, relies on multiple communication means for different use-cases and teams.
Email and mailing lists are the most popular ones, but more modern communications systems gain traction such as Mattermost and Push notifications.
On one end of the spectrum we have communication teams writing individual emails to users on a daily basis, which may be small targets, or...
The Deep Underground Neutrino Experiment (DUNE) is a long-baseline experiment which aims to study neutrino oscillation and astroparticle physics. It will produce vast amounts of metadata, which describe the data coming from the read-out of the primary DUNE detectors. Various databases will make up the overall DB architecture for this metadata. ProtoDUNE at CERN is the largest existing...
Research in high energy physics (HEP) heavily relies on domain-specific digital contents. We reflect on the interpretation of principles of Findability, Accessibility, Interoperability, and Reusability (FAIR) in preservation and distribution of such digital objects. As a case study, we demonstrate the implementation of an end-to-end support infrastructure for preserving and accessing Universal...
An all-inclusive analysis of costs for on-premises and public cloud-based solutions to handle the bulk of HEP computing requirements shows that dedicated on-premises deployments are still the most cost-effective. Since the advent of public cloud services, the HEP community has engaged in multiple proofs of concept to study the technical viability of using cloud resources; however, the...
Apache Spark is a distributed computing framework which can process very large datasets using large clusters of servers. Laurelin is a Java-based implementation of ROOT I/O which allows Spark to read and write ROOT files from common HEP storage systems without a dependency on the C++ implementation of ROOT. We discuss improvements due to the migration to an Arrow-based in-memory representation...
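A minimal PySpark sketch of reading a ROOT TTree through Laurelin, assuming the Laurelin package is made available to Spark and registers the "root" data-source format; the Maven coordinates, version, file path and tree name below are placeholders:

```python
from pyspark.sql import SparkSession

# Assumption: Laurelin jars resolvable via spark.jars.packages (version is a placeholder).
spark = (
    SparkSession.builder
    .appName("laurelin-example")
    .config("spark.jars.packages", "edu.vanderbilt.accre:laurelin:1.6.0")
    .getOrCreate()
)

# Read a TTree into a Spark DataFrame; path and tree name are placeholders.
events = (
    spark.read.format("root")
    .option("tree", "Events")
    .load("hdfs:///store/sample.root")
)
events.printSchema()
print(events.count())
```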
The CMS experiment started to utilize Graphics Processing Units (GPU) to accelerate the online reconstruction and event selection running on its High Level Trigger (HLT) farm in the 2022 data taking period. The projections of the HLT farm to the High-Luminosity LHC foresee a significant use of compute accelerators in the LHC Run 4 and onwards in order to keep the cost, size, and power budget...
The sensitivity of modern HEP experiments to New Physics (NP) is limited by the hardware-level triggers used to select data online, resulting in a bias in the data collected. The deployment of efficient data acquisition systems integrated with online processing pipelines is instrumental to increase the experiments' sensitivity to the discovery of any anomaly or possible signal of NP. In...
Quantum Computing (QC) is a promising early-stage technology that offers novel approaches to simulation and analysis in nuclear and high energy physics (NHEP). By basing computations directly on quantum mechanical phenomena, speedups and other advantages for many computationally hard tasks are potentially achievable, albeit both the theoretical underpinning and the practical realization are...
No single organisation has the resources to defend its services alone against most modern malicious actors and so we must protect ourselves as a community. In the face of determined and well-resourced attackers, we must actively collaborate in this effort across HEP and more broadly across Research and Education (R&E).
Parallel efforts are necessary to appropriately respond to this...
EvtGen is a simulation generator specialized for decays of heavy hadrons. Since its early development in the 90’s, the generator has been extensively used and has become today an essential tool for heavy-flavour physics analyses. Throughout this time, its source code has remained mostly unchanged, except for additions of new decay models. In view of the upcoming boom of multi-threaded...
The recent release of AwkwardArray 2.0 significantly changes the way that lazy evaluation and task-graph building are handled in columnar analysis. The Dask parallel processing library is now used for these pieces of functionality with AwkwardArray, and this change affords new ways of optimizing columnar analysis and distributing it on clusters. In particular this allows optimization of a task...
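A minimal sketch of what this Dask-backed columnar workflow can look like, assuming a recent uproot with dask-awkward support is installed; the file, tree and branch names below are placeholders:

```python
import uproot
import awkward as ak

# Build a lazy, Dask-backed view of a TTree (task graph only, no I/O yet).
events = uproot.dask("data.root:Events")          # placeholder file and tree

# Operations on the lazy collection only extend the task graph.
dimuon = events[events.nMuon >= 2]                # placeholder branch names
leading_pt = ak.max(dimuon.Muon_pt, axis=1)

# Task-graph optimization (e.g. reading only the needed branches and chunks)
# and execution happen at compute time, possibly on a distributed cluster.
print(leading_pt.compute())
```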
We will present the rapid progress, vision and outlook across multiple state of the art development lines within the Global Network Advancement Group and its Data Intensive Sciences and SENSE/AutoGOLE working groups, which are designed to meet the present and future needs and address the challenges of the Large Hadron Collider and other science programs with global reach. Since it was founded...
Large research infrastructures, such as DESY and CERN, in the field of the exploration of the universe and matter (ErUM) are significantly driving the digital transformation of the future. The German action plan "ErUM-Data" promotes this transformation through the interdisciplinary networking and financial support of 20,000 scientists.
The ErUM-Data-Hub (https://erumdatahub.de) serves as a...
The development of an LHC physics analysis involves numerous investigations that require the repeated processing of terabytes of measured and simulated data. Thus, a rapid processing turnaround is beneficial to the scientific process. We identified two bottlenecks in analysis-independent algorithms and developed the following solutions.
First, inputs are now cached on individual SSD caches of...
Machine learning (ML) and deep learning (DL) are powerful tools for modeling complex systems. However, most of the standard models in ML/DL do not provide a measure of confidence or uncertainties associated with their predictions. Further, these models can only be trained on available data. During operation, models may encounter data samples poorly reflected in training data. These data...
The ATLAS experiment Data Acquisition (DAQ) system will be extensively upgraded to fully exploit the High-Luminosity LHC (HL-LHC) upgrade, allowing it to record data at unprecedented rates. The detector will be read out at 1 MHz generating over 5 TB/s of data. This design poses significant challenges for the Ethernet-based network as it will be required to transport 20 times more data than...
Rucio is a Data Management software that has become a de-facto standard in the HEP community and beyond. It allows the management of large volumes of data over their full lifecycle. The Belle II experiment located at KEK (Japan) recently moved to Rucio to manage its data over the coming decade (O(10) PB/year). In addition to its Data Management functionalities, Rucio also provides support for...
We explore interpretability of deep neural network (DNN) models designed for identifying jets coming from top quark decay in the high energy proton-proton collisions at the Large Hadron Collider (LHC). Using state-of-the-art methods of explainable AI (XAI), we identify which features play the most important roles in identifying the top jets, how and why feature importance varies across...
The upgrade of the Large Hadron Collider (LHC) is progressing well; during the next decade we will face a ten-fold increase in experimental data. The application of state-of-the-art detectors and data acquisition systems requires high-performance simulation support, which is even more demanding in the case of heavy-ion collisions. Our basic aim was to develop a Monte-Carlo simulation code for heavy ion...
In particle physics, workflow management systems are primarily used as tailored solutions in dedicated areas such as Monte Carlo production. However, physicists performing data analyses are usually required to steer their individual, complex workflows manually, frequently involving job submission in several stages and interaction with distributed storage systems by hand. This process is not...
The NOTED (Network Optimised Transfer of Experimental Data) project has successfully demonstrated the ability to dynamically reconfigure network links to increase the effective bandwidth available for FTS-driven transfers between endpoints, such as WLCG sites by inspecting on-going data transfers and so identifying those that are bandwidth-limited for a long period of time. Recently, the...
Over the last 20 years, thanks to the development of quantum technologies, it has been possible to deploy quantum algorithms and applications, which before were only accessible through simulation, on real quantum hardware. The currently available devices are often referred to as noisy intermediate-scale quantum (NISQ) computers and they require calibration routines in order to obtain...
XAS (synchrotron X-ray absorption spectroscopy) scans the X-ray photon energy to measure the structure of the X-ray absorption coefficient as it changes with energy. In spectral experiments, determining the composition and structure of unknown samples requires data collection first, followed by data processing and analysis. This takes a lot of time, and there can be no errors in the...
The organization of seminars and conferences was strongly influenced by the covid-19 pandemic.
In the early period of the pandemic, many events were canceled or held completely online, using video conferencing tools such as ZOOM or MS Teams. Later, thanks to large-scale vaccination and immunization, it became possible to organize large in-person events again. Nevertheless, given some local...
Data analysis in particle physics is socially distributed: unlike centrally developed and executed reconstruction pipelines, the analysis work performed after Analysis Object Descriptions (AODs) are made and before the final paper review—which includes particle and event selection, systematic error handling, decay chain reconstruction, histogram aggregation, fitting, statistical models, and...
The ATLAS experiment is preparing a major change in the conditions data infrastructure in view of Run 4. In this presentation we will expose the main motivations for the new design (called CREST, for Conditions-REST), the ongoing changes in the DB architecture, and present the developments for the deployment of the new system. The main goal is to set up a parallel infrastructure for full-scale...
Data caches of various forms have been widely deployed in the context of commercial and research and education networks, but their common positioning at the Edge limits their utility from a network operator perspective. When deployed outside the network core, providers lack visibility to make decisions or apply traffic engineering based on data access patterns and caching node location.
As...
The task of identifying B meson flavor at the primary interaction point in the LHCb detector is crucial for measurements of mixing and time-dependent CP violation.
Flavor tagging is usually done with a small number of expert systems that find important tracks to infer the B flavor from.
Recent advances show that replacing all of those expert systems with one ML algorithm that considers...
Realistic environments for prototyping, studying and improving analysis workflows are a crucial element on the way towards user-friendly physics analysis at HL-LHC scale. The IRIS-HEP Analysis Grand Challenge (AGC) provides such an environment. It defines a scalable and modular analysis task that captures relevant workflow aspects, ranging from large-scale data processing and handling of...
The Quantum Angle Generator (QAG) constitutes a new quantum machine learning model designed to generate accurate images on current Noisy Intermediate-Scale Quantum (NISQ) devices. Variational quantum circuits constitute the core of the QAG model, and various circuit architectures are evaluated. In combination with the so-called MERA-upsampling architecture, the QAG model achieves excellent...
Recently, a workshop on Artificial Intelligence for the Electron Ion Collider (AI4EIC) has been held at the College of William & Mary. The workshop covered all active and potential areas of applications of AI/ML for the EIC; it also had a strong outreach and educational component, with different tutorials given by experts in AI and machine learning from national labs, universities, and industry...
In the near future, the LHC detector will deliver much more data to be processed. Therefore, new techniques are required to deal with such a large amount of data. Recent studies showed that one of the quantum computing techniques, quantum annealing (QA), can be used to perform the particle tracking with efficiency higher than 90% even in the dense environment. The algorithm starts from...
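Such annealing approaches typically phrase the selection of candidate track segments (or triplets) as a quadratic unconstrained binary optimisation (QUBO); schematically (generic form, not the exact objective of the cited studies):

```latex
O(\mathbf{x}) \;=\; \sum_i a_i\, x_i \;+\; \sum_{i<j} b_{ij}\, x_i x_j ,
\qquad x_i \in \{0, 1\}
```

Here each binary variable $x_i$ flags whether a candidate segment is kept, the linear coefficients $a_i$ encode segment quality, and the couplings $b_{ij}$ reward geometrically compatible pairs and penalise conflicting ones; the annealer then searches for the minimum-energy configuration.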
The ALICE experiment at CERN has undergone a substantial detector, readout and software upgrade for the LHC Run3. A signature part of the upgrade is the triggerless detector readout, which necessitates a real time lossy data compression from 1.1TB/s to 100GB/s performed on a GPU/CPU cluster of 250 nodes. To perform this compression, a significant part of the software, which traditionally is...
The growing amount of data generated by the LHC requires a shift in how HEP analysis tasks are approached. Efforts to address this computational challenge have led to the rise of a middle-man software layer, a mixture of simple, effective APIs and fast execution engines underneath. Having common, open and reproducible analysis benchmarks proves beneficial in the development of these modern...
In the LHCb experiment, a wide variety of Monte Carlo simulated samples need to be produced for the experiment’s physics programme. LHCb has a centralised production system for simulating, reconstructing and processing collision data, which runs on the DIRAC backend on the WLCG.
To cope with a large set of different types of sample, requests for simulation production are based on a concept of...
An increasingly frequent challenge faced in HEP data analysis is to characterize the agreement between a prediction that depends on a dozen or more model parameters–such as predictions coming from an effective field theory (EFT) framework–and the observed data. Traditionally, such characterizations take the form of a negative log likelihood (NLL) distribution, which can only be evaluated...
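For concreteness, one common binned form of such an NLL (our notation, not necessarily the parameterization used in this work) is

```latex
-2\ln L(\vec{c}) \;=\; 2\sum_{i \in \mathrm{bins}}
\left[ \nu_i(\vec{c}) - n_i + n_i \ln\frac{n_i}{\nu_i(\vec{c})} \right],
\qquad
\nu_i(\vec{c}) \;=\; b_i + s_i^{\mathrm{SM}}
 + \sum_j a_{ij}\, c_j + \sum_{j \le k} B_{ijk}\, c_j c_k
```

where $n_i$ are observed counts, $\nu_i$ is the prediction as a function of the Wilson coefficients $\vec{c}$, and the quadratic dependence on the coefficients reflects the squared-amplitude structure of EFT predictions. Evaluating this over a dozen or more coefficients is what makes a naive point-by-point scan impractical.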
The CMS Submission Infrastructure (SI) is the main computing resource provisioning system for CMS workloads. A number of HTCondor pools are employed to manage this infrastructure, which aggregates geographically distributed resources from the WLCG and other providers. Historically, the model of authentication among the diverse components of this infrastructure has relied on the Grid Security...
With the construction and operation of fourth-generation light sources like European Synchrotron Radiation Facility Extremely Brilliant Source (ESRF-EBS), Advanced Photon Source Upgrade (APS-U), Advanced Light Source Upgrade (ALS-U), High Energy Photon Source (HEPS), etc., several advanced biological macromolecule crystallography (MX) beamlines are or will be built and thereby the huge amount...
To meet the computing challenges of upcoming experiments, software training efforts play an essential role in imparting best practices and popularizing new technologies. Because many of the taught skills are experiment-independent, the HSF/IRIS-HEP training group coordinates between different training initiatives while building a training center that provides students with various training...
The Superconducting Quantum Materials and Systems Center (SQMS) and the Computational Science and AI Directorate (CSAID) at Fermi National Accelerator Laboratory and Rigetti Computing have teamed up to define and deliver a standard pathway for quantum computing at Rigetti from HEPCloud. HEPCloud now provides common infrastructure and interfacing for managing connectivity and providing access...
Searches for new physics set exclusion limits in parameter spaces of typically up to 2 dimensions. However, the relevant theory parameter space is usually of a higher dimension but only a subspace is covered due to the computing time requirements of signal process simulations. An Active Learning approach is presented to address this limitation. Compared to the usual grid sampling, it reduces...
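A minimal sketch of the active-learning idea, with a toy two-dimensional parameter space and a stand-in function for the expensive simulation-plus-limit-setting step (the classifier and sampling strategy used in the actual work may differ):

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessClassifier

rng = np.random.default_rng(1)

def run_expensive_simulation(points):
    """Stand-in for signal simulation + limit setting: returns 1 if excluded.
    Here the true (unknown) boundary is a circle of radius 1 in theory space."""
    return (points[:, 0] ** 2 + points[:, 1] ** 2 < 1.0).astype(int)

# Initial coarse sample, seeded with one point of each class for the toy example.
X = np.vstack([np.array([[0.0, 0.0], [1.8, 1.8]]),
               rng.uniform(-2, 2, size=(20, 2))])
y = run_expensive_simulation(X)

for iteration in range(5):
    clf = GaussianProcessClassifier().fit(X, y)
    # Score a large pool of candidates; pick those closest to p = 0.5,
    # i.e. where the estimated exclusion boundary is least certain.
    pool = rng.uniform(-2, 2, size=(2000, 2))
    uncertainty = np.abs(clf.predict_proba(pool)[:, 1] - 0.5)
    new_points = pool[np.argsort(uncertainty)[:10]]
    X = np.vstack([X, new_points])
    y = np.concatenate([y, run_expensive_simulation(new_points)])

print("total simulated points:", len(X))
```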
The Deep Underground Neutrino Experiment (DUNE) is a next generation long-baseline neutrino experiment based in the USA which is expected to start taking data in 2029. DUNE aims to precisely measure neutrino oscillation parameters by detecting neutrinos from the LBNF beamline (Fermilab) at the Far Detector, 1300 kilometres away, in South Dakota. The Far Detector will consist of four cryogenic...
A comprehensive analysis of the HEP (High Energy Physics) experiment traffic across LHCONE (Large Hadron Collider Open Network Environment) and other networks, is essential for immediate network optimisation (for example by the NOTED project) and highly desirable for long-term network planning. Such an analysis requires two steps: tagging of network packets to indicate the type and owner of...
PyPWA is a toolkit designed to fit (regression) parametric models to data and to generate distributions (simulation) according to a given model (function). PyPWA software has been written under the python ecosystem with the goal of performing Amplitude or Partial Wave Analysis (PWA) in nuclear and particle physics experiments. The aim of spectroscopy experiments is often the...
The HSF Conditions Databases activity is a forum for cross-experiment discussions hoping for as broad a participation as possible. It grew out of the HSF Community White Paper work to study conditions data access, where experts from ATLAS, Belle II, and CMS converged on a common language and proposed a schema that represents best practice. The focus of the HSF work is the most difficult use...
The continuous growth in model complexity in high-energy physics (HEP) collider experiments demands increasingly time-consuming model fits. We show first results on the application of conditional invertible networks (cINNs) to this challenge. Specifically, we construct and train a cINN to learn the mapping from signal strength modifiers to observables and its inverse. The resulting network...
The study of the decays of $B$ mesons is a key component of modern experiments which probe heavy quark mixing and $CP$ violation, and may lead to concrete deviations from the predictions of the Standard Model [1]. Flavour tagging, the process of determining the quark flavour composition of $B$ mesons created in entangled pairs at particle accelerators, is an essential component of this...
Monte Carlo simulations are a key tool for the physics programmes of High Energy Physics experiments. Their accuracy and reliability are of the utmost importance. A full suite of verifications is in place for the LHCb Simulation software to ensure the quality of the simulated samples produced.
In this contribution we will give a short overview of the procedure and the tests in place, that exploits the...
Geant4, the leading detector simulation toolkit used in High Energy Physics, employs a set of physics models to simulate interactions of particles with matter across a wide range of interaction energies. These models, especially the hadronic ones, rely largely on directly measured cross-sections and inclusive characteristics, and use physically motivated parameters. However, they generally aim...
Large-scale research facilities are becoming prevalent in the modern scientific landscape. One of these facilities' primary responsibilities is to make sure that users can process and analyse measurement data for publication. To allow for barrier-less access to those highly complex experiments, almost all beamlines require fast feedback capable of manipulating and visualizing data online to...
The ATLAS EventIndex is a global catalogue of the events collected, processed or generated by the ATLAS experiment. The system was upgraded in advance of LHC Run 3, with a migration of the Run 1 and Run 2 data from HDFS MapFiles to HBase tables with a Phoenix interface. The frameworks for testing functionality and performance of the new system have been developed. There are two types of tests...
Deep underground, the removal of rock to fashion three soccer-field-sized caverns is underway, as is detector prototyping. In 2024, the first DUNE far detector will be constructed as a large cryostat, instrumented as a traditional tracking calorimeter but in a cold bath of xenon-doped liquid argon. An Epic Games Unreal Engine rendered 3D simulation of the underground laboratory has...
ALICE has upgraded many of its detectors for LHC Run 3 to operate in continuous readout mode recording Pb-Pb collisions at 50 kHz interaction rate without trigger.
This results in the need to process data in real time at rates 50 times higher than during Run 2. In order to tackle such a challenge we introduced O2, a new computing system and the associated infrastructure. Designed and...
Streaming Readout Data Acquisition systems coupled with distributed resources spread over vast geographic distances present new challenges to the next generation of experiments. High bandwidth modern network connectivity opens the possibility to utilize large, general-use, HTC systems that are not necessarily located close to the experiment. Near real-time response rates and workflow...
As nuclear physics collaborations and experiments increase in size, the data management and software practices in this community have changed as well. Large nuclear physics experiments at Brookhaven National Lab (STAR, PHENIX, sPHENIX), at Jefferson Lab (GlueX, CLAS12, MOLLER), and at the Electron-Ion Collider (ePIC) are taking different approaches to data management, building on existing...
This presentation will cover the content of the report delivered by the Snowmass Computational Frontier late in 2022. A description of the frontier organization and various preparatory events, including the Seattle Community Summer Study (CSS), will be followed by a discussion on the evolution of computing hardware and the impact of newly established and emerging technologies, including...
The U.S. Nuclear Physics community has been conducting long-range planning (LRP) for nuclear science since the late 1970s. The process is known as the Nuclear Science Advisory Committee (NSAC) LRP, with NSAC being an advisory body jointly appointed by the U.S. Department of Energy and the U.S. National Science Foundation. The last NSAC LRP was completed in 2015 and the current NSAC LRP is ongoing...
Today's students are tomorrow's leaders in science, technology, engineering and math. To ensure the best minds reach their potential tomorrow, it's vital to ensure that students not only experience meaningful STEM learning today, but also have the opportunities and support to pursue careers in a STEM environment that is more welcoming, inclusive and just. This panel will feature expertise from...
The analysis category was introduced in Geant4 almost ten years ago (in 2014) with the aim of providing users with a lightweight analysis tool, available as part of the Geant4 installation without the need to link to an external analysis package. It helps capture statistical data in the form of histograms and n-tuples and store these in files in four different formats. It was already presented at...
High energy physics experiments are pushing forward precision measurements and searching for new physics beyond the Standard Model. Simulating and generating the large volumes of data needed to meet the physics requirements is urgent, and making good use of the existing power of supercomputers is one of the most active areas of high energy physics computing. Taking the BESIII experiment as an illustration, we...
To increase the science rate for high data rates/volumes, Thomas Jefferson National Accelerator Facility (JLab) has partnered with Energy Sciences Network (ESnet) to define an edge to data center traffic shaping/steering transport capability featuring data event-aware network shaping and forwarding.
The keystone of this ESnet JLab FPGA Accelerated Transport (EJFAT) is the joint development...
We present tools for high-performance analysis written in pure Julia, a just-in-time (JIT) compiled dynamic programming language with a high-level syntax and performance. The packages we present center around UnROOT.jl, a pure Julia ROOT file I/O package that is optimized for speed, lazy reading, flexibility, and thread safety.
We discuss what affects performance in Julia, the challenges,...
EPOS 4 is the latest version of the high-energy collision event generator EPOS, released publicly in 2022. It was delivered with improvements in several areas: the theoretical bases on which it relies, how they are handled technically, and the user interface and data compatibility.
This last point is especially important, as part of a commitment to provide the widest...
Machine learning (ML) has become ubiquitous in high energy physics (HEP) for many tasks, including classification, regression, reconstruction, and simulations. To facilitate development in this area, and to make such research more accessible and reproducible, we require standard, easy-to-access datasets and metrics. To this end, we develop the open source Python JetNet library with easily...
The Data Lake concept has promised increased value to science and more efficient operations for storage compared to the traditional isolated storage deployments. Building on the established distributed dCache serving as the Nordic Tier-1 storage for LHC data, we have also integrated tier-2 pledged storage in Slovenia, Sweden, and Switzerland, resulting in a coherent storage space well above...
Communicating the science and achievements of the ATLAS Experiment is a core objective of the ATLAS Collaboration. This talk will explore the range of communication strategies adopted in ATLAS communications, with particular focus on how these have been impacted by the COVID-19 pandemic. In particular, an overview of ATLAS’ digital communication platforms will be given – with focus on social...
NA62 is a K meson physics experiment based on a decay-in-flight technique, whose Trigger and Data Acquisition (TDAQ) system is multi-level and network-based. A reorganization of both the beam line and the detector is foreseen in the next years to complete and extend the physics reach of NA62. One of the challenging aspects of this upgrade is a significant increase (x4) in the event rate...
The newly formed EPIC Collaboration has recently laid the foundations of its software infrastructure. Noticeably, several forward-looking aspects of the software are favorable for Artificial Intelligence (AI) and Machine Learning (ML) applications and utilization of heterogeneous resources. EPIC has a unique opportunity to integrate AI/ML from the beginning: the number of AI/ML activities is...
We will describe how ServiceX, an IRIS-HEP project, generates C++ or python code from user queries and orchestrates thousands of experiment-provided docker containers to filter and select event data. The source datafiles are identified using Rucio. We will show how the service encapsulates best practice for using Rucio and helps inexperienced analysers get up to speed quickly. The data is...
For the new Geant4 series 11.X electromagnetic (EM) physics sub-libraries were revised and reorganized in view of requirements for simulation of Phase-2 LHC experiments. EM physics simulation software takes a significant part of CPU during massive production of Monte Carlo events for LHC experiments. We present recent evolution of Geant4 EM sub-libraries for simulation of gamma, electron, and...
A mechanism to store in databases all the parameters needed to simulate the detector response to physics interactions is presented. This includes geometry, materials, magnetic field, and electronics.
GEMC includes a python API to populate the databases, and the software to run the Monte-Carlo simulation. The engine is written in C++ and uses Geant4 for the passage of particles through...
Rucio, the data management software initially developed for ATLAS, has been in use at Belle II since January 2021. After the transition to Rucio, new features and functionality were implemented in Belle II grid tools based on Rucio, to improve the experience of grid users. The container structure in the Rucio File Catalog enabled us to define collections of arbitrary datasets, allowing the...
The CMS experiment is working to integrate an increasing number of High Performance Computing (HPC) resources into its distributed computing infrastructure. The case of the Barcelona Supercomputing Center (BSC) is particularly challenging as severe network restrictions prevent the use of CMS standard computing solutions. The CIEMAT CMS group has performed significant work in order to overcome...
Field Programmable Gate Arrays (FPGAs) are playing an increasingly important role in the sampling and data processing industry due to their intrinsically highly parallel architecture, low power consumption, and flexibility to execute custom algorithms. In particular, the use of FPGAs to perform machine learning inference is increasingly growing thanks to the development of high-level synthesis...
China’s High Energy Photon Source (HEPS), the first national high-energy synchrotron radiation light source and soon one of the world’s brightest fourth-generation synchrotron radiation facilities, is under intense construction in Beijing’s Huairou District and will be completed in 2025.
To make sure that the huge amount of data collected at HEPS is accurate, available and...
The International Particle Physics Outreach Group (IPPOG) is a network of scientists, science educators and communication specialists working across the globe in informal science education and public engagement for particle physics. The primary methodology adopted by IPPOG includes the direct participation of scientists active in current research with education and communication specialists,...
Awkward Arrays is a library for performing NumPy-like computations on nested, variable-sized data, enabling array-oriented programming on arbitrary data structures in Python. However, imperative (procedural) solutions can sometimes be easier to write or faster to run. Performant imperative programming requires compilation; JIT-compilation makes it convenient to compile in an interactive Python...
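A small sketch of such JIT-compiled imperative code, relying on Awkward Array's Numba integration (toy data of our own; illustrative only):

```python
import awkward as ak
import numba as nb

@nb.njit
def sum_pt_over_threshold(events_pt, threshold):
    """Imperative loop over a jagged array of per-event particle pT values."""
    out = 0.0
    for event in events_pt:      # iterate over events
        for pt in event:         # iterate over particles within an event
            if pt > threshold:
                out += pt
    return out

# Toy jagged data: three events with 3, 0 and 2 particles.
events_pt = ak.Array([[10.0, 25.0, 3.0], [], [40.0, 7.5]])
print(sum_pt_over_threshold(events_pt, 20.0))   # expected: 25.0 + 40.0 = 65.0
```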
A critical challenge of performing data transfers or remote reads is to be as fast and efficient as possible while, at the same time, keeping the usage of system resources as low as possible. Ideally, the software that manages these data transfers should be able to organize them so that they can run up to the hardware limits. Significant portions of LHC analysis use the same datasets,...
The recent developments in ROOT/TMVA focus on fast machine learning inference, which enables analysts to deploy their machine learning models rapidly on large scale datasets. A new tool has been recently developed, SOFIE, allowing for generating C++ code for evaluation of deep learning models, which are trained from external tools such as Tensorflow or PyTorch.
While Python-based deep...
The Vera C. Rubin Observatory is preparing for execution of the most ambitious astronomical survey ever attempted, the Legacy Survey of Space and Time (LSST). Currently in its final phase of construction in the Andes mountains in Chile and due to start operations in late 2024 for 10 years, its 8.4-meter telescope will nightly scan the southern sky and collect images of the entire visible sky...
The High-Luminosity LHC will open an unprecedented window on the weak-scale nature of the universe, providing high-precision measurements of the standard model as well as searches for new physics beyond the standard model. Such precision measurements and searches require information-rich datasets with a statistical power that matches the high-luminosity provided by the Phase-2 upgrade of the...
Computing demands for large scientific experiments, such as the CMS experiment at CERN, will increase dramatically in the next decades. To complement the future performance increases of software running on CPUs, explorations of coprocessor usage in data processing hold great potential and interest. We explore the novel approach of Services for Optimized Network Inference on Coprocessors...
The NSF-funded IRIS-HEP "Training, Education & Outreach" program and QuarkNet are partnering to enable and expand software training for the high school teachers with a goal to tap, grow and diversify the talent pipeline from K-12 students for future cyberinfrastructure. IRIS-HEP (https://iris-hep.org/) is a software institute that aims to...
The LHCb software has undergone a major upgrade in view of data taking with higher luminosity in Run3 of the LHC at CERN.
The LHCb simulation framework, Gauss, had to be adapted to follow the changes in modern technologies of the underlying experiment core software and to introduce new simulation techniques to cope with the increase of the required amount of simulated data. Additional...
Neural Networks (NN) are often trained offline on large datasets and deployed on specialized hardware for inference, with a strict separation between training and inference. However, in many realistic applications the training environment differs from the real world or data arrive in a streaming fashion and are continuously changing. In these scenarios, the ability to continuously train and...
ALICE is one of the four large experiments at the CERN LHC designed to study the structure and origins of matter in collisions of heavy ions (and protons) at ultra-relativistic energies. The experiment measures the particles produced as a result of collisions in its center so that it can reconstruct and study the evolution of the system produced during these collisions. To perform these...
In the past years the landscape of tools for expressing parallel algorithms in a portable way across various compute accelerators has continued to evolve significantly. There are many technologies on the market that provide portability between CPU, GPUs from several vendors, and in some cases even FPGAs. These technologies include C++ libraries such as Alpaka and Kokkos, compiler directives...
Gaussino is a new simulation experiment-independent framework based on the Gaudi data processing framework. It provides generic core components and interfaces to build a complete simulation application: generation, detector simulation, geometry, monitoring, and saving of the simulated data. Thanks to its highly configurable and extendable components Gaussino can be used both as a toolkit and a...
The NSF-funded Scalable CyberInfrastructure for Artificial Intelligence and Likelihood Free Inference (SCAILFIN) project has developed and deployed artificial intelligence (AI) and likelihood-free inference (LFI) techniques and software using scalable cyberinfrastructure (CI) built on top of existing CI elements. Specifically, the project has extended the CERN-based REANA framework, a...
MoEDAL (the Monopole and Exotics Detector at the LHC) searches directly for magnetic monopoles at Interaction Point 8 of the Large Hadron Collider (LHC). As an upgrade of the experiment, an additional detector, MAPP (the MoEDAL Apparatus for Penetrating Particles), extends the physics reach by providing sensitivity to milli-charged and long-lived exotic particles. The MAPP detectors are scintillator...
In particle physics, data analysis frequently needs variable-length, nested data structures such as arbitrary numbers of particles per event and combinatorial operations to search for particle decay. Arrays of these data types are provided by the Awkward Array library.
The previous version of this library was implemented in C++, but this impeded its ability to grow. Thus, driven by this...
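As a small, self-contained illustration of the combinatorial operations mentioned above (toy data, not taken from the paper), Awkward Array can build all within-event particle pairs directly on the jagged structure:

```python
import awkward as ak

# Toy jagged structure: per-event lists of particle charges.
charges = ak.Array([[+1, -1, +1], [], [-1, -1]])

# All unique within-event pairs, as needed for two-body decay searches.
pairs = ak.combinations(charges, 2, fields=["first", "second"])

# Keep only opposite-charge pairs.
opposite = pairs[pairs.first * pairs.second < 0]
print(ak.to_list(opposite))
```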
UKRI/STFC’s Scientific Computing Department (SCD) runs a vibrant range of computing-related public engagement activities. We benefit from the work done by the National Labs' public engagement team to develop a well-articulated PE strategy and an accompanying evaluation framework, including the idea of defining formal generic learning outcomes (GLOs).
This paper presents how this combination...
In recent years, advanced and complex analysis workflows have gained increasing importance in the ATLAS experiment at CERN, one of the large scientific experiments at the Large Hadron Collider (LHC). Support for such workflows has allowed users to exploit remote computing resources and service providers distributed worldwide, overcoming limitations on local resources and services. The spectrum...
The Virtual Visit service run by the ATLAS Collaboration has been active since 2010. The ATLAS Collaboration has used this popular and effective method to bring the excitement of scientific exploration and discovery into classrooms and other public places around the world. The programme, which uses a combination of video conferencing, webcasts, and video recording to communicate with remote...
Recent developments of HEP software allow novel approaches to physics analysis workflows. The novel data delivery system, ServiceX, can be very effective when accessing a fraction of large datasets at remote grid sites. ServiceX can deliver user-selected columns with filtering and run at scale. We will introduce the ServiceX data management package, ServiceX DataBinder, for easy manipulations...
The findable, accessible, interoperable, and reusable (FAIR) data principles have provided a framework for examining, evaluating, and improving how we share data with the aim of facilitating scientific discovery. Efforts have been made to generalize these principles to research software and other digital products. Artificial intelligence (AI) models---algorithms that have been trained on data...
We report the implementation details, commissioning results, and physics performances of a two-dimensional cluster finder for reconstructing hit positions in the new vertex pixel detector (VELO) that is part of the LHCb Upgrade. The associated custom VHDL firmware has been deployed to the existing FPGA cards that perform the readout of the VELO and fully commissioned during the start of LHCb...
The File Transfer System (FTS) is a software system responsible for queuing, scheduling, dispatching and retrying file transfer requests. It is used by three of the LHC experiments, namely ATLAS, CMS and LHCb, as well as non-LHC experiments including AMS, DUNE and NA62. FTS is critical to the success of many experiments and the service must remain available and performant during the entire...
LCIO is a persistency framework and event data model originally developed to foster closer collaboration among the international groups conducting simulation studies for future linear colliders. In the twenty years since its introduction at CHEP 2003 it has formed the backbone for ILC and CLIC physics and detector studies. It has also been successfully employed to study and develop other...
LHCb (Large Hadron Collider beauty) is one of the four large particle physics experiments aimed at studying differences between particles and anti-particles and very rare decays in the charm and beauty sector of the standard model at the LHC. The Experiment Control System (ECS) is in charge of the configuration, control, and monitoring of the various subdetectors as well as all areas of the...
FullSimLight is a lightweight, Geant4-based command line simulation utility intended for studies of simulation performance. It is part of the GeoModel toolkit (geomodel.web.cern.ch) which has been stable for more than one year. The FullSimLight component has recently undergone renewed development aimed at extending its functionality. It has been endowed with a GUI for fast,...
The computing resources supporting the LHC experiments research programmes are still dominated by x86 processors deployed at WLCG sites. This will however evolve in the coming years, as a growing number of HPC and Cloud facilities will be employed by the collaborations in order to process the vast amounts of data to be collected in the LHC Run 3 and into the HL-LHC phase. Compute power in...
The CMS data acquisition (DAQ) is implemented as a service-oriented architecture where DAQ applications, as well as general applications such as monitoring and error reporting, are run as self-contained services. The task of deployment and operation of services is achieved by using several heterogeneous facilities, custom configuration data and scripts in several languages. Deployment of all...
High Energy Physics (HEP) Trigger and Data Acquisition systems (TDAQs) need ever-increasing throughput and real-time data analytics capabilities, either to improve particle identification accuracy and further suppress background events in trigger systems or to perform an efficient online data reduction for trigger-less ones.
As for the requirements imposed by HEP TDAQ applications in the...
Modern HEP workflows must manage increasingly large and complex data collections. HPC facilities may be employed to help meet these workflows' growing data processing needs. However, a better understanding of the I/O patterns and underlying bottlenecks of these workflows is necessary to meet the performance expectations of HPC systems.
Darshan is a lightweight I/O characterization tool that...
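For context, Darshan logs can also be inspected programmatically; the minimal sketch below assumes the pydarshan Python bindings and a hypothetical log file name, and the exact attribute names may differ between pydarshan versions:

```python
import darshan  # pydarshan bindings (assumed available)

# Open a (hypothetical) Darshan log produced by an instrumented HEP workflow.
report = darshan.DarshanReport("workflow.darshan", read_all=True)

# Which I/O interfaces (POSIX, MPI-IO, STDIO, ...) were captured?
print(list(report.modules.keys()))

# Per-module records hold the counters used to spot I/O bottlenecks
# (attribute layout is version dependent; shown here for illustration only).
posix = report.records["POSIX"].to_df()
print(posix["counters"].head())
```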
HPC systems are increasingly used to address various challenges in high-energy physics, but the data infrastructures used in that field are often not well integrated with infrastructures that include HPC resources. Here we will focus on a specific infrastructure, namely Fenix, which is based on a consortium of 6 leading European supercomputing centres. The Fenix sites are...
Cloudscheduler is a system that manages the resources of local and remote compute clouds and makes those resources available to HTCondor pools. It examines the resource needs of idle jobs, then starts virtual machines (VMs) sized to suit those resource needs on allowed clouds with available resources. Using yaml files, cloudscheduler then provisions the VMs during the boot process with all necessary...
The ATLAS Trigger and Data Acquisition (TDAQ) High Level Trigger (HLT) computing farm contains 120,000 cores. These resources are critical for online selection and collection of collision data in the ATLAS experiment during LHC operation. Since 2013, during longer periods of LHC inactivity these resources have been used for offline event simulation via the "Simulation at Point One" project...
The high-energy physics community is investigating the feasibility of deploying more machine-learning-based solutions on FPGAs to meet modern physics experiments' sensitivity and latency demands. In this contribution, we introduce a novel end-to-end procedure that utilises a forgotten method in machine learning, i.e. symbolic regression (SR). It searches equation space to discover algebraic...
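The abstract does not name a specific SR implementation; as a purely illustrative sketch, an open-source package such as PySR can search for a compact closed-form expression on (here synthetic) training data, which could subsequently be translated to FPGA-friendly arithmetic:

```python
import numpy as np
from pysr import PySRRegressor  # open-source symbolic regression package

# Hypothetical training set: a few detector-level features and a target.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 3))
y = 2.5 * np.cos(X[:, 0]) + X[:, 1] * X[:, 2]

model = PySRRegressor(
    niterations=40,                    # search budget
    binary_operators=["+", "-", "*"],  # allowed algebraic building blocks
    unary_operators=["cos", "exp"],
)
model.fit(X, y)

# The best candidate is a closed-form expression, which can then be
# mapped to fixed-point arithmetic / HLS for an FPGA implementation.
print(model.sympy())
```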
The Liquid Argon Calorimeters are employed by ATLAS for all electromagnetic calorimetry in the pseudo-rapidity region |η| < 3.2, and for hadronic and forward calorimetry in the region from |η| = 1.5 to |η| = 4.9. They also provide inputs to the first level of the ATLAS trigger. After a successful period of data taking during LHC Run 2 between 2015 and 2018, the ATLAS detector entered into the...
During Run 2 the ATLAS experiment employed a large number of different user frameworks to perform the final corrections of its event data. For Run 3 a common framework was developed that incorporates the lessons learned from existing frameworks. Besides providing analysis standardization it also incorporates optimizations that lead to a substantial reduction in computing needs during analysis.
The rapid growth of scientific data and the computational needs of BNL-supported science programs will bring the Scientific Data and Computing Center (SDCC) to the Exabyte scale in the next few years. The SDCC Storage team is responsible for the symbiotic development and operations of storage services for all BNL experiment data, in particular for the data generated by the ATLAS experiment...
For HEP event processing, data is typically stored in column-wise synchronized containers, most prominently ROOT’s TTree, which has been used for several decades and by now stores over 1 exabyte. These containers can combine the row-wise association capabilities needed by most HEP event processing frameworks (e.g. Athena for ATLAS) with column-wise storage, which typically results in better...
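As a reminder of how such columnar containers are consumed in practice, the sketch below (hypothetical file, tree and branch names) reads only the requested columns of a TTree with the uproot package:

```python
import uproot  # reads ROOT files without requiring a full ROOT installation

# Hypothetical file/tree/branch names; only the requested columns are read.
with uproot.open("events.root") as f:
    arrays = f["Events"].arrays(["Muon_pt", "Muon_eta"], library="ak")

# Column-wise access: a cut on one branch never touches the other branches on disk.
pt = arrays["Muon_pt"]
eta_of_hard_muons = arrays["Muon_eta"][pt > 20.0]
print(len(eta_of_hard_muons))
```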
The first stage of the LHCb High Level Trigger is implemented as a GPU application. In 2023 it will run on 400 NVIDIA GPUs and its goal is to reduce the rate of incoming data from 5 TB/s to approximately 100 GB/s. A broad range of reconstruction algorithms is implemented as approximately 70 kernels. Machine Learning algorithms are attractive to further extend the physics reach of the...
The CMSWEB cluster is pivotal to the activities of the Compact Muon Solenoid (CMS) experiment, as it hosts critical services required for the operational needs of the CMS experiment. The security of these services and the corresponding data is crucial to CMS. Any malicious attack can compromise the availability of our services. Therefore, it is important to construct a robust security...
The Vera C. Rubin Observatory will produce an unprecedented astronomical data set for studies of the deep and dynamic universe. Its Legacy Survey of Space and Time (LSST) will image the entire southern sky every three days and produce tens of petabytes of raw image data and associated calibration data. More than 20 terabytes of data must be processed and stored every night for ten...
The Large Hadron Collider (LHC) at CERN is the largest and most powerful particle collider today. The Phase-II Upgrade of the LHC will increase the instantaneous luminosity by a factor of 7 leading to the High Luminosity LHC (HL-LHC). At the HL-LHC, the number of proton-proton collisions in one bunch crossing (called pileup) increases significantly, putting more stringent requirements on the...
HEP data-processing frameworks are essential ingredients in getting from raw data to physics results. But they are often tricky to use well, and they present a significant learning barrier for the beginning HEP physicist. In addition, existing frameworks typically support rigid, collider-based data models, which do not map well to neutrino-physics experiments like DUNE. Neutrino physicists...
Reliably simulating detector response to hadrons is crucial for almost all physics programs at the Large Hadron Collider. The core component of such simulation is the modeling of hadronic interactions. Unfortunately, there is no first-principle theory guidance. The current state-of-the-art simulation tool, Geant4, exploits phenomenology-inspired parametric models, each simulating a specific...
The data-taking conditions expected in Run 3 pose unprecedented challenges for the DAQ systems of the LHCb experiment at the LHC. The LHCb collaboration is pioneering the adoption of a fully-software trigger to cope with the expected increase in luminosity and, thus, event rate. The upgraded trigger system has required advances in the use of hardware architectures, software and algorithms....
In the HEP community, the prediction of Data Popularity is a topic that has been approached for many years. Nonetheless, while facing increasing data storage challenges, especially in the HL-LHC era, we still need better predictive models to answer the question of whether particular data should be kept, replicated, or deleted.
The usage of caches proved to be a convenient...
In mainstream machine learning, transformers are gaining widespread usage. As Vision Transformers rise in popularity in computer vision, they now aim to tackle a wide variety of machine learning applications. In particular, transformers for High Energy Physics (HEP) experiments continue to be investigated for tasks including jet tagging, particle reconstruction, and pile-up mitigation.
In a...
The full simulation of particle colliders incurs a significant computational cost. Among the most resource-intensive steps are detector simulations. It is expected that future developments, such as higher collider luminosities and highly granular calorimeters, will increase the computational resource requirement for simulation beyond availability. One possible solution is generative neural...
The increased footprint foreseen for Run-3 and HL-LHC data will soon expose the limits of currently available storage and CPU resources. Data formats are already optimized according to the processing chain for which they are designed. ATLAS events are stored in ROOT-based reconstruction output files called Analysis Object Data (AOD), which are then processed within the derivation...
ATLAS is one of the main experiments at the Large Hadron Collider, with a diverse physics program covering precision measurements as well as new physics searches in countless final states, carried out by more than 2600 active authors. The High Luminosity LHC (HL-LHC) era brings unprecedented computing challenges that call for novel approaches to reduce the amount of data and MC that is stored,...
The Belle II software was developed as closed source. As several HEP experiments released their code to the public, the topic of open source software was also discussed within the Belle II collaboration. A task force analyzed advantages and disadvantages and proposed a policy which was adopted by the collaboration in 2020. The Belle II offline software was then released under an open source...
The Vera C. Rubin Observatory, currently in construction in Chile, will start performing the Legacy Survey of Space and Time (LSST) in late 2024, for 10 years. Its 8.4-meter telescope will survey the southern sky in less than 4 nights in six optical bands, and repeatedly generate about 2000 exposures per night, corresponding to a data volume of about 20 TB every night. Three data facilities are...
The centralised Elasticsearch service has already been running at CERN for over 6 years, providing the search and analytics engine for numerous CERN users. The service has been based on the open-source version of Elasticsearch, surrounded by a set of external open-source plugins offering security, multi-tenancy, extra visualization types and more. The evaluation of OpenDistro for...
Current ATLAS analyses always involve a step that produces an analysis-specific n-tuple incorporating the final calibrations as well as systematic variations. The goal for Run 4 is to make these analysis-specific corrections fast enough that they can be applied "on-the-fly" without the need for an intermediate n-tuple. The main complications for this are that some of these...
A new era of hadron collisions will start around 2029 with the High-Luminosity LHC, which will allow ten times more data to be collected than during the first 10 years of LHC operation. This will be achieved by a higher instantaneous luminosity, at the price of a higher number of collisions per bunch crossing.
In order to withstand the high expected radiation doses and the harsher...
Detector studies for future experiments rely on advanced software tools to estimate performance and optimize their design and technology choices. The Key4hep project provides a flexible turnkey solution for the full experiment life-cycle based on established community tools such as ROOT, Geant4, DD4hep, Gaudi, podio and spack. Members of the CEPC, CLIC, EIC, FCC, and ILC communities have...
Complete and reliable monitoring of the WLCG data transfers is an important condition for effective computing operations of the LHC experiments. WLCG data challenges organised in 2021 and 2022 highlighted the need for improvements in the monitoring of data traffic on the WLCG infrastructure. In particular, it concerns the implementation of the monitoring of the remote data access via the...
For High Energy Physics (HEP) experiments, the calorimeter is a key detector to measure the energy of particles. Particles interact with the material of the calorimeter, creating cascades of secondary particles, the so-called showers. Description of the showering process relies on simulation methods that precisely describe all particle interactions with matter. Constrained by the complexity of...
The High-Luminosity LHC upgrade of the CMS experiment will utilise a large number of Machine Learning (ML) based algorithms in its hardware-based trigger. These ML algorithms will facilitate the selection of potentially interesting events for storage and offline analysis. Strict latency and resource requirements limit the size and complexity of these models due to their use in a high-speed...
A new theoretical framework in Quantum Machine Learning (QML) makes it possible to compare the performance of quantum and classical ML models on supervised learning tasks. We assess the performance of a quantum and a classical support vector machine on a High Energy Physics dataset, the Higgs $t\bar{t}H(b\bar{b})$ decay dataset, grounding our study in a new theoretical framework based on three metrics: the geometry...
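Conceptually, the quantum/classical comparison amounts to exchanging the kernel (Gram) matrix supplied to an otherwise identical support vector machine; the sketch below uses synthetic data and a classical RBF kernel, and marks where a quantum fidelity kernel would be substituted (illustrative only, not the authors' code):

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics.pairwise import rbf_kernel

# Synthetic stand-in for a HEP dataset (features X, labels y).
rng = np.random.default_rng(1)
X_train, X_test = rng.normal(size=(200, 4)), rng.normal(size=(50, 4))
y_train = (X_train[:, 0] * X_train[:, 1] > 0).astype(int)

# Classical kernel: an RBF Gram matrix.
K_train = rbf_kernel(X_train, X_train)
K_test = rbf_kernel(X_test, X_train)

# A quantum-kernel study would replace K_train/K_test by state-fidelity
# matrices evaluated on a quantum device or simulator; the SVM is unchanged.
clf = SVC(kernel="precomputed").fit(K_train, y_train)
print(clf.predict(K_test)[:10])
```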
With the increased data volumes expected to be delivered by the HL-LHC, it becomes critical for the ATLAS experiment to maximize the utilization of available computing resources ranging from conventional GRID clusters to supercomputers and cloud computing platforms. To be able to run its data processing applications on these resources, the ATLAS software framework must be capable of...
Due to the increased network traffic expected during the HL-LHC era, the T2 sites in the USA will be required to have 400 Gbps of available bandwidth to their storage solution.
With this in mind, we are pursuing a scale test of the XRootD software when used to perform Third Party Copy transfers using the HTTP protocol. Our main objective is to understand the possible limitations in...
Since March 2019 the Belle II detector has collected data from e+ e- collisions at the SuperKEKB collider. For Belle II analyses to be competitive it is crucial that calibration constants are calculated promptly so that the reconstructed datasets can be provided to analysts. A subset of the calibration constants also benefits from being re-derived during yearly recalibration campaigns to give...
Free energy-based reinforcement learning (FERL) with clamped quantum Boltzmann machines (QBM) was shown to significantly improve the learning efficiency compared to classical Q-learning with the restriction, however, to discrete state-action space environments. We extended FERL to continuous state-action space environments by developing a hybrid actor-critic scheme combining a classical...
For the Belle II experiment, the electromagnetic calorimeter (ECL) plays a crucial role in both the trigger decisions and the offline analysis.
The performance of existing clustering algorithms degrades with rising backgrounds that are expected for the increasing luminosity in Belle II. In offline analyses, this mostly impacts the energy resolution for low-energy photons; for the trigger,...
The Open Data Detector (ODD) is a detector for algorithm research and development. The tracking system is an evolution of the detector used in the successful Tracking Machine Learning Challenge. It offers a more realistic design, with a support structure, cables, and cooling pipes. The ODD has been extended with granular calorimetry and can be completed in the future with a muon system. The magnetic field...
In a decade from now, the Upgrade II of the LHCb experiment will face an instantaneous luminosity ten times higher than in the current Run 3 conditions. This will bring LHCb to a new era, with huge event sizes and typically several signal heavy-hadron decays per event. The trigger scope will shift from selecting interesting events to selecting interesting parts of multi-signal events. To allow for an...
With the emergence of the research field of Quantum Machine Learning, interest in finding advantageous real-world applications is growing as well.
However, challenges concerning the number of available qubits on Noisy Intermediate-Scale Quantum (NISQ) devices and accuracy losses due to hardware imperfections remain and limit the applicability of such approaches in real-world scenarios.
For...
In preparation for the second runs of the ProtoDUNE detectors at CERN (NP02 and NP04), DUNE has established a new data pipeline for bringing the data from the EHN-1 experimental hall at CERN to primary tape storage at Fermilab and CERN, and then spreading it out to a distributed disk data store at many locations around the world. This system includes a new Ingest Daemon and a new Declaration...
The volume and complexity of data produced at HEP and NP research facilities have grown exponentially. There is an increased demand for new approaches to process data in near-real-time. In addition, existing data processing architectures need to be reevaluated to see if they are adaptable to new technologies. A unified programming model for event processing and distribution that can exploit...
In the last two decades, there have been major breakthroughs in Quantum Chromodynamics (QCD), the theory of the strong interaction of quarks and gluons, as well as major advances in the accelerator and detector technologies that allow us to map the spatial distribution and motion of the quarks and gluons in terms of quantum correlation functions (QCF). This field of research, broadly known as...
The search for exotic long-lived particles (LLP) is a key area of the current LHC physics programme and is expected to remain so into the High-Luminosity (HL)-LHC era. As in many areas of the LHC physics programme Machine Learning algorithms play a crucial role in this area, in particular Deep Neural Networks (DNN), which are able to use large numbers of low-level features to achieve enhanced...
The ALICE Grid is designed to perform real-time, comprehensive monitoring of both jobs and execution nodes in order to maintain a continuous and consistent view of the status of the Grid infrastructure. An extensive database of historical data is available and is periodically analyzed to tune the workflow and data management to optimal performance levels. This data, when evaluated in real time, has the...
The LArSoft/art framework is used at Fermilab’s liquid argon time projection chamber experiments such as ICARUS to run traditional production workflows in a grid environment. It has become increasingly important to utilize HPC facilities for experimental data processing tasks. As part of the SciDAC-4 HEP Data Analytics on HPC and HEP Event Reconstruction with Cutting Edge Computing...
Particle tracking is among the most sophisticated and complex parts of the full event reconstruction chain. A number of reconstruction algorithms work in sequence to build these trajectories from detector hits. Each of these algorithms uses many configuration parameters that need to be fine-tuned to properly account for the detector/experimental setup, the available CPU budget and the desired...
At the CMS experiment, a growing reliance on the fast Monte Carlo application (FastSim) will accompany the high luminosity and detector granularity expected in Phase 2. The FastSim chain is roughly 10 times faster than the application based on the GEANT4 detector simulation and full reconstruction referred to as FullSim. However, this advantage comes at the price of decreased accuracy in some...
In an ideal world, we describe our models with recognizable mathematical expressions and directly fit those models to large data samples with high performance. It turns out that this can be done with a Computer Algebra System (CAS), using its symbolic expression trees as templates for computational back-ends like JAX. The CAS can in fact further simplify the expression tree, which can result in speed-ups in the numerical...
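A minimal sketch of this idea, assuming SymPy as the CAS and a SymPy version with JAX code-generation support (the abstract does not commit to these specific packages): the symbolic tree is simplified once and then compiled into a JIT-compiled numerical function.

```python
import sympy as sp
import jax
import jax.numpy as jnp

# Symbolic model: a Gaussian plus a linear background, built in the CAS.
x, mu, sigma, a = sp.symbols("x mu sigma a")
model = sp.exp(-(x - mu) ** 2 / (2 * sigma**2)) / (sp.sqrt(2 * sp.pi) * sigma) + a * x
model = sp.simplify(model)  # the CAS can simplify the tree before code generation

# Turn the expression tree into a JAX function and JIT-compile it.
f = sp.lambdify((x, mu, sigma, a), model, modules="jax")
f_jit = jax.jit(f)

data = jnp.linspace(-3.0, 3.0, 10_000)
print(f_jit(data, 0.0, 1.0, 0.01)[:3])  # fast, differentiable evaluation
```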
In the search for advantage in Quantum Machine Learning, appropriate encoding of the underlying classical data into a quantum state is a critical step. Our previous work [1] implemented an encoding method inspired by underlying physics and achieved AUC scores higher than that of classical techniques for a reduced dataset of B Meson Continuum Suppression data. A particular problem faced by...
To improve the potential for discoveries at the LHC, a significant luminosity increase of the accelerator (HL-LHC) is foreseen in the late 2020s, to achieve a peak luminosity of $7.5\times10^{34}$ cm$^{-2}$s$^{-1}$ in the ultimate performance scenario. HL-LHC is expected to run with a bunch-crossing separation of 25 ns and a maximum average of 200 events (pile-up) per crossing. To maintain or even improve...
The nature and origin of dark matter are among the most compelling mysteries of contemporary science. There is strong evidence for dark matter from its role in shaping the galaxies and galaxy clusters that we observe in the universe. Still, physicists have tried to detect dark matter particles for over three decades with little success.
This talk will describe the leading effort in that...
The increasing volumes of data produced at light sources such as the Linac Coherent Light Source (LCLS) enable the direct observation of materials and molecular assemblies at the length and timescales of molecular and atomic motion. This exponential increase in the scale and speed of data production is prohibitive to traditional analysis workflows that rely on scientists tuning parameters...
The astronomy world is moving towards exascale observatories and experiments, with data distribution and access challenges that are comparable with - and on similar timescales to - those of HL-LHC. I will present some of these use cases and show progress towards prototyping work in the Square Kilometre Array (SKA) Regional Centre Network, and present the conceptual architectural view of the...
Timepix4 is a hybrid pixel detector readout ASIC developed by the Medipix4 Collaboration. It consists of a matrix of about 230 k pixels with 55 micron pitch, each equipped with an amplifier, a discriminator and a time-to-digital converter with 195 ps bin size that allows measurement of time-of-arrival and time-over-threshold. It is equipped with two different types of links: slow control links, for the...
Delphes is a C++ framework to perform a fast multipurpose detector response simulation. The Circular Electron Positron Collider (CEPC) experiment runs fast simulation with a modified Delphes based on its own scientific objectives. The CEPC fast simulation with Delphes is a High Throughput Computing (HTC) application with small input and output files. Besides, to compile and run Delphes, only...
Slurm REST APIs have been available since version 20.02. With these REST APIs one can interact with the slurmctld and slurmdbd daemons in a RESTful way. As a result, job submission and cluster status queries can be performed from a web system. To take advantage of the Slurm REST APIs, a web workbench system has been developed for the Slurm cluster at IHEP.
The workbench system consists of four subsystems:...
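For illustration, a web backend can query the cluster through slurmrestd over plain HTTP; the host, token and API version path below are placeholders and depend on the local Slurm release:

```python
import requests

# Placeholders: slurmrestd endpoint and a JWT issued for the user.
BASE = "http://slurm-rest.example.org:6820"
HEADERS = {
    "X-SLURM-USER-NAME": "alice",
    "X-SLURM-USER-TOKEN": "<jwt-token>",
}

# Query the current job list (the API version path depends on the Slurm release).
resp = requests.get(f"{BASE}/slurm/v0.0.38/jobs", headers=HEADERS, timeout=10)
resp.raise_for_status()

for job in resp.json().get("jobs", []):
    print(job.get("job_id"), job.get("job_state"), job.get("user_name"))
```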
Documentation on all things computing is vital for an evolving collaboration of scientists, technicians, and students. Using the MediaWiki software as the framework, a searchable knowledge base is provided via a secure web interface to the large cadre of colleagues working on the DUNE far and near detectors. Organization information, links relevant to working groups, operations and...
WLCG relies on the network as a critical part of its infrastructure and therefore needs to guarantee effective network usage and prompt detection and resolution of any network issues, including connection failures, congestion and traffic routing. The OSG Networking Area, in partnership with the WLCG Throughput working group, has created a monitoring infrastructure that gathers metrics from the...
Particle identification is an important ingredient of particle physics experiments. Distinguishing the charged hadrons (pions, kaons, protons and their antiparticles) is often crucial, in particular for hadronic decays, which require efficient particle identification to obtain a desirable signal-to-background ratio. An optimal particle identification performance in a large...
Modern large distributed computing systems produce large amounts of monitoring data. In order for these systems to operate smoothly, under-performing or failing components have to be identified quickly, and preferably automatically, enabling the system managers to react accordingly.
In this contribution, we analyze job and data transfer data collected in the running of the LHC computing...
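One generic way to automate such flagging (shown here only as an illustration, not necessarily the method used in this contribution) is an unsupervised outlier detector trained on per-component monitoring metrics:

```python
import numpy as np
from sklearn.ensemble import IsolationForest

# Hypothetical per-site metrics: failure rate, mean transfer time, queue depth.
rng = np.random.default_rng(7)
normal = rng.normal(loc=[0.02, 120.0, 50.0], scale=[0.01, 20.0, 10.0], size=(500, 3))
degraded = rng.normal(loc=[0.30, 600.0, 400.0], scale=[0.05, 50.0, 50.0], size=(5, 3))
metrics = np.vstack([normal, degraded])

# Fit on the bulk of the data; anomalous rows get a prediction of -1.
detector = IsolationForest(contamination=0.02, random_state=0).fit(metrics)
flags = detector.predict(metrics)
print("flagged rows:", np.where(flags == -1)[0])
```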
Many large-scale physics experiments, such as ATLAS at the Large Hadron Collider, the Deep Underground Neutrino Experiment and sPHENIX at the Relativistic Heavy Ion Collider, rely on accurate simulations to inform data analysis and derive scientific results. The inevitable inaccuracies of these simulations may be detected and corrected using heuristics in a conventional analysis workflow.
However, residual errors...
The ATLAS Spanish Tier-1 and Tier-2s have more than 18 years of experience in the deployment and development of LHC computing components and their successful operations. The sites are actively participating in, and even coordinating, R&D computing activities in the LHC Run3 and developing the computing models needed in HL-LHC period.
In this contribution, we present details on the...
Most common CPU architectures provide simultaneous multithreading (SMT). The operating system thereby sees two logical cores per physical core and can schedule two processes to one physical CPU core. This overbooking of physical cores enables better usage of the parallel pipelines and duplicated components within a CPU core. On systems with several applications running in parallel, such as...
ML/DL techniques have shown their power in improving several studies and tasks in HEP, especially in physics analysis. Our approach has been to take a number of ML/DL tools provided by several open-source platforms and apply them to several classification problems, for instance the extraction of ttbar resonances in an LHC experiment.
A comparison has been made...
In High Energy Physics, detailed and time-consuming simulations are used for particle interactions with detectors. To bypass these simulations with a generative model, the generation of large point clouds in a short time is required, while the complex dependencies between the particles must be correctly modeled. Particle showers are inherently tree-based processes, as each particle is produced...
ATLAS Metadata Interface (AMI) is a generic ecosystem for metadata aggregation, transformation and cataloging. Benefiting from more than 20 years of feedback in the LHC context, the second major version was released in 2018. This poster describes how a renewed architecture and integration with modern technologies ease the usage and deployment of a complete AMI stack. It describes how to deploy...
Liquid argon time projection chambers (TPCs) are widely used in particle detection. High quality physics simulators have been developed for such detectors in a variety of experiments, and the resulting simulations are used to aid in reconstruction and analysis of collected data. However, the degree to which these simulations are reflective of real data is limited by the knowledge of the...
As a conventional iterative ptychography reconstruction method, the Difference Map (DM) is suitable for processing large datasets of X-ray diffraction patterns on the multi-GPU heterogeneous system at HEPS (High Energy Photon Source). However, it assumes that the GPU memory is large enough to hold all the patterns transferred from RAM and to apply CUDA FFT/IFFT to them. Meanwhile, the intermediate data during...
Providing a high-performance and reliable tape storage system is GridKa's top priority. The GridKa tape storage system was recently migrated from IBM SP to the High Performance Storage System (HPSS) for LHC and non-LHC HEP experiments. These are two different tape backends, and each has its own design and specifics that need to be studied and understood very deeply. Taking into account the features...
We present a case for ARM chips as an alternative to standard x86 CPUs at WLCG sites, as we observed better performance and lower power consumption in a number of benchmarks based on actual HEP workloads, from ATLAS simulation and reconstruction tasks to the most recent HEP-Score containerised jobs.
To support our case, we present novel measurements on the performance and energy...
Starting this year, the upgraded LHCb detector is collecting data with a pure software trigger. In its first stage, reducing the rate from 30 MHz to about 1 MHz, GPUs are used to reconstruct and trigger on B and D meson topologies and high-pT objects in the event. In its second stage, a CPU farm is used to reconstruct the full event and perform candidate selections, which are persisted for...
Increases in data volumes are forcing high-energy and nuclear physics experiments to store more frequently accessed data on tape. Extracting the maximum performance from tape drives is critical to make this viable from a data availability and system cost standpoint. The nature of data ingest and retrieval in an experimental physics environment make achieving high access performance difficult...
TFPWA is a general framework for partial wave analysis developed on top of TensorFlow2. Partial wave analysis is a powerful method to determine the 4-momentum distributions of multi-body decay final states and to extract the internal information of interest. Based on a simple topological representation, TF-PWA can handle most steps of a partial wave analysis automatically and...
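The computational pattern behind such a framework can be sketched generically (this is not TF-PWA's actual API): the likelihood is expressed in TensorFlow operations so that the gradients driving the fit come from automatic differentiation.

```python
import tensorflow as tf

# Toy model: an unbinned fit of a "magnitude" and a "phase" parameter to
# pseudo-data, standing in for the amplitude parameters of a partial wave analysis.
data = tf.random.normal([10_000], mean=0.3, stddev=1.0)

mag = tf.Variable(1.0)
phase = tf.Variable(0.0)

def nll():
    # Placeholder probability model; a real PWA would build the coherent
    # sum of amplitudes from the decay topology here.
    mean = mag * tf.cos(phase)
    return tf.reduce_sum(0.5 * (data - mean) ** 2)

opt = tf.keras.optimizers.Adam(learning_rate=0.05)
for _ in range(200):
    with tf.GradientTape() as tape:
        loss = nll()
    grads = tape.gradient(loss, [mag, phase])
    opt.apply_gradients(zip(grads, [mag, phase]))

print(float(mag * tf.cos(phase)))  # fitted mean, obtained purely via autodiff
```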
Decay products of long-lived particles are an important signature in dark sector searches in collider experiments. The current Belle II tracking algorithm is optimized for tracks originating from the interaction point at the cost of a lower track finding efficiency for displaced tracks. This is especially the case for low-momentum displaced tracks that are crucial for dark sector searches in...
Multi-messenger astrophysics provides valuable insights into the properties of the physical Universe. These insights arise from the complementary information carried by photons, gravitational waves, neutrinos and cosmic rays about individual cosmic sources and source populations.
When a gravitational wave (GW) candidate is identified by the LIGO, Virgo and KAGRA (LVK) observatory network, an...
The ATLAS EventIndex is the global catalogue of all ATLAS real and simulated events. During the LHC long shutdown between Run 2 (2015-2018) and Run 3 (2022-2025) its components were substantially revised, and a new system has been deployed for the start of Run 3 in Spring 2022. The new core storage system is based on HBase tables with a Phoenix interface. It allows faster data ingestion rates...
The computing resource needs of LHC experiments, such as CMS, are expected to continue growing significantly over the next decade, during the Run 3 and especially the HL-LHC era. Additionally, the landscape of available resources will evolve, as HPC (and Cloud) resources will provide a comparable, or even dominant, fraction of the total capacity, in contrast with the current situation,...
The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope located at the geographic South Pole. Understanding detector systematic effects is a continuous process. This requires the Monte Carlo simulation to be updated periodically to quantify potential changes and improvements in science results with more detailed modeling of the systematic effects. IceCube’s largest systematic...
The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope located at the geographic South Pole. IceCube recently completed a multi-year user management migration from a manual LDAP-based process to an automated and integrated system built on Keycloak. A custom interface allows lead scientists to register and edit their institution’s user accounts, including group memberships...
The complexity of the ATLAS detector and its infrastructure require an excellent understanding of the interdependencies between the components when identifying points of failure (POF). The ATLAS Technical Coordination Expert System features a graph-based inference engine that identifies a POF provided a list of faulty elements or Detector Safety System (DSS) alarms. However, the current...
DIRAC is a widely used “framework for distributed computing”. It works by building a layer between the users and the resources offering a common interface to a number of heterogeneous providers. DIRAC, like many other workload management systems, uses pilot jobs to check and configure the worker-node environment before fetching a user payload. The pilot also records a number of different...
This contribution introduces the job optimizer service for the next-generation ALICE grid middleware, JAliEn (Java Alice Environment). It is a continuous service running on central machines and is essentially responsible for splitting jobs into subjobs, to then be distributed and executed on the ALICE grid. There are several ways of creating subjobs based on various strategies relevant...
The Super Tau-Charm Facility (STCF) is a future electron-positron collider proposed in China. It has a peak luminosity above $0.5\times10^{35}$ cm$^{-2}$s$^{-1}$ and a center-of-mass energy ranging from 2 to 7 GeV. A time-of-flight detector based on the detection of internally reflected Cherenkov light (DTOF) is proposed for endcap particle identification (PID) at the STCF. In this contribution, we...
In this work, we report the status of a neural network regression model trained to extract new physics (NP) parameters in Monte Carlo (MC) data. We utilize a new EvtGen NP MC generator to generate $B \rightarrow K^{*} \ell^{+} \ell^{-}$ events according to the deviation of the Wilson Coefficient $C_{9}$ from its SM value, $\delta C_{9}$. We train a convolutional neural network regression...
The ROOT RNTuple I/O subsystem has been designed to address performance bottlenecks and shortcomings of ROOT's current state-of-the-art TTree I/O subsystem. RNTuple provides a backwards-incompatible redesign of the TTree binary format and API that evolves the ROOT event data I/O for the challenges of the upcoming decades. It has been engineered for high performance on modern storage hardware, a...
Reweighting Monte Carlo (MC) events for alternate benchmarks of beyond standard model (BSM) physics is an effective way to reduce the computational cost of physics searches. However, applicability of reweighting is often constrained by technical limitations. We demonstrate how pre-trained neural networks can be used to obtain fast and reliable reweighting without relying on the full MC...
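The general idea can be illustrated with a classifier-based reweighting sketch (synthetic data; the contribution relies on its own pre-trained networks): a classifier separating the nominal sample from the target benchmark yields per-event weights via the likelihood-ratio trick.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

# Toy stand-ins for event features generated under two BSM benchmarks.
rng = np.random.default_rng(3)
nominal = rng.normal(loc=0.0, size=(5000, 4))
target = rng.normal(loc=0.4, size=(5000, 4))

X = np.vstack([nominal, target])
y = np.concatenate([np.zeros(len(nominal)), np.ones(len(target))])

clf = MLPClassifier(hidden_layer_sizes=(64, 64), max_iter=300).fit(X, y)

# Likelihood-ratio trick: w(x) = p(target|x) / p(nominal|x) reweights the
# nominal sample so that it approximates the target benchmark.
p = clf.predict_proba(nominal)[:, 1]
weights = p / (1.0 - p)
print(weights.mean(), weights.std())
```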
dCache at BNL has been in production for almost two decades. For years, dCache used the default driver included with the dCache software to interface with HPSS tape storage systems. Due to the synchronous nature of this approach and the high resource demands resulting from periodic script invocations, scalability was significantly limited. During the WLCG tape challenges, bottlenecks in dCache...
Particle accelerators, such as the Spallation Neutron Source (SNS), require high beam availability in order to maximize scientific discovery. Recently, researchers have made significant progress utilizing machine learning (ML) models to identify anomalies, prevent damage, reduce beam loss, and tune accelerator parameters in real time. In this work, we study the use of uncertainty aware...
OpenMP is a directive-based shared-memory parallel programming model traditionally used for multicore CPUs. In its recent versions, OpenMP was extended to enable GPU computing via its “target offloading” model. The architecture-agnostic compiler directives can in principle offload to multiple types of GPUs and FPGAs, and its compiler support is under active development.
In this work, we...
The Wide Field-of-view Cherenkov Telescope Array (WFCTA) is an important component of the Large High Altitude Air Shower Observatory (LHAASO), which aims to measure the individual energy spectra of cosmic rays from ~30 TeV to a couple of EeV. Since the experiment started running in 2020, WFCTA simulation jobs have been running on the Intel x86 cluster; last year only 25% of the first stage...
During the first WLCG Network Data Challenge in fall of 2021 some shortcomings in the monitoring that impeded the ability to fully understand the results collected during the data challenge were identified. One of the simplest missing components was site specific network information, especially information about traffic going into and out of any of the participating sites. Without this...
Since 2009 Port d’Informació Científica (PIC), in Barcelona, has hosted the Major Atmospheric Gamma Imaging Cherenkov (MAGIC) Data Center. At the Observatorio del Roque de Los Muchachos (ORM) in the Canary Island of La Palma (Spain), data produced from observations by the 17m diameter MAGIC telescopes are transferred to PIC on a daily basis. More than 200 TB per year are being transferred to...
In this paper we describe the development of a streamlined framework for large-scale ATLAS pMSSM reinterpretations of LHC Run 2 analyses using containerised computational workflows. The project is looking to assess the global coverage of BSM physics and requires running O(5k) computational workflows representing pMSSM model points. Following ATLAS Analysis Preservation policies, many analyses...
Cutting edge research has driven scientists in many fields into the world of big data. While data storage technologies continue to evolve, the costs remain high for rapid data access on such scales and are a major factor in planning and operations. As a joint effort spanning experiment scientists, developers for ROOT, and industrial leaders in data compression, we sought to address this...
The European energy crisis during 2022 prompted many computing facilities to take urgent electricity-saving measures. These were in part voluntary, in response to EU and national appeals, but were also necessary to keep within a flat energy budget as the electricity price rose. We review some of these measures and, as the situation normalises, take a longer view on how the flexibility of high-throughput computing can...
With over 2000 active members from 174 institutes in 41 countries around the world, the ALICE experiment is one of the 4 large experiments at CERN. With such numerous interactions, the experiment management needs a way to record members' participation history and their current status, such as employment, institutes, appointments, clusters and funding agencies, as well as to automatically...
The ATLAS experiment has 18+ years of experience using workload management systems to deploy and develop workflows to process and to simulate data on the distributed computing infrastructure. Simulation, processing and analysis of LHC experiment data require the coordinated work of heterogeneous computing resources. In particular, the ATLAS experiment utilizes the resources of 250 computing...
Every sub-detector in the ATLAS experiment at the LHC, including the ATLAS Calorimeter, writes conditions and calibration data into an ORACLE Database (DB). In order to provide an interface for reliable interactions with both the Conditions Online and Offline DBs, a unique semi-automatic web-based application, TileCalibWeb Robot, has been developed. TileCalibWeb is being used in LHC Run 3 by...
A new methodology to improve the sensitivity to new physics contributions to Standard Model processes at the LHC is presented.
A Variational AutoEncoder trained on Standard Model processes is used to identify Effective Field Theory contributions as anomalies. While the output of the model is supposed to be very similar to the inputs for Standard Model events, it is expected to deviate...
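A minimal sketch of the anomaly-scoring step, with a plain autoencoder standing in for the Variational AutoEncoder and purely synthetic inputs: the per-event reconstruction error serves as the test statistic, with large values indicating EFT-like departures from the SM training sample.

```python
import torch
import torch.nn as nn

# Toy SM-like training events: a handful of high-level kinematic features.
torch.manual_seed(0)
sm_events = torch.randn(2000, 8)

# Plain autoencoder as a stand-in for the VAE described in the abstract.
model = nn.Sequential(
    nn.Linear(8, 4), nn.ReLU(),
    nn.Linear(4, 2), nn.ReLU(),   # bottleneck
    nn.Linear(2, 4), nn.ReLU(),
    nn.Linear(4, 8),
)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

for _ in range(200):  # train the network to reproduce SM events
    opt.zero_grad()
    loss = nn.functional.mse_loss(model(sm_events), sm_events)
    loss.backward()
    opt.step()

# Anomaly score: per-event reconstruction error; EFT-like events should score high.
test_events = torch.randn(10, 8) + 2.0  # hypothetical "anomalous" events
scores = ((model(test_events) - test_events) ** 2).mean(dim=1)
print(scores)
```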
ATLAS Metadata Interface (AMI) is a generic ecosystem for metadata aggregation, transformation and cataloging. Benefiting from more than 20 years of feedback in the LHC context, the second major version was released in 2018. Each sub-system of the stack has recently been improved in order to acquire messaging/telemetry capabilities. This poster describes the whole stack monitoring with the...
The Worldwide Large Hadron Collider Computing Grid (WLCG) actively pursues the migration from IPv4 to IPv6. For this purpose, the HEPiX-IPv6 working group was founded during the fall HEPiX Conference in 2010. One of the first goals was to categorize the applications running in the WLCG into different groups: the first group was easy to define, because it comprised all...