May 8 – 12, 2023
Norfolk Waterside Marriott
US/Eastern timezone

Contribution List

  1. Boehnlein, Amber (JLAB), Dodge, Gail (Old Dominion University), Henderson, Stuart (Jefferson Lab), Scott, Bobby (United States Congressman)
    5/8/23, 9:00 AM
  2. Dean, David (TJNAF)
    5/8/23, 9:30 AM

    The world is full of computing devices that calculate, monitor, analyze, and control processes. The underlying technical advances within computing hardware have been further enhanced by tremendous algorithmic advances across the spectrum of the sciences. The quest -- ever present in humans -- to push the frontiers of knowledge and understanding requires continuing advances in the development...

  3. Diefenthaler, Markus (Jefferson Lab)
    5/8/23, 10:00 AM
    Plenary

    In today's Nuclear Physics (NP), the exploration of the origin, evolution, and structure of the universe's matter is pursued through a broad research program at various collaborative scales, ranging from small groups to large experiments comparable in size to those in high-energy physics (HEP). Consequently, software and computing efforts vary from DIY approaches among a few researchers to...

  4. Kwok, Ka Hei Martin (Fermilab)
    5/8/23, 11:00 AM

    Next generation High-Energy Physics (HEP) experiments are presented with significant computational challenges, both in terms of data volume and processing power. Using compute accelerators, such as GPUs, is one of the promising ways to provide the necessary computational power to meet the challenge. The current programming models for compute accelerators often involve using...

  5. Mr Litvintsev, Dmitry (Fermilab)
    5/8/23, 11:00 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The dCache project provides open-source software deployed internationally to satisfy
    ever more demanding storage requirements. Its multifaceted approach provides an integrated
    way of supporting different use-cases with the same storage, from high throughput data
    ingest, data sharing over wide area networks, efficient access from HPC clusters and long
    term data persistence on a tertiary...

  6. Weber, Christian (Brookhaven National Laboratory)
    5/8/23, 11:00 AM
    Track 4 - Distributed Computing
    Oral

    Machine learning has become one of the important tools for High Energy Physics analysis. As the size of the dataset increases at the Large Hadron Collider (LHC), and at the same time the search spaces become bigger and bigger in order to exploit the physics potentials, more and more computing resources are required for processing these machine learning tasks. In addition, complex advanced...

  7. Mambelli, Marco (Fermilab)
    5/8/23, 11:00 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Since 1984 the Italian groups of the Istituto Nazionale di Fisica Nucleare (INFN) and Italian Universities, collaborating with the
    DOE laboratory of Fermilab (US) have been running a two-month summer training program for Italian university students. While
    in the first year the program involved only four physics students of the University of Pisa, in the following years it was extended
    to...

  8. Gál, Tamás (Erlangen Centre for Astroparticle Physics)
    5/8/23, 11:00 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The Julia programming language was created 10 years ago and is now a mature and stable language with a large ecosystem including more than 8,000 third-party packages. It was designed for scientific programming as a high-level, dynamic language like Python, while achieving runtime performance comparable to, or even better than, C/C++. With this, we ask ourselves if the Julia language and its...

  9. Allaire, Corentin (IJCLab)
    5/8/23, 11:00 AM
    Track 3 - Offline Computing
    Oral

    The reconstruction of particle trajectories is a key challenge of particle physics experiments, as it directly impacts particle identification and physics performance while also representing one of the main CPU consumers of many high energy physics experiments. As the luminosity of particle colliders increases, this reconstruction will become more challenging and resource intensive. New...

  10. Wilkinson, Michael (University of Cincinnati)
    5/8/23, 11:00 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Hadronization is an important step in Monte Carlo event generators, where quarks and gluons are bound into physically observable hadrons. Today’s generators rely on finely-tuned empirical models, such as the Lund string model; while these models have been quite successful overall, there remain phenomenological areas where they do not match data well. In this talk, we present MLHad, a...

  11. Pisani, Flavio (CERN)
    5/8/23, 11:00 AM
    Track 2 - Online Computing
    Oral

    The LHCb experiment is one of the four large experiments on the LHC at CERN. This forward spectrometer is designed to investigate differences between matter and antimatter by studying beauty and charm Physics. The detector and the entire DAQ chain have been upgraded, to profit from the higher luminosity delivered by the particle accelerator during Run3. The new DAQ system introduces a...

  12. Rembser, Jonas (CERN)
    5/8/23, 11:00 AM
    Track 6 - Physics Analysis Tools
    Oral

    RooFit is a library for building and fitting statistical models that is part of ROOT. It is used in most experiments in particle physics, in particular, the LHC experiments. Recently, the backend that evaluates the RooFit likelihood functions was rewritten to support performant computations of model components on different hardware. This new backend is referred to as the "batch mode". So far,...

  13. Bockelman, Brian (Morgridge Institute for Research)
    5/8/23, 11:00 AM
    Track 7 - Facilities and Virtualization
    Oral

    Managing a secure software environment is essential to a trustworthy cyberinfrastructure. Software supply chain attacks may be a top concern for IT departments, but they are also an aspect of scientific computing. The threat to scientific reputation caused by problematic software can be just as dangerous as an environment contaminated with malware. The issue of managing environments affects...

  14. Sandesara, Jay (University of Massachusetts, Amherst)
    5/8/23, 11:15 AM
    Track 4 - Distributed Computing
    Oral

    We present a new implementation of simulation-based inference using data collected by the ATLAS experiment at the LHC. The method relies on large ensembles of deep neural networks to approximate the exact likelihood. Additional neural networks are introduced to model systematic uncertainties in the measurement. Training of the large number of deep neural networks is automated using a...

  15. DeMuth, David (Valley City State University)
    5/8/23, 11:15 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Providing computing training to the next generation of physicists is the
    principal driver for a biannual multi-day workshop hosted by the DUNE
    Computing Consortium. Materials are cast in the Software Carpentries
    templates, and to date topics have included storage space, data
    management, LArSoft, grid job submission and monitoring. Moreover,
    experts provide extended breakout sessions to...

  16. Yang, Wei (SLAC National Accelerator Laboratory)
    5/8/23, 11:15 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    XRootD implemented a client-side erasure coding (EC) algorithm utilizing the Intel Intelligent Storage Acceleration Library. At SLAC, a prototype of XRootD EC storage was set up for evaluation. The architecture and configuration of the prototype are almost identical to those of a traditional non-EC XRootD storage behind a firewall: a backend XRootD storage cluster in its simplest form, and an...

  17. Tadel, Matevz (UCSD)
    5/8/23, 11:15 AM
    Track 3 - Offline Computing
    Oral

    MkFit is an implementation of the Kalman filter-based track reconstruction algorithm that exploits both thread- and data-level parallelism. In the past few years the project transitioned from the R&D phase to deployment in the Run-3 offline workflow of the CMS experiment. The CMS tracking performs a series of iterations, targeting reconstruction of tracks of increasing difficulty after...

  18. Fanzago, Federica
    5/8/23, 11:15 AM

    For more than 20 years, INFN has been running a distributed infrastructure (the Tier-1 at Bologna-CNAF and 9 Tier-2 centers) which currently offers about 140000 CPU cores, 120 PB of enterprise-level disk space and 100 PB of tape storage, serving more than 40 international scientific collaborations.
    This Grid-based infrastructure was augmented in 2019 with the INFN Cloud: a production quality...

  19. Singh, Garima (Princeton University (US)/CERN)
    5/8/23, 11:15 AM
    Track 6 - Physics Analysis Tools
    Oral

    With the growing datasets of current and next-generation High-Energy and Nuclear Physics (HEP/NP) experiments, statistical analysis has become more computationally demanding. These increasing demands elicit improvements and modernizations in existing statistical analysis software. One way to address these issues is to improve parameter estimation performance and numeric stability using...

  20. Dr Al-Turany, Mohammad (GSI)
    5/8/23, 11:15 AM
    Track 7 - Facilities and Virtualization
    Oral

    New particle/nuclear physics experiments require a massive amount of computing power that is only achieved by using high performance clusters directly connected to the data acquisition systems and integrated into the online systems of the experiments. However, integrating an HPC cluster into the online system of an experiment means: Managing and synchronizing thousands of processes that handle...

  21. Stewart, Graeme (CERN)
    5/8/23, 11:15 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The evaluation of new computing languages for a large community, like HEP, involves comparison of many aspects of the languages' behaviour, ecosystem and interactions with other languages. In this paper we compare a number of languages using a common, yet non-trivial, HEP algorithm: the tiled $N^2$ clustering algorithm used for jet finding. We compare specifically the algorithm implemented in...

  22. Mr Alnuqaydan, Abdulhakim (University of Kentucky)
    5/8/23, 11:15 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The calculation of particle interaction squared amplitudes is a key step in the calculation of cross sections in high-energy physics. These lengthy calculations are currently done using domain-specific symbolic algebra tools, where the time required for the calculations grows rapidly with the number of final state particles involved. While machine learning has proven to be highly successful in...

  23. Barroso, Vasco (CERN)
    5/8/23, 11:15 AM
    Track 2 - Online Computing
    Oral

    ALICE (A Large Ion Collider Experiment) has undertaken a major upgrade during the LHC Long Shutdown 2. The increase in the detector data rates led to a hundredfold increase in the input raw data, up to 3.5 TB/s. To cope with it, a new common Online and Offline computing system, called O2, has been developed and put in production.

    The O2/FLP system, successor of the ALICE DAQ system,...

  24. Liang, Shixiao (Rice University)
    5/8/23, 11:30 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    The common form of inter-institute particle physics experiment collaborations generates unique needs for member management including paper authorship, shift assignments, subscription to mailing lists and access to third-party applications such as GitHub and Slack. For smaller collaborations, typically no facility for centralized member management is available and these needs are usually manually...

  25. Wolffs, Zef (Nikhef)
    5/8/23, 11:30 AM
    Track 6 - Physics Analysis Tools
    Oral

    RooFit is a toolkit for statistical modeling and fitting, presented first at CHEP2003, and together with RooStats is used for measurements and statistical tests by most experiments in particle physics, particularly the LHC experiments.

    As the LHC program progresses, physics analyses become more ambitious and computationally more demanding, with fits of hundreds of data samples to joint...

  26. Dr Giffels, Manuel (Karlsruher Institut für Technologie (KIT))
    5/8/23, 11:30 AM
    Track 7 - Facilities and Virtualization
    Oral

    PUNCH4NFDI, funded by the German Research Foundation initially for five years, is a diverse consortium of particle, astro-, astroparticle, hadron and nuclear physics embedded in the National Research Data Infrastructure initiative.

    In order to provide seamless and federated access to the huge variety of compute and storage systems provided by the participating communities covering their...

  27. Mr Horzela, Maximilian (Karlsruhe Institute of Technology)
    5/8/23, 11:30 AM
    Track 4 - Distributed Computing
    Oral

    Predicting the performance of various infrastructure design options in complex federated infrastructures with computing sites distributed over a wide area that support a plethora of users and workflows, such as the Worldwide LHC Computing Grid (WLCG), is not trivial. Due to the complexity and size of these infrastructures, it is not feasible to deploy experimental test-beds at large scales...

  28. Heinrich, Lukas (Technical University of Munich)
    5/8/23, 11:30 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The recent advances in Machine Learning and high-dimensional gradient-based optimization have led to increased interest in the question of whether we can use such methods to optimize the design of future detectors for high-level physics objectives. However, this program faces a fundamental obstacle: The quality of a detector design must be judged on the physics inference it enables, but both...

  29. Poreba, Aleksandra (CERN)
    5/8/23, 11:30 AM
    Track 2 - Online Computing
    Oral

    Athena is the software framework used in the ATLAS experiment throughout the data processing path, from the software trigger system through offline event reconstruction to physics analysis. For Run 3 data taking (which started in 2022) the framework has been reimplemented into a multi-threaded framework. In addition to having to be remodelled to work in this new framework, the ATLAS High Level...

  30. Dr Martinelli, Michele (INFN)
    5/8/23, 11:30 AM

    RED-SEA (https://redsea-project.eu/) is a European project funded in the framework of the H2020-JTI-EuroHPC-2019-1 call that started in April 2021. The goal of the project is to evaluate the architectural design of the main elements of the interconnection networks for the next generation of HPC systems supporting hundreds of thousands of computing nodes enabling the Exa-scale for HPC, HPDA and...

  31. Dr Fornari, Federico (INFN-CNAF)
    5/8/23, 11:30 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    INFN-CNAF is one of the Worldwide LHC Computing Grid (WLCG) Tier-1 data centers, providing support in terms of computing, networking, storage resources and services also to a wide variety of scientific collaborations, ranging from physics to bioinformatics and industrial engineering.
    Recently, several collaborations working with our data center have developed computing and data management...

  32. Elmsheuser, Johannes (Brookhaven National Laboratory)
    5/8/23, 11:30 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    With an increased dataset obtained during the Run-3 of the LHC at CERN and the even larger expected increase of the dataset by more than one order of magnitude for the HL-LHC, the ATLAS experiment is reaching the limits of the current data processing model in terms of traditional CPU resources based on x86_64 architectures and an extensive program for software upgrades towards the HL-LHC has...

  33. Krasznahorkay, Attila (CERN)
    5/8/23, 11:30 AM
    Track 3 - Offline Computing
    Oral

    Despite recent advances in optimising the track reconstruction problem for high particle multiplicities in high energy physics experiments, it remains one of the most demanding reconstruction steps with regard to complexity and computing resources. Several attempts have been made in the past to deploy suitable algorithms for track reconstruction on hardware accelerators, often by tailoring the...

  34. Fuhrmann, Patrick (DESY)
    5/8/23, 11:45 AM
    Track 4 - Distributed Computing
    Oral

    InterTwin is an EU-funded project that started on the 1st of September 2022. The project will work with domain experts from different scientific domains in building a technology to support digital twins within scientific research. Digital twins are models for predicting the behaviour and evolution of real-world systems and applications.

    InterTwin will focus on employing machine-learning...

  35. Bocchi, Enrico (CERN)
    5/8/23, 11:45 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The Storage Group in the CERN IT Department operates several Ceph storage clusters with an overall capacity exceeding 100 PB. Ceph is a crucial component of the infrastructure delivering IT services to all the users of the Organization as it provides: i) Block storage for the OpenStack infrastructure, ii) CephFS used as persistent storage by containers (OpenShift and Kubernetes) and as shared...

  36. Erice Cid, Carlos Francisco (Boston University (US))
    5/8/23, 11:45 AM
    Track 3 - Offline Computing
    Oral

    The high luminosity expected from the LHC during Run 3 and, especially, the HL-LHC phase of data taking introduces significant challenges in the CMS event reconstruction chain. The additional computational resources needed to treat this increased quantity of data surpass the expected increase in processing power over the coming years. In order to fit the projected resource envelope, CMS is...

  37. Grandi, Claudio (INFN Bologna)
    5/8/23, 11:45 AM

    ICSC is one of the five Italian National Centres created in the framework of the Next Generation EU funding by the European Commission. The aim of ICSC, designed and approved through 2022 and eventually started in September 2022, is to create the national digital infrastructure for research and innovation, leveraging existing HPC, HTC and Big Data infrastructures evolving towards a cloud...

  38. Mr Fang, Ronglong
    5/8/23, 11:45 AM
    Track 2 - Online Computing
    Oral

    The INDRA-ASTRA project is part of the ongoing R&D on streaming readout and AI/ML at Jefferson Lab. In the interdisciplinary project, nuclear physicists and data scientists work towards a prototype for an autonomous, responsive detector system as a first step towards a fully autonomous experiment. In our presentation, we will present our method for autonomous calibration of DIS experiments...

  39. Mr Alanazi, Yasir (Jefferson Lab)
    5/8/23, 11:45 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    We present a Multi-Module framework based on Conditional Variational Autoencoder (CVAE) to detect anomalies in the High Voltage Converter Modulators (HVCMs), which have historically been a cause of major downtime for the Spallation Neutron Source (SNS) facility. Previous studies used machine learning techniques to predict faults ahead of time in the SNS accelerator using a Single...

  40. Hrivnac, Julius (IJCLab)
    5/8/23, 11:45 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    High Energy Physics software has been a victim of the necessity to choose one implementation language as no really usable multi-language environment existed. Even a co-existence of two languages in the same framework (typically C++ and Python) imposes a heavy burden on the system. The role of different languages was generally limited to well encapsulated domains (like Web applications,...

  41. Moneta, Lorenzo (CERN)
    5/8/23, 11:45 AM
    Track 6 - Physics Analysis Tools
    Oral

    Minuit is a program implementing a function minimisation algorithm written at CERN more than 50 years ago. It is still used by almost all statistical analyses in High Energy Physics to find optimal likelihood and best parameter values. A new version, Minuit2, re-implemented the original algorithm in C++ a few years ago and is provided as a ROOT library or a standalone C++...

  42. Giommi, Luca (INFN Bologna)
    5/8/23, 11:45 AM
    Track 7 - Facilities and Virtualization
    Oral

    Nowadays Machine Learning (ML) techniques are successfully used in many areas of High-Energy Physics (HEP) and will play a significant role also in the upcoming High-Luminosity LHC upgrade foreseen at CERN, when a huge amount of data will be produced by LHC and collected by the experiments, facing challenges at the exascale. To favor the usage of ML in HEP analyses, it would be useful to have...

  43. Reinsvold Hall, Allison (US Naval Academy)
    5/8/23, 11:45 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    We will discuss the training and on-boarding initiatives currently adopted by a range of High Energy Physics (HEP) experiments. On-boarding refers to the process by which new members of a collaboration gain the knowledge and skills needed to become effective members. Fast and efficient on-boarding is increasingly important for HEP experiments as physics analyses and, as a consequence, the...

  44. Negri, Andrea (CERN)
    5/8/23, 12:00 PM
    Track 2 - Online Computing
    Oral

    The ATLAS experiment at CERN is constructing an upgraded system
    for the "High Luminosity LHC", with collisions due to start in
    2029. In order to deliver an order of magnitude more data than
    previous LHC runs, 14 TeV protons will collide with an instantaneous
    luminosity of up to 7.5 x 10^34 cm^-2 s^-1, resulting in much higher pileup and
    data rates than the current experiment was designed to...

  45. Horstmann, Malin (TU Munich / Max Planck Institute for Physics)
    5/8/23, 12:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    Collider physics analyses have historically favored Frequentist statistical methodologies, with some exceptions of Bayesian inference in LHC analyses through use of the Bayesian Analysis Toolkit (BAT). We demonstrate work towards an approach for performing Bayesian inference for LHC physics analyses that builds upon the existing APIs and model building technology of the pyhf and PyMC Python...

  46. Lange, David (Princeton University)
    5/8/23, 12:00 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Building successful multi-national collaborations is challenging. The scientific communities in a range of physical sciences have been learning how to build collaborations that build upon regional capabilities and interests over decades, iteratively with each new generation of large scientific facilities required to advance their scientific knowledge. Much of this effort has naturally focused...

  47. Andrijauskas, Fabio, Schultz, David (IceCube, University of Wisconsin-Madison)
    5/8/23, 12:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    The OSG-operated Open Science Pool is an HTCondor-based virtual cluster that aggregates resources from compute clusters provided by several organizations. A user can submit batch jobs to the OSG-maintained scheduler, and they will eventually run on a combination of supported compute clusters without any further user action. Most of the resources are not owned by, or even dedicated to OSG, so...

  48. Walder, James (RAL, STFC, UKRI)
    5/8/23, 12:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    Data access at the UK Tier-1 facility at RAL is provided through its ECHO storage, serving the requirements for the WLCG and increasing numbers of other HEP and astronomy related communities.
    ECHO is a Ceph-backed erasure-coded object store, currently providing in excess of 40PB of usable space, with frontend access to data provided via XRootD or gridFTP, using the libradosstriper library of...

  49. Evans, Ric (IceCube, University of Wisconsin-Madison)
    5/8/23, 12:00 PM
    Track 4 - Distributed Computing
    Oral

    The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope located at the geographic South Pole. To accurately and promptly reconstruct the arrival direction of candidate neutrino events for Multi-Messenger Astrophysics use cases, IceCube employs Skymap Scanner workflows managed by the SkyDriver service. The Skymap Scanner performs maximum-likelihood tests on individual pixels...

  50. Lin, Meifeng (Brookhaven National Laboratory)
    5/8/23, 12:00 PM

    The upcoming exascale computers in the United States and elsewhere will have diverse node architectures, with or without compute accelerators, making it a challenge to maintain a code base that is performance portable across different systems. As part of the US Exascale Computing Project (ECP), the USQCD collaboration has embarked on a collaborative effort to prepare the lattice QCD software...

  51. Humble, Ryan (Stanford University)
    5/8/23, 12:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Significant advances in utilizing deep learning for anomaly detection have been made in recent years. However, these methods largely assume the existence of a normal training set (i.e., uncontaminated by anomalies), or even a completely labeled training set. In many complex engineering systems, such as particle accelerators, labels are sparse and expensive; in order to perform anomaly...

  52. Calafiura, Paolo (LBNL)
    5/8/23, 12:00 PM
    Track 3 - Offline Computing
    Oral

    Building on the pioneering work of the HEP.TrkX project [1], Exa.TrkX developed geometric learning tracking pipelines that include metric learning and graph networks. These end-to-end pipelines capture the relationships between spacepoint measurements belonging to a particle track. We tested the pipelines on simulated data from HL-LHC tracking detectors [2,5], Liquid Argon TPCs for neutrino...

  53. Diefenthaler, Markus (Jefferson Lab)
    5/8/23, 12:00 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    Software and computing are an integral part of our research. According to the survey for the “Future Trends in Nuclear Physics Computing” workshop in September 2020, students and postdocs spent 80% of their time on the software and computing aspects of their research. For the Electron-Ion Collider, we are looking for ways to make software (and computing) "easier" to use. All scientists of all...

  54. Baldwin, Zachary (Carnegie Mellon University)
    5/8/23, 12:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    Many current analyses in nuclear and particle physics search for signals that are encompassed by irreducible background events. These background events, entirely surrounding a signal of interest, would lead to inaccurate results when extracting physical observables from the data, due to the inability to reduce the signal to background ratio using any type of selection criteria. By...

  55. Koch, David (LMU)
    5/8/23, 12:15 PM
    Track 4 - Distributed Computing
    Oral

    A fast turn-around time and ease of use are important factors for systems supporting the analysis of large HEP data samples. We study and compare multiple technical approaches.
    This presentation will be about setting up and benchmarking the Analysis Grand Challenge (AGC) [1] using CMS Open Data. The AGC is an effort to provide a realistic physics analysis with the intent of showcasing the...

  56. Mr Caffy, Cedric (CERN)
    5/8/23, 12:15 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    EOS has been the main storage system at CERN for more than a decade, continuously improving in order to meet the ever evolving requirements of the LHC experiments and the whole physics user community. In order to satisfy the demands of LHC Run-3, in terms of storage performance and tradeoff between cost and capacity, EOS was enhanced with a set of new functionalities and features that we will...

  57. Dr Leight, William (University of Massachusetts, Amherst)
    5/8/23, 12:15 PM
    Track 3 - Offline Computing
    Oral

    The production of simulated datasets for use by physics analyses consumes a large fraction of ATLAS computing resources, a problem that will only get worse as increases in the instantaneous luminosity provided by the LHC lead to more collisions per bunch crossing (pile-up). One of the more resource-intensive steps in the Monte Carlo production is reconstructing the tracks in the ATLAS Inner...

  58. Prof. Kisel, Ivan (Goethe University Frankfurt am Main)
    5/8/23, 12:15 PM
    Track 2 - Online Computing
    Oral

    The fast algorithms for data reconstruction and analysis of the FLES (First Level Event Selection) package of the CBM (FAIR/GSI) experiment were successfully adapted to work on the High Level Trigger (HLT) of the STAR (BNL) experiment online. For this purpose, a so-called express data stream was created on the HLT, which enabled full processing and analysis of the experimental data in real...

  59. Hays, Jonathan (Queen Mary University of London)
    5/8/23, 12:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The MoEDAL experiment at CERN (https://home.cern/science/experiments/moedal-mapp) carries out searches for highly ionising exotic particles such as magnetic monopoles. One of the technologies deployed in this task is the Nuclear Track Detector (NTD). In the form of plastic films, these are passive detectors that are low cost and easy to handle. After exposure to the LHC collision environment...

  60. Dr Lin, Tao (IHEP)
    5/8/23, 12:15 PM

    Opticks is an open source project that accelerates optical photon simulation by
    integrating NVIDIA GPU ray tracing, accessed via the NVIDIA OptiX 7 API, with
    Geant4 toolkit based simulations. A single NVIDIA Turing architecture GPU has
    been measured to provide optical photon simulation speedup factors exceeding
    1500 times single threaded Geant4 with a full JUNO analytic GPU...

    Go to contribution page
  61. Dr Pellegrino, Carmelo (INFN-CNAF)
    5/8/23, 12:15 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    The Italian WLCG Tier-1 located in Bologna and managed by INFN-CNAF has a long tradition in supporting several research communities in the fields of High-Energy Physics, Astroparticle Physics, Gravitational Waves, Nuclear Physics and others, to which it provides computing resources in the form of batch computing (HPC, HTC and Cloud) and storage. Although the LHC experiments at CERN represent...

    Go to contribution page
  62. Malik, Sudhir (University of Puerto Rico Mayaguez)
    5/8/23, 12:15 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The HSF/IRIS-HEP Software Training group provides software training skills to new researchers in High Energy Physics (HEP) and related communities. These skills are essential to produce high-quality and sustainable software needed to do the research. Given the thousands of users in the community, sustainability, though challenging, is the centerpiece of its approach. The training modules are...

    Go to contribution page
  63. Balcas, Justas (California Institute of Technology)
    5/8/23, 2:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The Large Hadron Collider (LHC) experiments distribute data by leveraging a diverse array of National Research and Education Networks (NRENs), where experiment data management systems treat networks as a “black box” resource. After the High Luminosity upgrade, the Compact Muon Solenoid (CMS) experiment alone will produce roughly 0.5 exabytes of data per year. NRENs are a critical part...

    Go to contribution page
  64. Wenzel, Hans (Fermilab)
    5/8/23, 2:00 PM

    CaTS is a Geant4 advanced example that has been part of Geant4[1] since version 11.0. It demonstrates the use of Opticks[2] to offload the simulation of optical photons to GPUs. Opticks interfaces with the Geant4 toolkit to collect all the necessary information to generate and trace optical photons, re-implements the optical physics processes to be run on the GPU, and automatically translates the...

    Go to contribution page
  65. Alves, Bruno (LLR, École Polytechnique, Paris)
    5/8/23, 2:00 PM
    Track 2 - Online Computing
    Oral

    The CMS collaboration has chosen a novel high granularity calorimeter (HGCAL) for the endcap regions as part of its planned upgrade for the high luminosity LHC. The calorimeter will have fine segmentation in both the transverse and longitudinal directions and will be the first such calorimeter specifically optimised for particle flow reconstruction to operate at a colliding-beam experiment....

    Go to contribution page
  66. Dr Phelps, William (Christopher Newport University and Jefferson Lab)
    5/8/23, 2:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    We present results on Deep Learning applied to Amplitude and Partial Wave Analysis (PWA) for spectroscopic analyses. Experiments in spectroscopy often aim to observe strongly-interacting, short-lived particles that decay to multi-particle final states. These particle decays have angular distributions that our deep learning model has been trained to identify. Working with TensorFlow...

    Go to contribution page
  67. D'Onofrio, Adelina (INFN-Roma Tre)
    5/8/23, 2:00 PM
    Track 3 - Offline Computing
    Oral

    IDEA (Innovative Detector for an Electron-positron Accelerator) is an innovative general-purpose detector concept, designed to study electron-positron collisions at future $e^+e^-$ circular colliders (FCC-ee and CEPC).
    The detector will be equipped with a dual read-out calorimeter able to measure separately the hadronic component and the electromagnetic component of the showers initiated...

    Go to contribution page
  68. Dr Chowdhury, Barnali
    5/8/23, 2:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    The Deep Underground Neutrino Experiment (DUNE) has historically represented data using a combination of custom data formats and those based on ROOT I/O. Recently, DUNE has begun using the Hierarchical Data Format (HDF5) for some of its data storage applications. HDF5 provides high-performance, low-overhead I/O in DUNE’s data acquisition (DAQ) environment. DUNE will use HDF5 to record raw...

    Go to contribution page
  69. Mainetti, Gabriele (CC-IN2P3/CNRS)
    5/8/23, 2:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    The Vera C. Rubin Observatory is preparing for the execution of the most ambitious astronomical survey ever attempted, the Legacy Survey of Space and Time (LSST). Currently in its final phase of construction in the Andes mountains in Chile and due to start operations in late 2024 for 10 years, its 8.4-meter telescope will nightly scan the southern sky and collect images of the entire visible...

    Go to contribution page
  70. Molina, Roméo (IJCLab)
    5/8/23, 2:00 PM
    Track 3 - Offline Computing
    Oral

    The AGATA project (1) aims at building a 4π gamma-ray spectrometer consisting of 180 germanium crystals, each crystal being divided into 36 segments. Each gamma ray produces an electrical signal within several neighbouring segments, which is compared with a database of reference signals, making it possible to locate the interaction. This step is called Pulse-Shape Analysis (PSA).

    In the execution...

    Go to contribution page
  71. Zhang, Xiaomei (Institute of High Energy Physics)
    5/8/23, 2:00 PM
    Track 4 - Distributed Computing
    Oral

    The Jiangmen Underground Neutrino Observatory (JUNO) is a multipurpose neutrino experiment, and the determination of the neutrino mass hierarchy is its primary physics goal. JUNO is going to take data in 2024 with 2 PB of raw data each year and will use a distributed computing infrastructure for simulation, reconstruction and analysis tasks. The JUNO distributed computing system has been built up based on...

    Go to contribution page
  72. Kauffman, Elliott (Princeton University)
    5/8/23, 2:00 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Machine learning (ML) has become an integral component of high energy physics data analyses and is likely to continue to grow in prevalence. Physicists are incorporating ML into many aspects of analysis, from using boosted decision trees to classify particle jets to using unsupervised learning to search for physics beyond the Standard Model. Since ML methods have become so widespread in...

    Go to contribution page
  73. Shannigrahi, Susmit (Tennessee Tech University)
    5/8/23, 2:15 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    We present an NDN-based Open Storage System (OSS) plugin for XRootD instrumented with an accelerated packet forwarder, built for data access in the CMS and other experiments at the LHC, together with its current status, performance as compared to other tools and applications, and plans for ongoing developments.

    Named Data Networking (NDN) is a leading Future Internet Architecture where data...

    Go to contribution page
  74. Pardi, Silvio (INFN)
    5/8/23, 2:15 PM
    Track 4 - Distributed Computing
    Oral

    The discovery of gravitational waves, first observed in September 2015 following the merger of a binary black hole system, has already revolutionised our understanding of the Universe. This was further enhanced in August 2017, when the coalescence of a binary neutron star system was observed both with gravitational waves and a variety of electromagnetic counterparts; this joint observation...

    Go to contribution page
  75. Mr Joube, Sylvain (IJCLab - LISN, Université Paris-Saclay, CNRS/IN2P3)
    5/8/23, 2:15 PM
    Track 3 - Offline Computing
    Oral

    Track reconstruction, also known as tracking, is a vital part of the HEP event reconstruction process, and one of the largest consumers of computing resources. The upcoming HL-LHC upgrade will exacerbate the need for efficient software able to make good use of the underlying heterogeneous hardware. However, this evolution should not imply the production of code unintelligible to most of its...

    Go to contribution page
  76. Mr Tyson, Richard (Glasgow University)
    5/8/23, 2:15 PM
    Track 2 - Online Computing
    Oral

    Fast, efficient and accurate triggers are a critical requirement for modern high energy physics experiments given the increasingly large quantities of data that they produce. The CEBAF Large Acceptance Spectrometer (CLAS12) employs a highly efficient Level 3 electron trigger to filter the amount of data recorded by requiring at least one electron in each event, at the cost of a low purity in...

    Go to contribution page
  77. Gallén, Axel (Lund University (SE)), Ekman, Alexander (Lund University (SE))
    5/8/23, 2:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    One common issue in vastly different fields of research and industry is the ever-increasing need for more data storage. With experiments taking more complex data at higher rates, the data recorded is quickly outgrowing the storage capabilities. This issue is very prominent in LHC experiments such as ATLAS where in five years the resources needed are expected to be many times larger than the...

    Go to contribution page
  78. Hageboeck, Stephan
    5/8/23, 2:15 PM

    MadGraph5_aMC@NLO is one of the workhorses for Monte Carlo event generation in the LHC experiments and an important consumer of compute resources. The software has been reengineered to maintain the overall look-and-feel of the user interface while achieving very large overall speedups. The computationally intensive part (the calculation of "matrix elements") is offloaded to new implementations...

    Go to contribution page
  79. Giommi, Luca (INFN Bologna)
    5/8/23, 2:15 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    The ML_INFN initiative (“Machine Learning at INFN”) is an effort to foster Machine Learning activities at the Italian National Institute for Nuclear Physics (INFN).

    In recent years, AI-inspired activities have flourished bottom-up in many efforts in Physics, both at the experimental and theoretical level.

    Many researchers have procured desktop-level devices, with consumer-oriented GPUs,...

    Go to contribution page
  80. Farinelli, Riccardo (INFN Ferrara)
    5/8/23, 2:15 PM
    Track 3 - Offline Computing
    Oral

    PARSIFAL (PARametrized SImulation) is a software tool that can reproduce the complete response of both triple-GEM and micro-RWELL based trackers. It accounts for the physical processes involved through simple parametrizations, which makes it very fast. Existing software such as GARFIELD++ is robust and reliable, but very CPU-time-consuming. The implementation of PARSIFAL was driven by the...

    Go to contribution page
  81. Eichlersmith, Tom (University of Minnesota)
    5/8/23, 2:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    ROOT's TTree data structure has been highly successful and useful for HEP; nevertheless, alternative file formats now exist which may offer broader software tool support and more stable in-memory interfacing. We present a data serialization library that produces a similar data structure within the HDF5 data format, supporting C++ standard collections, user-defined data types, and schema...

    Go to contribution page
  82. Dr Delgado Peris, Antonio (CIEMAT - PIC), Dr Pérez-Calero Yzquierdo, Antonio (CIEMAT - PIC)
    5/8/23, 2:15 PM
    Track 7 - Facilities and Virtualization
    Oral

    The increasingly large data volumes that the LHC experiments will accumulate in the coming years, especially in the High-Luminosity LHC era, call for a paradigm shift in the way experimental datasets are accessed and analyzed. The current model, based on data reduction on the Grid infrastructure, followed by interactive data analysis of manageable-size samples on the physicists’ individual...

    Go to contribution page
  83. Clemencic, Marco (CERN EP-LBC)
    5/8/23, 2:30 PM
    Track 3 - Offline Computing
    Oral

    The LHCb software stack is developed in C++ and uses the Gaudi framework for event processing and DD4hep for the detector description. Numerical computations are done either directly in the C++ code or by an evaluator used to process the expressions embedded in the XML describing the detector geometry.
    The current system relies on conventions for the physical units used (identical to what...

    Go to contribution page
  84. Wettersten, Zenny (CERN)
    5/8/23, 2:30 PM

    An important area of HEP studies at the LHC currently concerns the need for more extensive and precise comparison data. Important tools in this realm are event reweighting and the evaluation of more precise next-to-leading order (NLO) physics processes via Monte Carlo (MC) event generators, especially in the context of the upcoming High Luminosity LHC phase. Current event generators need to...

    Go to contribution page
  85. Yang, Ryan (Choate Rosemary Hall)
    5/8/23, 2:30 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    There is increasing demand for the efficiency and flexibility of data transport systems supporting data-intensive sciences. With growing data volume, it is essential that the transport system of a data-intensive science project fully utilize all available transport resources (e.g., network bandwidth); to achieve statistical multiplexing gain, there is an increasing trend that multiple projects...

    Go to contribution page
  86. Prof. Vilasís-Cardona, Xavier (La Salle URL)
    5/8/23, 2:30 PM
    Track 2 - Online Computing
    Oral

    Long-lived particles (LLPs) are very challenging to search for with current detectors and computing requirements, due to their very displaced vertices. This study evaluates the ability of the trigger algorithms used in the Large Hadron Collider beauty (LHCb) experiment to detect long-lived particles and attempts to adapt them to enhance the sensitivity of this experiment to undiscovered...

    Go to contribution page
  87. Li, Teng
    5/8/23, 2:30 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The Super Tau Charm Facility (STCF) proposed in China is a new-generation electron–positron collider with center-of-mass energies covering 2-7 GeV. In STCF, the discrimination of high momentum hadrons is a challenging and critical task for various physics studies. In recent years, machine learning methods have gradually become one of the mainstream methods in the PID field of high energy...

    Go to contribution page
  88. Blomer, Jakob (CERN)
    5/8/23, 2:30 PM
    Track 6 - Physics Analysis Tools
    Oral

    The RNTuple I/O subsystem is ROOT's future event data file format and access API. It is driven by the expected data volume increase at upcoming HEP experiments, e.g. at the HL-LHC, and recent opportunities in the storage hardware and software landscape such as NVMe drives and distributed object stores. RNTuple is a redesign of the TTree binary format and API and has been shown to deliver...

    Go to contribution page
  89. Gonzalez, Hugo (CERN)
    5/8/23, 2:30 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Over the last few years, Cloud Sync&Share platforms have become go-to services for collaboration in scientific, academic and research environments, providing users with coherent and simple ways to access their data assets. Collaboration within those platforms, between local users on local applications, has been demonstrated in various settings, with visible improvements in the research...

    Go to contribution page
  90. Dr Faucci Giannelli, Michele (INFN Roma Tor Vergata)
    5/8/23, 2:30 PM
    Track 3 - Offline Computing
    Oral

    AtlFast3 is the new, high-precision fast simulation in ATLAS that was deployed by the collaboration to replace AtlFastII, the fast simulation tool that was successfully used for most of Run 2. AtlFast3 combines a parametrization-based Fast Calorimeter Simulation and a new machine-learning-based Fast Calorimeter Simulation based on Generative Adversarial Networks (GANs). The new fast simulation...

    Go to contribution page
  91. Rind, Ofer (Brookhaven National Laboratory)
    5/8/23, 2:30 PM
    Track 7 - Facilities and Virtualization
    Oral

    Prior to the start of the LHC Run 3, the US ATLAS Software and Computing operations program established three shared Tier 3 Analysis Facilities (AFs). The newest AF was established at the University of Chicago in the past year, joining the existing AFs at Brookhaven National Lab and SLAC National Accelerator Lab. In this paper, we will describe both the common and unique aspects of these three...

    Go to contribution page
  92. Legger, Federica (INFN Torino)
    5/8/23, 2:30 PM
    Track 4 - Distributed Computing
    Oral

    The LIGO, VIRGO and KAGRA Gravitational-wave (GW) observatories are getting ready for their fourth observational period, O4, scheduled to begin in March 2023, with improved sensitivities and thus higher event rates.
    GW-related computing has both large commonalities with HEP computing, particularly in the domain of offline data processing and analysis, and important differences, for example in...

    Go to contribution page
  93. Nielsen, Lars Holm (CERN)
    5/8/23, 2:45 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Over the past 10 years, Zenodo has grown from a proof of concept into the world's largest general-purpose research repository, cementing CERN’s image as a pioneer and leader in Open Science. We will review key challenges faced over the past 10 years and how we overcame them, from getting off the ground, through building trust, to securing funding.

    Growing Zenodo was an enriching and...

    Go to contribution page
  94. Dengra, Carlos (CIEMAT/PIC)
    5/8/23, 2:45 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    In 2029 the LHC will start the High-Luminosity LHC (HL-LHC) program, with a boost in the integrated luminosity resulting in an unprecedented amount of experimental and simulated data samples to be transferred, processed and stored on disk and tape systems across the Worldwide LHC Computing Grid (WLCG). Content delivery network (CDN) solutions are being explored with the aim of improving...

    Go to contribution page
  95. Mrs De Geus, Florine (University of Amsterdam)
    5/8/23, 2:45 PM
    Track 6 - Physics Analysis Tools
    Oral

    After more than two decades of using ROOT TTree, and with more than an exabyte of compressed data stored, advances in technology have motivated a complete redesign, RNTuple, that breaks backward compatibility to take better advantage of modern storage options. The RNTuple I/O subsystem has been designed to address performance bottlenecks and shortcomings of ROOT's current state-of-the-art TTree I/O...

    Go to contribution page
  96. Dr Meagher, Kevin (University of Wisconsin–Madison)
    5/8/23, 2:45 PM

    The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope
    located at the Geographic South Pole. For every observed neutrino event,
    there are over 10^6 background events caused by cosmic-ray air shower
    muons. In order to properly separate signal from background, it is
    necessary to produce Monte Carlo simulations of these air showers.
    Although, to date, IceCube has...

    Go to contribution page
  97. Kabus, Maja (CERN / Warsaw University of Technology)
    5/8/23, 2:45 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The main focus of the ALICE experiment, quark-gluon plasma measurements, requires
    accurate particle identification (PID). The ALICE detectors allow identifying particles over a broad momentum interval ranging from about 100 MeV/c up to 20 GeV/c.

    However, hand-crafted selections and the Bayesian method do not perform well in the
    regions where the particle signals overlap. Moreover, an ML...

    Go to contribution page
  98. Ju, Xiangyang (Lawrence Berkeley National Laboratory)
    5/8/23, 2:45 PM
    Track 3 - Offline Computing
    Oral

    Applying graph-based techniques, and graph neural networks (GNNs) in particular, has been shown to be a promising solution to the high-occupancy track reconstruction problems posed by the upcoming HL-LHC era. Simulations of this environment present noisy, heterogeneous and ambiguous data, which previous GNN-based algorithms for ATLAS ITk track reconstruction could not handle natively. We...

    Go to contribution page
  99. Summers, Sioni (CERN)
    5/8/23, 2:45 PM
    Track 2 - Online Computing
    Oral

    The Phase-2 Upgrade of the CMS Level-1 Trigger will reconstruct particles using the Particle Flow algorithm, connecting information from the tracker, muon, and calorimeter detectors, and enabling fine-grained reconstruction of high level physics objects like jets. We have developed a jet reconstruction algorithm using a cone centred on an energetic seed from these Particle Flow candidates. The...

    Go to contribution page
  100. Dr Yi, Difan (University of Chinese Academy of Sciences)
    5/8/23, 2:45 PM
    Track 3 - Offline Computing
    Oral

    The Large Field Low-energy X-ray Polarization Detector (LPD) is a gas photoelectric effect polarization detector designed for the detailed study of X-ray transient sources in high-energy astrophysics. Previous studies have shown that gamma-ray bursts (GRBs) generally have a low degree of polarization or are unpolarized. Considering the spatial background and other interferences, we need high...

    Go to contribution page
  101. Gutsche, Oliver (Fermi National Accelerator Laboratory)
    5/8/23, 2:45 PM
    Track 4 - Distributed Computing
    Oral

    The HL-LHC run is anticipated to start at the end of this decade and will pose a significant challenge for the scale of the HEP software and computing infrastructure. The mission of the U.S. CMS Software & Computing Operations Program is to develop and operate the software and computing resources necessary to process CMS data expeditiously and to enable U.S. physicists to fully participate in...

    Go to contribution page
  102. Lawrence, John (University of Notre Dame)
    5/8/23, 2:45 PM
    Track 7 - Facilities and Virtualization
    Oral

    Effective analysis computing requires rapid turnaround times in order to enable frequent iteration, adjustment, and exploration, leading to discovery. Reducing 10 TB of experimental data in about ten minutes using campus-scale computing infrastructure is an achievable informal goal, considering raw hardware capability alone. However, compared to production computing, which seeks to...

    Go to contribution page
  103. Prim, Markus
    5/8/23, 3:00 PM
    Track 3 - Offline Computing
    Oral

    The Belle II experiment has been accumulating data since 2019 at the SuperKEKB $e^+e^-$ accelerator in Tsukuba, Japan. The accelerator operates at the $\Upsilon(4S)$ resonance and is an excellent laboratory for precision flavor measurements and dark sector searches. The accumulated data are promptly reconstructed and calibrated at a dedicated calibration center in an automated process based on...

    Go to contribution page
  104. Johnson, Seth (Oak Ridge National Laboratory)
    5/8/23, 3:00 PM

    Celeritas is a new Monte Carlo detector simulation code designed for computationally intensive applications (specifically, HL-LHC simulation) on high-performance heterogeneous architectures. In the past two years Celeritas has advanced from prototyping a simple, GPU-based, single-physics-model infinite medium to implementing a full set of electromagnetic physics processes in complex...

    Go to contribution page
  105. Dr Mazzitelli, Giovanni (Laboratori Nazionali di Frascati)
    5/8/23, 3:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    The INFN Cloud project was launched at the beginning of 2020, aiming to build a distributed Cloud infrastructure and provide advanced services for the INFN scientific communities. A Platform as a Service (PaaS) was created inside INFN Cloud that allows the experiments to develop and access resources as a Software as a Service (SaaS), and CYGNO is the beta-tester of this system. The aim of the...

    Go to contribution page
  106. Vaselli, Francesco (Scuola Normale Superiore & INFN Pisa)
    5/8/23, 3:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Analyses in HEP experiments often rely on large MC simulated datasets. These datasets are usually produced with full-simulation approaches based on Geant4 or exploiting parametric “fast” simulations introducing approximations and reducing the computational cost. With our work we created a prototype version of a new “fast” simulation that we named “flashsim” targeting analysis level data tiers...

    Go to contribution page
  107. McKee, Shawn (University of Michigan Physics)
    5/8/23, 3:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The High-Energy Physics (HEP) and Worldwide LHC Computing Grid (WLCG) communities have faced significant challenges in understanding their global network flows across the world’s research and education (R&E) networks. When critical links, such as transatlantic or transpacific connections, experience high traffic or saturation, it is very challenging to clearly identify which collaborations...

    Go to contribution page
  108. Canal, Philippe
    5/8/23, 3:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    Analysis performance has a significant impact on the productivity of physicists. The vast majority of analyses use ROOT (https://root.cern). For a few years now, ROOT has offered an analysis interface called RDataFrame, which helps get the best performance for analyses, ideally making them I/O limited, i.e. with their performance limited by the throughput of reading the input data.

    The...

    Go to contribution page
  109. South, David
    5/8/23, 3:00 PM
    Track 4 - Distributed Computing
    Oral

    The computing challenges at HL-LHC require fundamental changes to the distributed computing models that have served experiments well throughout LHC. ATLAS planning for HL-LHC computing started back in 2020 with a Conceptual Design Report outlining various challenges to explore. This was followed in 2022 by a roadmap defining concrete milestones and associated effort required. Today, ATLAS is...

    Go to contribution page
  110. De Souza Pereira, Jomar Junior (Universidade Federal do Rio De Janeiro, COPPE/EE/IF, Brazil)
    5/8/23, 3:00 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    A Large Ion Collider Experiment (ALICE), one of the four large experiments at the European Organization for Nuclear Research (CERN), is responsible for studying the physics of strongly interacting matter and the quark-gluon plasma.
    In order to ensure the full success of ALICE operation and data taking during the Large Hadron Collider Runs 3 and 4, a list of tasks identified as Service...

    Go to contribution page
  111. Barbetti, Matteo (INFN Firenze)
    5/8/23, 3:00 PM
    Track 3 - Offline Computing
    Oral

    Detailed detector simulation is the major consumer of CPU resources at LHCb, having used more than 80% of the total computing budget during Run 2 of the Large Hadron Collider at CERN. As data is collected by the upgraded LHCb detector during Run 3 of the LHC, larger requests for simulated data samples are necessary, and will far exceed the pledged resources of the experiment, even with...

    Go to contribution page
  112. Summers, Sioni (CERN)
    5/8/23, 3:00 PM
    Track 2 - Online Computing
    Oral

    The CMS experiment has greatly benefited from the utilization of the particle-flow (PF) algorithm for the offline reconstruction of the data. The Phase II upgrade of the CMS detector for the High Luminosity upgrade of the LHC (HL-LHC) includes the introduction of tracking in the Level-1 trigger, thus offering the possibility of developing a simplified PF algorithm in the Level-1 trigger. We...

    Go to contribution page
  113. Guiraud, Enrico (EP-SFT, CERN)
    5/8/23, 3:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    RDataFrame is ROOT's high-level interface for Python and C++ data analysis. Since it first became available, RDataFrame adoption has grown steadily and it is now poised to be a major component of analysis software pipelines for LHC Run 3 and beyond. Thanks to its design inspired by declarative programming principles, RDataFrame enables the development of high-performance, highly parallel...

    Go to contribution page
  114. Lawrence, David (Jefferson Lab)
    5/8/23, 3:15 PM
    Track 3 - Offline Computing
    Oral

    Development of the EIC project detector "ePIC" is now well underway, and this includes the "single software stack" used for simulation and reconstruction. The stack combines several non-experiment-specific packages including ACTS, DD4hep, JANA2, and PODIO. The software stack aims to be forward-looking in the era of AI/ML and heterogeneous hardware. A formal decision-making process was...

    Go to contribution page
  115. Faucci Giannelli, Michele (INFN Roma Tor Vergata and Università di Roma Tor Vergata)
    5/8/23, 3:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    AtlFast3 is the new ATLAS fast simulation that exploits a wide range of ML techniques to achieve high-precision fast simulation. The latest version of AtlFast3, used in Run 3, deploys FastCaloGANV2, which consists of 500 Generative Adversarial Networks used to simulate the showers of all particles in the ATLAS calorimeter system. The Muon Punch Through tool has also been completely rewritten...

    Go to contribution page
  116. Klimentov, Alexei (Brookhaven National Laboratory)
    5/8/23, 3:15 PM
    Track 4 - Distributed Computing
    Oral

    In this talk, we discuss the evolution of the computing model of the ATLAS experiment at the LHC. After LHC Run 1, it became obvious that the available computing resources at the WLCG were fully used. The processing queue could reach millions of jobs during peak loads, for example before major scientific conferences and during large scale data processing. The unprecedented performance of the...

    Go to contribution page
  117. Mr Gaede, Frank (DESY)
    5/8/23, 3:15 PM
    Poster
    Oral

    Modern high energy physics experiments fundamentally rely on accurate simulation, both to characterise detectors and to connect observed signals to underlying theory. Traditional simulation tools rely on Monte Carlo methods which, while powerful, consume significant computational resources. These computing pressures are projected to become a major bottleneck at the high luminosity...

    Go to contribution page
  118. Sciabà, Andrea (CERN)
    5/8/23, 3:15 PM
    Track 7 - Facilities and Virtualization
    Oral

    The recent evolution of the analysis frameworks and physics data formats of the LHC experiments provides the opportunity to use central analysis facilities with a strong focus on interactivity and short turnaround times, to complement the more common distributed analysis on the Grid. In order to plan for such facilities, it is essential to know in detail the performance of the combination of...

  119. Dr Sauti, Godfrey (NASA)
    5/8/23, 3:15 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The capture and curation of all primary instrument data is a potentially valuable source of added insight into experiments or diagnostics in laboratory experiments. The data can, when properly curated, enable analysis beyond the current practice that uses just a subset of the as-measured data. Complete curated data can also be input for machine learning and other data exploration tools....

  120. Bhattacharya, Meghna (Fermilab)
    5/8/23, 3:15 PM
    Track 2 - Online Computing
    Oral

    The current and future programs for accelerator-based neutrino imaging detectors feature the use of Liquid Argon Time Projection Chambers (LArTPC) as the fundamental detection technology. These detectors combine high-resolution imaging and precision calorimetry to enable the study of neutrino interactions with unparalleled capabilities. However, the volume of data from LArTPCs will exceed 25...

  121. Feickert, Matthew (University of Wisconsin-Madison)
    5/8/23, 3:15 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    In November 2022, the HEP Software Foundation (HSF) and the Institute for Research and Innovation for Software in High-Energy Physics (IRIS-HEP) organized a workshop on the topic of “Software Citation and Recognition in HEP”. The goal of the workshop was to bring together different types of stakeholders whose roles relate to software citation and the associated credit it provides, in order to...

  122. Cranmer, Kyle (University of Wisconsin-Madison)
    5/8/23, 4:00 PM
    Plenary

    Instead of focusing on the concrete challenges of incremental changes to HEP driven by AI/ML, it is perhaps a useful exercise to think through more radical, speculative changes. What might be enabled if we embraced a dramatically different approach? What would we lose? How would those changes impact the computational, organizational, and epistemological nature of the field?

  123. Pedro, Kevin (Fermilab)
    5/8/23, 4:30 PM
    Plenary

    Simulation is a critical component of high energy physics research, with a corresponding computing footprint. Generative AI has emerged as a promising complement to intensive full simulation with relatively high accuracy compared to existing classical fast simulation alternatives. Such algorithms are naturally suited to acceleration on coprocessors, potentially running fast enough to match the...

  124. Little, Jared (LAPP/CNRS)
    5/8/23, 5:00 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Plenary

    A [Dark Matter Science Project][1] is being developed in the context of the [ESCAPE project][2] as a collaboration between scientists in European Research Infrastructures and experiments seeking to explain the nature of dark matter (such as HL-LHC, KM3NeT, CTA, DarkSide).
    The goal of this ESCAPE Science Project is to highlight the synergies between different dark matter communities and...

  125. Lamanna, Giovanni (CNRS-LAPP)
    5/8/23, 5:30 PM
    Track 10 - Exascale Science
    Plenary

    The EU-funded ESCAPE project has brought together the ESFRI and other world class Research Infrastructures in High Energy and Nuclear Physics, Astro-Particle Physics, and Astronomy. In the 3 years of the project many synergistic and collaborative aspects have been highlighted and explored, from pure technical collaboration on common solutions for data management, AAI, and workflows, through...

  126. Gazzarrini, Elena (CERN)
    5/9/23, 9:00 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Plenary

    One of the objectives of the EOSC (European Open Science Cloud) Future Project is to integrate diverse analysis workflows from Cosmology, Astrophysics and High Energy Physics in a common framework. The project’s development relies on the implementation of the Virtual Research Environment (VRE), a prototype platform supporting the goals of Dark Matter and Extreme Universe Science Projects in...

  127. Guok, Chin (ESnet)
    5/9/23, 9:30 AM

    The Energy Sciences Network (ESnet) is the high performance network of the US Department of Energy Office of Science. Over its 36-year span, ESnet has evolved to meet the requirements of ever changing scientific workflows. This presentation will provide a brief history of ESnet's generational changes and highlight the capabilities of its current generation network ESnet6. This presentation...

  128. Shadura, Oksana (University Nebraska-Lincoln (US))
    5/9/23, 10:00 AM
    Track 7 - Facilities and Virtualization
    Plenary

    The large data volumes expected from the High Luminosity LHC (HL-LHC) present challenges to existing paradigms and facilities for end-user data analysis. Modern cyberinfrastructure tools provide a diverse set of services that can be composed into a system that provides physicists with powerful tools that give them straightforward access to large computing resources, with low barriers to entry....

  129. Dr Wu, Linghui (Institute of High Energy Physics)
    5/9/23, 11:00 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Future e+e- colliders are crucial to extend the search for new phenomena possibly related to the open questions that the Standard Model presently does not explain. Among the major physics programs, the flavor physics program requires particle identification (PID) performances well beyond that of most detectors designed for the current generation. The cluster counting, which measures the number...

  130. Joosten, Sylvester (Argonne National Laboratory)
    5/9/23, 11:00 AM
    Track 3 - Offline Computing
    Oral

    The EPIC collaboration at the Electron-Ion Collider recently laid the groundwork for its software infrastructure. Large parts of the software ecosystem for EPIC mirror the setup from the Key4hep project, for example DD4hep for geometry description, and EDM4hep/PODIO for the data model. However, other parts of the EPIC software ecosystem diverge from Key4hep, for example for the event...

  131. Eich, Niclas (RWTH Aachen University)
    5/9/23, 11:00 AM
    Track 6 - Physics Analysis Tools
    Oral

    The usage of Deep Neural Networks (DNNs) as multi-classifiers is widespread in modern HEP analyses. In standard categorisation methods, the high-dimensional output of the DNN is often reduced to a one-dimensional distribution by exclusively passing the information about the highest class score to the statistical inference method. Correlations to other classes are hereby omitted.
    Moreover, in...

  132. Mr Zaytsev, Alexandr (Brookhaven National Laboratory (BNL))
    5/9/23, 11:00 AM
    Track 7 - Facilities and Virtualization
    Oral

    Computational science, data management and analysis have been key factors in the success of Brookhaven National Laboratory's scientific programs at the Relativistic Heavy Ion Collider (RHIC), the National Synchrotron Light Source (NSLS-II), the Center for Functional Nanomaterials (CFN), and in biological, atmospheric, and energy systems science, Lattice Quantum Chromodynamics (LQCD) and...

  133. Rajula, Vineet Reddy
    5/9/23, 11:00 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    CERN hosts more than 1200 websites essential for the mission of the Organization, internal and external collaboration and communication as well as public outreach. The complexity and scale of CERN’s online presence is very diverse with some websites, like https://home.cern/, accommodating more than one million unique visitors in a day.

    However, regardless of their diversity, all...

  134. Leggett, Charles (Lawrence Berkeley National Laboratory)
    5/9/23, 11:00 AM

    High energy physics is facing serious challenges in the coming decades due to the projected shortfall of CPU and storage resources compared to our anticipated budgets. In the past, HEP has not made extensive use of HPCs; however, the U.S. has had a long-term investment in HPCs, and they are the platform of choice for many simulation workloads, and more recently, data processing for projects such as...

  135. Amerl, Maximilian
    5/9/23, 11:00 AM
    Track 2 - Online Computing
    Oral

    The ATLAS jet trigger is instrumental in selecting events both for Standard Model measurements and Beyond the Standard Model physics searches. Non-standard triggering strategies, such as saving only a small fraction of trigger objects for each event, avoid bandwidth limitations and increase sensitivity to low-mass and low-momentum objects. These events are used by Trigger Level Analyses,...

  136. Gooding, Jamie (Dortmund)
    5/9/23, 11:00 AM

    Synergies between MAchine learning, Real-Time analysis and Hybrid architectures for efficient Event Processing and decision making (SMARTHEP) is a European Training Network with the aim of training a new generation of Early Stage Researchers to advance real-time decision-making, effectively leading to data-collection and analysis becoming synonymous.

    SMARTHEP...

  137. Paschos, Pascal (University of Chicago)
    5/9/23, 11:00 AM
    Track 4 - Distributed Computing
    Oral

    We present a collection of tools and processes that facilitate onboarding a new science collaboration onto the OSG Fabric of Services. Such collaborations typically rely on computational workflows for simulations and analysis that are ideal for executing on OSG's distributed High Throughput Computing environment (dHTC). The produced output can be accumulated and aggregated at available...

  138. Yang, Wei (SLAC National Accelerator Laboratory)
    5/9/23, 11:00 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The Xrootd S3 Gateway is a universal high performance proxy service that can be used to access S3 portals using existing HEP credentials (e.g. JSON Web Tokens and x509). This eliminates one of the biggest roadblocks to using public cloud storage resources. This paper describes how the S3 Gateway leverages existing HEP software (e.g. Davix and XRootD) to provide a familiar scalable service that...

  139. Ju, Xiangyang (Lawrence Berkeley National Laboratory)
    5/9/23, 11:15 AM
    Track 6 - Physics Analysis Tools
    Oral

    The search for the dimuon decay of the Standard Model (SM) Higgs boson looks for a tiny peak on top of a smoothly falling SM background in the dimuon invariant mass spectrum 𝑚(𝜇𝜇). Due to the very small signal-to-background ratio, which is at the level of 0.2% in the region 𝑚(𝜇𝜇) = 120–130 GeV for an inclusive selection, an accurate determination of the background is of paramount importance....

  140. Dr Dal Pra, Stefano (INFN-CNAF)
    5/9/23, 11:15 AM

    The INFN-CNAF Tier-1 located in Bologna (Italy) is a center of the WLCG e-Infrastructure that provides computing power to the four major LHC collaborations and also supports the computing needs of about fifty more groups, including groups from non-HEP research domains. The CNAF Tier1 center has historically put considerable effort into the integration of computing resources, proposing and prototyping...

  141. Matev, Rosen
    5/9/23, 11:15 AM
    Track 2 - Online Computing
    Oral

    The LHCb experiment started taking data with an upgraded detector in the Run 3 of the LHC, reading at 30 MHz full detector data with a software-only trigger system. In this context, live data monitoring is crucial to ensure that the quality of the recorded data is optimal. Data from the experiment control system, as well as raw detector data and the output of the software trigger, is used as...

  142. Bockelman, Brian (Morgridge Institute for Research)
    5/9/23, 11:15 AM
    Track 4 - Distributed Computing
    Oral

    There is no lack of approaches for managing the deployment of distributed services – in the last 15 years of running distributed infrastructure, the OSG Consortium has seen many of them. One persistent problem has been that each physical site has its own style of configuration management and service operations, leading to a partitioning of the staff knowledge and inflexibility in migrating services...

  143. Owen, Alex (Queen Mary University of London)
    5/9/23, 11:15 AM
    Track 7 - Facilities and Virtualization
    Oral

    Moving towards Net-Zero requires robust information to enable good decision making at all levels: covering hardware procurement, workload management and operations, as well as higher level aspects encompassing grant funding processes and policy framework development.

    The IRISCAST project is a proof-of-concept study funded as part of the UKRI Net-Zero Scoping Project. We have performed an...

  144. Dr Lin, Tao (IHEP)
    5/9/23, 11:15 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The Jiangmen Underground Neutrino Observatory (JUNO), under construction in South China, primarily aims to determine the neutrino mass hierarchy and precisely measure the oscillation parameters. Data taking is expected to start in 2024 and to run for more than 20 years. The development of JUNO offline software (JUNOSW) started in 2012, and it is quite challenging to maintain the JUNOSW...

  145. Li, Teng (Shandong University)
    5/9/23, 11:15 AM
    Track 3 - Offline Computing
    Oral

    The reconstruction of charged particles’ trajectories is one of the most complex and CPU-consuming event processing chains in high energy physics (HEP) experiments. Meanwhile, the precision of track reconstruction has direct and significant impact on vertex reconstruction, physics flavour tagging and particle identification, and eventually on physics precision, in particular for HEP experiments...

  146. Sim, Alex (Lawrence Berkeley National Laboratory)
    5/9/23, 11:15 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    There has been a significant increase in data volume from various large scientific projects, including the Large Hadron Collider (LHC) experiment. The High Energy Physics (HEP) community requires increased data volume on the network, as the community expects to produce almost thirty times the annual data volume between 2018 and 2028 [1]. To mitigate the repetitive data access issue and network...

  147. Drielsma, Francois (SLAC)
    5/9/23, 11:15 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Recent inroads in Computer Vision (CV), enabled by Machine Learning (ML), have motivated a new approach to the analysis of particle imaging detector data. Unlike previous efforts which tackled isolated CV tasks, this paper introduces an end-to-end, ML-based data reconstruction chain for Liquid Argon Time Projection Chambers (LArTPCs), the state-of-the-art in precision imaging at the intensity...

  148. Bocchi, Enrico (CERN)
    5/9/23, 11:15 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    In this contribution we describe the 2022 reboot of the ScienceBox project, the containerised SWAN/CERNBox/EOS demonstrator package for CERN storage and analysis services. We evolved the original implementation to make use of Helm charts across the entire dependency stack. Charts have become the de-facto standard for application distribution and deployment in managed clusters (e.g.,...

  149. Promberger, Laura (CERN)
    5/9/23, 11:30 AM
    Track 4 - Distributed Computing
    Oral

    The CernVM File System (CVMFS) provides the software distribution backbone for High Energy and Nuclear Physics experiments and many other scientific communities in the form of a globally available shared software area. It has been designed for the software distribution problem of experiment software for LHC Runs 1 and 2. For LHC Run 3 and even more so for HL-LHC (Runs 4-6), the complexity of...

  150. Schneide, Christiane (DESY)
    5/9/23, 11:30 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    In the frame of the German NFDI (National Research Data Infrastructure), 27 consortia across all domains of science have by now been set up in order to enhance the FAIR usage and re-usage of scientific data. The consortium PUNCH4NFDI, composed of the German particle, astroparticle, hadron & nuclear, and astrophysics communities, has been approved for initially 5 years of significant...

  151. Mr Petrucci, Andrea (University of California, San Diego, San Diego, California, USA)
    5/9/23, 11:30 AM
    Track 2 - Online Computing
    Oral

    The CMS Online Monitoring System (OMS) aggregates and integrates different sources of information into a central place and allows users to view, compare and correlate information. It displays real-time and historical information.
    The tool is heavily used by run coordinators, trigger experts and shift crews, to achieve optimal trigger and efficient data taking. It provides aggregated...

  152. Dr Gessinger, Paul (CERN)
    5/9/23, 11:30 AM
    Track 3 - Offline Computing
    Oral

    ACTS is an experiment-independent toolkit for track reconstruction, which is designed from the ground up for thread-safety and high performance. It is built to accommodate different experiment deployment scenarios, and also serves as a community platform for research and development of new approaches and algorithms.

    The Event Data Model (EDM) is a critical piece of the tracking library that...

  153. Mieskolainen, Mikael (Imperial College London)
    5/9/23, 11:30 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    I will introduce a new neural algorithm -- HyperTrack, designed for exponentially demanding combinatorial inverse problems of high energy physics final state reconstruction and high-level analysis at the LHC and beyond. Many of these problems can be formulated as clustering on a graph resulting in a hypergraph. The algorithm is based on a machine learned geometric-dynamical input graph...

  154. Undrus, Alexander (BNL)
    5/9/23, 11:30 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The ATLAS Continuous Integration (CI) System is the major component of the ATLAS software development infrastructure, synchronizing efforts of several hundred software developers working around the world and around the clock. Powered by 700 fast processors, it is based on the ATLAS GitLab code management service and Jenkins CI server and performs daily up to 100 ATLAS software builds probing...

  155. Ms Lazzari Miotto, Giovanna (UFRGS (BR))
    5/9/23, 11:30 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    Current and future distributed HENP data analysis infrastructures rely increasingly on object stores in addition to regular remote file systems. Such file-less storage systems are popular as a means to escape the inherent scalability limits of the POSIX file system API. Cloud storage is already dominated by S3-like object stores, and HPC sites are starting to take advantage of object stores...

  156. Dr Luitz, Steffen (SLAC National Accelerator Laboratory)
    5/9/23, 11:30 AM
    Track 7 - Facilities and Virtualization
    Oral

    LUX-ZEPLIN (LZ) is a direct detection dark matter experiment currently operating at the Sanford Underground Research Facility (SURF) in Lead, South Dakota. The core component is a liquid xenon time projection chamber with an active mass of 7 tonnes.
    To meet the performance, availability, and security requirements for the LZ DAQ, Online, Slow Control and data transfer systems located at SURF,...

  157. Grosso, Gaia (Universita e INFN, Padova (IT))
    5/9/23, 11:30 AM
    Track 6 - Physics Analysis Tools
    Oral

    We present New Physics Learning Machine (NPLM), a machine learning-based strategy to detect data departures from a Reference model, with no prior bias on the source of discrepancy. The main idea behind the method is to approximate the optimal log-likelihood-ratio hypothesis test parametrising the data distribution with a universal approximating function, and solving its maximum-likelihood fit...

  158. Bashyal, Amit (Argonne National Lab)
    5/9/23, 11:30 AM

    The computing and storage requirements of the energy and intensity frontiers will grow significantly during the Run 4 & 5 and the HL-LHC era. Similarly, in the intensity frontier, with larger trigger readouts during supernovae explosions, the Deep Underground Neutrino Experiment (DUNE) will have unique computing challenges that could be addressed by the use of parallel and accelerated...

  159. Dr Boehler, Michael (Freiburg University)
    5/9/23, 11:45 AM
    Track 4 - Distributed Computing
    Oral

    The increasing computational demand in High Energy Physics (HEP) as well as increasing concerns about energy efficiency in high performance/throughput computing are driving forces in the search for more efficient ways to utilize available resources. Since avoiding idle resources is key in achieving high efficiency, an appropriate measure is sharing of idle resources of under-utilized sites...

  160. Jeske, Torri (JLAB)
    5/9/23, 11:45 AM
    Track 2 - Online Computing
    Oral

    Hydra is a system which utilizes computer vision to monitor data quality in near real time. Currently, it is deployed in all of Jefferson Lab’s experimental halls and lightens the load on shift takers by autonomously monitoring diagnostic plots in near real time. Hydra is constructed from “off-the-shelf” technologies and is backed up by a full MySQL database. To aid with both labeling and...

  161. Lohezic, Victor (Irfu, CEA Saclay - Université Paris-Saclay)
    5/9/23, 11:45 AM
    Track 6 - Physics Analysis Tools
    Oral

    Data-driven methods are widely used to overcome shortcomings of Monte Carlo (MC) simulations (lack of statistics, mismodeling of processes, etc.) in experimental High Energy Physics. A precise description of background processes is crucial to reach the optimal sensitivity for a measurement. However, the selection of the control region used to describe the background process in a region of...

  162. Chudoba, Jiri
    5/9/23, 11:45 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Planned EOSC-CZ projects will significantly improve data management in many scientific fields in the Czech Republic. Several calls for projects are under preparation according to the implementation architecture document created in 2021. Emerging National data infrastructure will build basic infrastructure with significant storage capacity for long term archive of scientific data and their...

  163. Whitehouse, Dan (Imperial College)
    5/9/23, 11:45 AM
    Track 7 - Facilities and Virtualization
    Oral

    Recent years have seen an increasing interest in the environmental impact, especially the carbon footprint, generated by the often large scale computing facilities used by the communities represented at CHEP. As this is a fairly new requirement, this information is not always readily available, especially at universities and similar institutions which do not necessarily see large scale...

  164. Barbone, Marco (Imperial College London)
    5/9/23, 11:45 AM

    Random number generation is key to many applications in a wide variety of disciplines. Depending on the application, the quality of the random numbers from a particular generator can directly impact both computational performance and critically the outcome of the calculation.

    High-energy physics applications use Monte Carlo simulations and machine learning widely, which both require...

  165. Tsang, Patrick (SLAC)
    5/9/23, 11:45 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Modern neutrino experiments employ hundreds to tens of thousands of photon detectors to detect scintillation photons produced from the energy deposition of charged particles. A traditional approach of modeling individual photon propagation as a look-up table requires high computational resources, and therefore it is not scalable for future experiments with multi-kiloton target volume.

    We...

  166. Wilken, Timo (CERN)
    5/9/23, 11:45 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The ALICE experiment at CERN uses a cluster consisting of virtual and bare-metal machines to build and test proposed changes to the ALICE Online-Offline (O2) software in addition to building and publishing regular software releases.

    Nomad is a free and open-source job scheduler for containerised and non-containerised applications developed by Hashicorp. It is integrated into an...

  167. Snyder, Scott (Brookhaven National Laboratory)
    5/9/23, 11:45 AM
    Track 3 - Offline Computing
    Oral

    For Run 3, ATLAS redesigned its offline software, Athena, so that the main workflows run completely multithreaded. The resulting substantial reduction in the overall memory requirements allows for better use of machines with many cores. This talk will discuss the performance achieved by the multithreaded reconstruction as well as the process of migrating the large ATLAS code base and...

  168. Wu, Kesheng (Lawrence Berkeley National Laboratory)
    5/9/23, 11:45 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    At Brookhaven National Lab, the dCache storage management system is used as a disk cache for large high-energy physics (HEP) datasets primarily from the ATLAS experiment[1]. Storage space on dCache is considerably smaller than the full ATLAS data collection. Therefore, a policy is needed to determine what data files to keep in the cache and what files to evict. A good policy is to keep...

  169. Smith, Nick (Fermilab)
    5/9/23, 12:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    In this talk, we present a novel data format design that obviates the need for data tiers by storing individual event data products in column objects. The objects are stored and retrieved through Ceph S3 technology, and a companion metadata system handles tracking of the object lifecycle. Performance benchmarks of data storage and retrieval will be presented, along with scaling tests of the...

  170. Weber, Christian (Brookhaven National Laboratory)
    5/9/23, 12:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    Many theories of Beyond Standard Model (BSM) physics feature multiple BSM particles. Generally, these theories live in higher dimensional phase spaces that are spanned by multiple independent BSM parameters such as BSM particle masses, widths, and coupling constants. Fully probing these phase spaces to extract comprehensive exclusion regions in the high dimensional space is challenging....

  171. Couturier, Benjamin (CERN)
    5/9/23, 12:00 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    High Energy Physics experiments at the Large Hadron Collider generate petabytes of data that go through multiple transformations before final analysis and paper publication. Recording the provenance of these data is therefore crucial to maintain the quality of the final results. While the tools are in place within LHCb to keep this information for the common experiment-wide transforms, analysts...

  172. Gyurjyan, Vardan (Jefferson Lab)
    5/9/23, 12:00 PM
    Track 4 - Distributed Computing
    Oral

    The JIRIAF project aims to combine geographically diverse computing facilities into an integrated science infrastructure. This project starts by dynamically evaluating temporarily unallocated or idled compute resources from multiple providers. These resources are integrated to handle additional workloads without affecting local running jobs. This paper describes our approach to launch...

  173. Benelli, Gabriele (Brown University)
    5/9/23, 12:00 PM
    Track 3 - Offline Computing
    Oral

    During the long shutdown between LHC Run 2 and 3, a reprocessing of 2017 and 2018 CMS data with higher granularity data quality monitoring (DQM) harvesting was done. The time granularity of DQM histograms in this dataset is increased by 3 orders of magnitude. In anticipation of deploying this higher granularity DQM harvesting in the ongoing Run 3 data taking, this dataset is used to study the...

  174. Dr Pellegrino, Carmelo (INFN-CNAF)
    5/9/23, 12:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    The INFN Tier1 data center is currently located in the premises of the Physics Department of the University of Bologna, where CNAF is also located. During 2023 it will be moved to the “Tecnopolo”, the new facility for research, innovation, and technological development in the same city area; the same location is also hosting Leonardo, the pre-exascale supercomputing machine managed by CINECA,...

  175. Chappell, Andrew (University of Warwick)
    5/9/23, 12:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The Deep Underground Neutrino Experiment (DUNE) will operate four large-scale Liquid-Argon Time-Projection Chambers (LArTPCs) at the far site in South Dakota, producing high-resolution images of neutrino interactions.

    LArTPCs represent a step-change in neutrino interaction imaging and the resultant images can be highly detailed and complex. Extracting the maximum value from LArTPC hardware...

  176. Konopka, Piotr (CERN)
    5/9/23, 12:00 PM
    Track 2 - Online Computing
    Oral

    ALICE (A Large Ion Collider Experiment) has undertaken a major upgrade during the Long Shutdown 2. The increase in the detector data rates, and in particular the continuous readout of the TPC, led to a hundredfold increase in the input raw data, up to 3.5 TB/s. To cope with it, a new common Online and Offline computing system, called O2, has been developed and put in production.

    The online...

  177. Posada Trobo, Ismael (CERN)
    5/9/23, 12:00 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    GitLab has been running at CERN since 2012. It is a self-service code hosting application based on Git that provides collaboration and code review features, becoming one of the key infrastructures at CERN. It is being widely used at CERN, with more than 17 000 active users, hosting more than 120 000 projects and triggering more than 5 000 jobs per hour.

    In its initial stage, a custom-made...

  178. CHENG, Yaodong (IHEP, CAS)
    5/9/23, 12:00 PM

    Large-scale high-energy physics experiments generate petabytes or even exabytes of scientific data, and high-performance data IO is required during their processing. However, computing and storage devices are often separated in large computing centers, and large-scale data transmission has become a bottleneck for some data-intensive computing tasks, such as data encoding and decoding,...

  179. Britton, Thomas (JLab)
    5/9/23, 12:15 PM
    Track 2 - Online Computing
    Oral

    One critical step on the path from data taking to physics analysis is calibration. For many experiments this step is both time consuming and computationally expensive. The AI Experimental Calibration and Control project seeks to address these issues, starting first with the GlueX Central Drift Chamber (CDC). We demonstrate the ability of a Gaussian Process to estimate the gain correction...

  180. Traynor, Daniel (Queen Mary University of London)
    5/9/23, 12:15 PM
    Track 7 - Facilities and Virtualization
    Oral

    Queen Mary University of London (QMUL), as part of the refurbishment of one of its data centres, has installed water-to-water heat pumps to use the heat produced by the computing servers to provide heat for the university via a district heating system. This will enable us to reduce the use of high-carbon-intensity natural gas heating boilers, replacing them with electricity which has a lower...

  181. Neubauer, Mark (University of Illinois)
    5/9/23, 12:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    The matrix element method (MEM) is a powerful technique that can be used for the analysis of particle collider data utilizing an ab initio calculation of the approximate probability density function for a collision event to be due to a physics process of interest. The most serious difficulty with the ME method, which has limited its applicability to searches for beyond-the-SM physics and...

  182. Sfiligoi, Igor
    5/9/23, 12:15 PM

    The IceCube experiment has substantial simulation needs and is in continuous search for the most cost-effective ways to satisfy them. The most CPU-intensive part relies on CORSIKA, a cosmic ray air shower simulation. Historically, IceCube relied exclusively on x86-based CPUs, like Intel Xeon and AMD EPYC, but recently server-class ARM-based CPUs are also becoming available, both on-prem and in...

  183. Lassnig, Mario (CERN)
    5/9/23, 12:15 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    Rucio is a software framework that provides scientific collaborations with the ability to organise, manage and access large volumes of data using customisable policies. The data can be spread across globally distributed locations and across heterogeneous data centres, uniting different storage and network technologies as a single federated entity. Rucio offers advanced features such as...

  184. Hewes, V (University of Cincinnati)
    5/9/23, 12:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The Exa.TrkX team has developed a Graph Neural Network (GNN) for reconstruction of liquid argon time projection chamber (LArTPC) data. We discuss the network architecture, a multi-head attention message passing network that classifies detector hits according to the particle type that produced them. By utilizing a heterogeneous graph structure with independent subgraphs for each 2D plane’s hits...

  185. Lorusso, Marco (University of Bologna)
    5/9/23, 12:15 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    The increasingly pervasive and dominant role of machine learning (ML) and deep learning (DL) techniques in High Energy Physics places challenging requirements on the computing infrastructures on which AI workflows are executed, and creates a demand for training and upskilling new users and/or future developers of such technologies.

    In particular, a growth in the...

  186. Marcon, Caterina (INFN Milan (IT))
    5/9/23, 12:15 PM
    Track 4 - Distributed Computing
    Oral

    The Worldwide LHC Computing Grid (WLCG) is a large-scale collaboration which gathers the computing resources of around 170 computing centres from more than 40 countries. The grid paradigm, unique to the realm of high energy physics, has successfully supported a broad variety of scientific achievements. To fulfil the requirements of new applications and to improve the long-term sustainability...

  187. Brown, David (LBL)
    5/9/23, 2:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    The primary physics goal of the Mu2e experiment requires reconstructing an isolated 105 MeV electron with better than 500 keV/c momentum resolution. Mu2e uses a low-mass straw tube tracker and a CsI crystal calorimeter to reconstruct tracks.
    In this paper, we present the design and performance of a track reconstruction algorithm optimized for Mu2e’s unusual requirements. The algorithm is...

  188. Davis, Michael (CERN), Mr Afonso, Joao (CERN)
    5/9/23, 2:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The goal of the “HTTP REST API for Tape” project is to provide a simple, minimalistic and uniform interface to manage data transfers between Storage Endpoints (SEs) where the source file is on tape. The project is a collaboration between the developers of WLCG storage systems (EOS+CTA, dCache, StoRM) and data transfer clients (gfal2, FTS). For some years, HTTP has been growing in popularity as...

  189. Amado, Jhonatan
    5/9/23, 2:00 PM
    Track 3 - Offline Computing
    Oral

    The CMS Tier-0 service is responsible for the prompt processing and distribution of the data collected by the CMS Experiment. A number of upgrades were implemented during the long shutdown of the Large Hadron Collider, which improved the performance and reliability of the service. We report our experience of the data taking during Run-3 detector commissioning as well as performance of the...

  190. Calafiura, Paolo (LBNL)
    5/9/23, 2:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Significant progress has been made in applying graph neural networks (GNNs) and other geometric ML ideas to the track reconstruction problem. State-of-the-art results are obtained using approaches such as the Exa.TrkX pipeline, which currently applies separate edge construction, classification and segmentation stages. One can also treat the problem as an object condensation task, and cluster...

  191. Dr Sobie, Randall (University of Victoria)
    5/9/23, 2:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    HEPscore is a CPU benchmark, based on HEP applications, that the HEPiX Working Group is proposing as a replacement of the HEPSpec06 benchmark (HS06), which is currently used by the WLCG for procurement, computing resource requests and pledges, accounting and performance studies. At the CHEP 2019 conference, we presented the reasons for building a benchmark for the HEP community that is based...

  192. Cruz, Ana Clara Loureiro (Universidade Federal do Rio De Janeiro)
    5/9/23, 2:00 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The ATLAS experiment involves almost 6000 members from approximately 300 institutes spread all over the globe and more than 100 papers published every year. This dynamic environment brings some challenges such as how to ensure publication deadlines, communication between the groups involved, and the continuity of workflows. The solution found for those challenges was automation, which was...

  193. Chang, Philip (University of Florida)
    5/9/23, 2:00 PM
    Track 2 - Online Computing
    Oral

    The Large Hadron Collider (LHC) will be upgraded to the High-Luminosity LHC, increasing the number of simultaneous proton-proton collisions (pile-up, PU) several-fold. The harsher PU conditions lead to exponentially increasing combinatorics in charged-particle tracking, placing a large demand on the computing resources. The projection of required computing resources exceeds the computing...

  194. Cerati, Giuseppe (Fermilab)
    5/9/23, 2:00 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Among liquid argon time projection chamber (LArTPC) experiments, MicroBooNE is the one that took physics data continuously for the longest time (2015-2021), and it represents the state of the art for reconstruction and analysis with this detector. Recently published analyses include oscillation physics results, searches for anomalies and other BSM signatures, and cross section measurements. LArTPC...

  195. Koraka, Charis Kleio (University of Wisconsin Madison (US))
    5/9/23, 2:00 PM

    The CMS experiment at CERN accelerates several stages of its online reconstruction by making use of GPU resources at its High Level Trigger (HLT) farm for LHC Run 3. Additionally, during the past years, computing resources available to the experiment for performing offline reconstruction, such as Tier-1 and Tier-2 sites, have also started to integrate accelerators into their systems. In order...

  196. Legger, Federica (INFN Torino)
    5/9/23, 2:00 PM
    Track 4 - Distributed Computing
    Oral

    Data taking at the Large Hadron Collider (LHC) at CERN restarted in 2022. The CMS experiment relies on a distributed computing infrastructure based on WLCG (Worldwide LHC Computing Grid) to support the LHC Run 3 physics program. The CMS computing infrastructure is highly heterogeneous and relies on a set of centrally provided services, such as distributed workload management and data...

  197. Lieret, Kilian (Princeton University)
    5/9/23, 2:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Recent work has demonstrated that graph neural networks (GNNs) trained for charged particle tracking can match the performance of traditional algorithms while improving scalability. Most approaches are based on the edge classification paradigm, wherein tracker hits are connected by edges, and a GNN is trained to prune edges, resulting in a collection of connected components representing...
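
    The final step of the edge classification paradigm described above (keep the edges the classifier accepts, then read off connected components as track candidates) can be sketched as follows. This is only an illustration under stated assumptions: the function name `build_track_candidates`, the 0.5 threshold, and the toy edge scores are hypothetical and are not taken from the contribution, and a union-find structure stands in for whatever graph library a real pipeline would use.

```python
from collections import defaultdict


def build_track_candidates(num_hits, edges, scores, threshold=0.5):
    """Group hits into track candidates from classified edges.

    edges  : list of (hit_a, hit_b) index pairs (hypothetical input format)
    scores : per-edge classifier outputs in [0, 1] (assumed, e.g. from a GNN)
    """
    parent = list(range(num_hits))

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Prune edges below the threshold; merge the endpoints of surviving edges.
    for (a, b), score in zip(edges, scores):
        if score >= threshold:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[root_b] = root_a

    # Each connected component of the pruned graph is one track candidate.
    groups = defaultdict(list)
    for hit in range(num_hits):
        groups[find(hit)].append(hit)
    return sorted(groups.values())


# Toy example: six hits on a chain, with one low-score edge between hits 2 and 3.
edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)]
scores = [0.9, 0.8, 0.1, 0.95, 0.7]
candidates = build_track_candidates(6, edges, scores)
print(candidates)
```

    In this toy input the pruned edge splits the chain into two candidates, hits 0 to 2 and hits 3 to 5.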

  198. Krasznahorkay, Attila (CERN)
    5/9/23, 2:15 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    The ATLAS Open Data project aims to deliver open-access resources for education and outreach in High Energy Physics using real data recorded by the ATLAS detector. The Open Data release so far has resulted in the release of a substantial amount of data from 8 TeV and 13 TeV collisions in an easily-accessible format and supported by dedicated software and documentation to allow its fruitful use...

  199. Klimentov, Alexei (Brookhaven National Laboratory)
    5/9/23, 2:15 PM
    Track 4 - Distributed Computing
    Oral

    Monitoring services play a crucial role in the day-to-day operation of distributed computing systems. The ATLAS experiment at LHC uses the production and distributed analysis workload management system (PanDA WMS), which allows a million computational jobs to run daily at over 170 computing centers of the WLCG and other opportunistic resources, utilizing 600k cores simultaneously on average....

  200. Ahn, Sang Un (KISTI)
    5/9/23, 2:15 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    CDS (Custodial Disk Storage), a disk-based custodial storage powered by the CERN EOS storage system, has been operating for the ALICE experiment at the KISTI Tier-1 Centre since November 2021. The CDS replaced existing tape storage operated for almost a decade, after its stable demonstration in the WLCG Tape Challenges in October 2021. We tried to challenge the economy of tape storage in the...

  201. Lazzari, Federico (Pisa University & INFN)
    5/9/23, 2:15 PM
    Track 2 - Online Computing
    Oral

    The LHCb experiment is currently taking data with a completely renewed DAQ system, capable for the first time of performing a full real-time reconstruction of all collision events occurring at LHC point 8.
    The Collaboration is now pursuing a further upgrade (LHCb "Upgrade-II"), to enable the experiment to retain the same capability at luminosities an order of magnitude larger than the maximum...

  202. Aleksandravicius, Gabriel de Aragão (Universidade Federal do Rio De Janeiro)
    5/9/23, 2:15 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    As the largest particle physics laboratory in the world, CERN has more than 17000 collaborators spread around the globe. ATLAS, one of CERN’s experiments, has around 6000 active members and 300 associate institutes, all of which must go through the standard registration and updating procedures within CERN’s HR (Foundation) database. Simultaneously, the ATLAS Glance project, among other...

  203. HUANG, Xingtao (Shandong University)
    5/9/23, 2:15 PM
    Track 3 - Offline Computing
    Oral

    (on behalf of the JUNO Collaboration)

    The Jiangmen Underground Neutrino Observatory (JUNO), under construction in southern China, is a multi-purpose neutrino experiment designed to determine the neutrino mass hierarchy and precisely measure oscillation parameters. Equipped with a 20-kton liquid scintillator central detector viewed by 17,612 20-inch and 25,600 3-inch photomultiplier tubes, JUNO...

  204. Fitzpatrick, Conor (CERN)
    5/9/23, 2:15 PM

    The LHCb experiment uses a triggerless readout system where its first stage (HLT1) is implemented on GPU cards. The full LHC event rate of 30 MHz is reduced to 1 MHz using efficient parallelisation techniques in order to meet throughput requirements. The GPU cards are hosted in the same servers as the FPGA cards receiving the detector data which reduces the network to a minimum. In this talk,...

  205. Dr Ricci, Alessandro Maria (University of Pisa and INFN Pisa)
    5/9/23, 2:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    Among the biggest computational challenges for High Energy Physics (HEP) experiments are the increasingly large datasets being collected, which often require correspondingly complex data analyses. In particular, the PDFs used for modeling the experimental data can have hundreds of free parameters. The optimization of such models involves a significant computational effort and a...

  206. Andrijauskas, Fabio
    5/9/23, 2:15 PM
    Track 7 - Facilities and Virtualization
    Oral

    Creating new materials, discovering new drugs, and simulating systems are essential processes for research and innovation, and require substantial computational power. While many applications can be split into many smaller independent tasks, some cannot and may take hours or weeks to run to completion. To better manage those longer-running jobs, it would be desirable to stop them at any...

  207. Ebert, Marcus (University of Victoria)
    5/9/23, 2:30 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    The BaBar experiment collected electron-positron collisions at the SLAC National Accelerator Laboratory from 1999 to 2008. Although data taking stopped 15 years ago, the collaboration is still actively doing data analyses, publishing results, and giving presentations at international conferences. Special considerations were needed to do analyses using a computing environment that was...

  208. Jia, Xiaoqian (Shandong University, CN)
    5/9/23, 2:30 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Track reconstruction is one of the most important and challenging tasks in the offline data processing of collider experiments. For the BESIII detector working in the tau-charm energy region, considerable effort was previously made to improve the tracking performance with traditional methods, such as template matching and the Hough transform. However, for difficult tracking tasks, such as the...

  209. Ju, Xiangyang (Lawrence Berkeley National Laboratory)
    5/9/23, 2:30 PM
    Track 6 - Physics Analysis Tools
    Oral

    To accurately describe data, tuning the parameters of MC event generators is essential. At first, experts performed tunings manually based on their sense of physics and goodness of fit. The Professor software made tuning more objective by employing polynomial surrogate functions to model the relationship between generator parameters and experimental observables (inner-loop optimization),...
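
    The polynomial-surrogate idea mentioned above can be illustrated with a one-parameter toy. This is a minimal sketch under stated assumptions, not Professor's actual interface: `toy_generator`, the anchor points, the "measured" value and its uncertainty are all made up for illustration, and a plain grid scan stands in for a real minimiser.

```python
def fit_quadratic(xs, ys):
    """Least-squares fit of y = c0 + c1*x + c2*x**2 via the normal equations."""
    # Build the 3x3 normal-equation system A c = b.
    A = [[sum(x ** (i + j) for x in xs) for j in range(3)] for i in range(3)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(3)]
    # Gaussian elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, 3):
            factor = A[row][col] / A[col][col]
            for k in range(col, 3):
                A[row][k] -= factor * A[col][k]
            b[row] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * 3
    for i in (2, 1, 0):
        rest = sum(A[i][j] * coeffs[j] for j in range(i + 1, 3))
        coeffs[i] = (b[i] - rest) / A[i][i]
    return coeffs


def toy_generator(p):
    """Hypothetical stand-in for an expensive MC run: one observable bin."""
    return 2.0 + 0.5 * p - 0.25 * p * p


# Step 1: run the "generator" at a few anchor points in parameter space.
anchors = [0.0, 0.5, 1.0, 1.5, 2.0]
c0, c1, c2 = fit_quadratic(anchors, [toy_generator(p) for p in anchors])


def surrogate(p):
    """Cheap polynomial model of the observable as a function of p."""
    return c0 + c1 * p + c2 * p * p


# Step 2: minimise a chi-square between surrogate and (made-up) data.
measured, sigma = 2.1875, 0.05
grid = [i / 1000 for i in range(1001)]  # scan p in [0, 1]
best_p = min(grid, key=lambda p: ((surrogate(p) - measured) / sigma) ** 2)
print(best_p)
```

    The design point is that the expensive generator is evaluated only at the anchor points; every later evaluation during the fit uses the cheap polynomial.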

  210. Ferreira Brito Filho, Carlos Henrique
    5/9/23, 2:30 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The LHCb experiment is one of the 4 LHC experiments at CERN. With more than 1500 members and tens of thousands of assets, the Collaboration requires systems that allow the extraction of data from many databases according to some very specific criteria. In LHCb there are 4 production web applications responsible for managing members and institutes, tracking assets and their current status,...

  211. Dr Wiebalck, Arne (CERN)
    5/9/23, 2:30 PM
    Track 7 - Facilities and Virtualization
    Oral

    CERN IT has consolidated all life-cycle management of its physical server fleet on the Ironic bare-metal API. From the initial registration upon the first boot, over the inventory checking, the burn-in and the benchmarking for acceptance, the provisioning to the end users and the repairs during its service, up to the retirement at the end of the servers’ life, all stages can be managed within...

  212. Mascheroni, Marco (University of California San Diego)
    5/9/23, 2:30 PM
    Track 3 - Offline Computing
    Oral

    The former CMS Run 2 High Level Trigger (HLT) farm is one of the largest contributors to CMS compute resources, providing about 30k job slots for offline computing. The role of this farm has been evolving, from an opportunistic resource exploited during inter-fill periods in the LHC Run 2, to a nearly transparent extension of the CMS capacity at CERN during LS2 and into the LHC Run 3 started...

  213. Parida, Ganesh (University of Wisconsin Madison)
    5/9/23, 2:30 PM

    The software-based High Level Trigger (HLT) of CMS reduces the data readout rate from 100 kHz (obtained from the Level 1 trigger) to around 2 kHz. It makes use of all detector subsystems and runs a streamlined version of CMS reconstruction. Run-1 and Run-2 of the LHC saw the reconstruction algorithm run on a CPU farm (~30000 CPUs in 2018). But the need to have increased computational power as we...

  214. Storetvedt, Maksim (CERN)
    5/9/23, 2:30 PM
    Track 4 - Distributed Computing
    Oral

    The ALICE experiment at the CERN Large Hadron Collider relies on a massive, distributed Computing Grid for its data processing. The ALICE Computing Grid is built by combining a large number of individual computing sites distributed globally. These Grid sites are maintained by different institutions across the world and contribute thousands of worker nodes possessing different capabilities and...

  215. Dittmeier, Sebastian (Heidelberg University)
    5/9/23, 2:30 PM
    Track 2 - Online Computing
    Oral

    The High-Luminosity LHC (HL-LHC) will provide an order of magnitude increase in integrated luminosity and enhance the discovery reach for new phenomena. The increased pile-up foreseen during the HL-LHC necessitates major upgrades to the ATLAS detector and trigger. The Phase-II trigger will consist of two levels, a hardware-based Level-0 trigger and an Event Filter (EF) with tracking...

  216. Zhao, Xin (Brookhaven National Laboratory (US))
    5/9/23, 2:30 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The High Luminosity upgrade to the LHC (HL-LHC) is expected to deliver scientific data at the multi-exabyte scale. In order to address this unprecedented data storage challenge, the ATLAS experiment launched the Data Carousel project in 2018. Data Carousel is a tape-driven workflow whereby bulk production campaigns with input data resident on tape are executed by staging and promptly...

  217. Taylor, Ryan Paul (University of Victoria)
    5/9/23, 2:45 PM
    Track 7 - Facilities and Virtualization
    Oral

    The University of Victoria (UVic) operates an Infrastructure-as-a-Service science cloud for Canadian researchers, and a WLCG T2 grid site for the ATLAS experiment at CERN. At first, these were two distinctly separate systems, but over time we have taken steps to migrate the T2 grid services to the cloud. This process has been significantly facilitated by basing our approach on Kubernetes, a...

  218. Concas, Matteo (CERN)
    5/9/23, 2:45 PM
    Track 2 - Online Computing
    Oral

    During the LHC Run 3, significant upgrades to many detectors and brand-new reconstruction software allow the ALICE experiment to record Pb-Pb collisions at an interaction rate of 50 kHz in a trigger-less continuous readout mode.

    The key to processing the 1 TB/s peak data rate in ALICE is the use of GPUs. There are two main data processing phases: the synchronous phase, where the TPC...

  219. Dr Simko, Tibor (CERN)
    5/9/23, 2:45 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    In this paper we discuss the CMS open data publishing workflows, summarising experience with eight releases of CMS open data on the CERN Open Data portal since its initial launch in 2014. We present the recent enhancements of data curation procedures, including (i) mining information about collision and simulated datasets with accompanying generation parameters and processing configuration...

  220. Mr Rottler, Benjamin (Freiburg University)
    5/9/23, 2:45 PM
    Track 4 - Distributed Computing
    Oral

    HammerCloud (HC) is a testing service and framework for continuous functional tests, on-demand large-scale stress tests, and performance benchmarks. It checks the computing resources and various components of distributed systems with realistic full-chain experiment workflows.

    The HammerCloud software was initially developed in Python 2. After support for Python 2 was discontinued in 2020,...

  221. Dr Rademakers, Fons (CERN)
    5/9/23, 2:45 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The CERN IT Department is responsible for ensuring the integrity and security of data stored in the IT Storage Services. General storage backends such as EOSHOME/PROJECT/MEDIA and CEPHFS are used to store data for a wide range of use cases for all stakeholders at CERN, including experiment project spaces and user home directories.

    In recent years a backup system, CBACK, was developed based...

  222. Boyer, Alexandre (CERN)
    5/9/23, 2:45 PM

    To better understand the experimental conditions and performance of the Large Hadron Collider (LHC), CERN experiments execute tens of thousands of loosely-coupled Monte Carlo simulation workflows per hour on hundreds of thousands of small to mid-size distributed computing resources federated by the Worldwide LHC Computing Grid (WLCG). While this approach has been reliable during the first LHC...

  223. Li, Teng (Shandong University)
    5/9/23, 2:45 PM
    Track 3 - Offline Computing
    Oral

    The Super Tau Charm Facility (STCF) proposed in China is a new-generation electron–positron collider with center-of-mass energies covering 2-7 GeV and a peak luminosity of 5 × 10^34 cm^-2 s^-1. The offline software of STCF (OSCAR) is developed to support the offline data processing, including detector simulation, reconstruction, calibration as well as physics analysis. To meet STCF’s specific...

  224. Ferreira Brito Filho, Carlos Henrique (UFRJ)
    5/9/23, 2:45 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The Glance project is responsible for over 20 systems across three CERN experiments: ALICE, ATLAS and LHCb. Students, engineers, physicists and technicians have been using systems designed and managed by Glance on a daily basis for over 20 years. In order to produce quality products continuously, considering internal stakeholders' ever-evolving requests, there is a need for standardization....

  225. Gavalian, Gagik (Jefferson Lab)
    5/9/23, 2:45 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Particle track reconstruction is the most computationally intensive process in nuclear physics experiments. Traditional algorithms use a combinatorial approach that exhaustively tests track measurements (hits) to identify those that form an actual particle trajectory. In this article we describe the development of machine learning models that assist the tracking algorithm by identifying...

  226. Dilks, Christopher (Duke University)
    5/9/23, 3:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    Performing a physics analysis of data from simulations of a high energy experiment requires the application of several common procedures, from obtaining and reading the data to producing detailed plots for interpretation. Implementing common procedures in a general analysis framework allows the analyzer to focus on the unique parts of their analysis. Over the past few years, EIC simulations...

  227. Megino, Fernando Harald Barreiro (The University of Texas at Arlington)
    5/9/23, 3:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    The ATLAS experiment at CERN is one of the largest scientific machines built to date and will have ever growing computing needs as the Large Hadron Collider collects an increasingly larger volume of data over the next 20 years. ATLAS is conducting R&D projects on Amazon and Google clouds as complementary resources for distributed computing, focusing on some of the key features of commercial...

  228. Fitzgerald, Dillon (University of Michigan)
    5/9/23, 3:00 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Making the large datasets collected at the LHC accessible to the public is a considerable challenge given the complexity and volume of data. Yet to harness the full scientific potential of the facility, it is essential to enable meaningful access to the data by the broadest physics community possible. Here we present an application, the LHCb Ntuple Wizard, which leverages the existing...

  229. Mr Caillou, Sylvain (L2IT, Toulouse)
    5/9/23, 3:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Data from the LHC detectors are not easily represented using regular data structures. These detectors are comprised of several species of subdetectors and therefore produce heterogeneous data. LHC detectors are granular by design so that nearby particles may be distinguished. As a consequence, LHC data are sparse, in that many detector channels are not active during a given collision event....

  230. Klimentov, Alexei (Brookhaven National Laboratory)
    5/9/23, 3:00 PM
    Track 4 - Distributed Computing
    Oral

    Operational analytics is the direction of research related to the analysis of the current state of computing processes and the prediction of the future in order to anticipate imbalances and take timely measures to stabilize a complex system. There are two relevant areas in ATLAS Distributed Computing that are currently in the focus of studies: end-user physics analysis including the forecast...

  231. Leggett, Charles (Lawrence Berkeley National Laboratory)
    5/9/23, 3:00 PM

    FastCaloSim is a parameterized simulation of the particle energy response and of the energy distribution in the ATLAS calorimeter. It is a relatively small and self-contained package with massive inherent parallelism and captures the essence of GPU offloading via important operations like data transfer, memory initialization, floating point operations, and reduction. Thus, it was identified as...

  232. Valls Canudas, Núria (La Salle URL)
    5/9/23, 3:00 PM
    Track 2 - Online Computing
    Oral

    The LHCb experiment has recently started a new period of data taking after a major upgrade in both software and hardware. One of the biggest challenges has been the migration of the first part of the trigger system (HLT1) into a parallel GPU architecture framework called Allen, which performs a partial reconstruction of most of the LHCb sub-detectors. In Allen, the reconstruction of the...

  233. Raduta, George (CERN)
    5/9/23, 3:00 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The recent major upgrade of the ALICE Experiment at CERN’s Large Hadron Collider has been coupled with the development of a new Online-Offline computing system capable of interacting with a sustained input throughput of 3.5 TB/s. To facilitate the control of the experiment, new web applications have been developed and deployed to be used 24 hours a day, 365 days a year in the control room and...

  234. Kirby, Michael (FNAL)
    5/9/23, 3:00 PM
    Track 3 - Offline Computing
    Oral

    We summarize the status of Deep Underground Neutrino Experiment (DUNE) software and computing development. We describe plans for the computing infrastructure needed to acquire, catalog, reconstruct, simulate and analyze the data from the DUNE experiment and its prototypes in pursuit of the experiment's physics goals of precision measurements of neutrino oscillation parameters, detection of...

  235. Davis, Michael (CERN)
    5/9/23, 3:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The CERN Tape Archive (CTA) was conceived as the successor to CASTOR and as the tape back-end to EOS, designed for the archival storage of data from LHC Run-3 and other experimental programmes at CERN. In the wider WLCG, the tape software landscape is quite heterogeneous, but we are now entering a period of consolidation. This has led to a number of sites in WLCG (and beyond) reevaluating their...

  236. Garg, Rocky (Stanford University), Sokoloff, Michael (University of Cincinnati)
    5/9/23, 3:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    We have been studying the use of deep neural networks (DNNs) to identify and locate primary vertices (PVs) in proton-proton collisions at the LHC. Earlier work focused on finding primary vertices in simulated LHCb data using a hybrid approach that started with kernel density estimators (KDEs) derived from the ensemble of charged track parameters heuristically and predicted “target histogram”...

    Go to contribution page
  237. Bertran Ferrer, Marta (CERN)
    5/9/23, 3:15 PM
    Track 4 - Distributed Computing
    Oral

    For LHC Run3 the ALICE experiment software stack has been completely refactored, incorporating support for multicore job execution. The new multicore jobs spawn multiple processes and threads within the payload. Given that some of the deployed processes may be short-lived, accounting for their resource consumption presents a challenge. This article presents the newly developed methodology for...

    Go to contribution page
  238. Antunes, Carina
    5/9/23, 3:15 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    CERN, like many large organizations, relies on multiple means of communication for different use cases and teams.
    Email and mailing lists are the most popular, but more modern communication systems such as Mattermost and push notifications are gaining traction.
    On one end of the spectrum we have communication teams writing individual emails to users on a daily basis, which may be small targets, or...

    Go to contribution page
  239. Dr Vizcaya Hernandez, Ana Paula (Colorado State University)
    5/9/23, 3:15 PM
    Track 3 - Offline Computing
    Oral

    The Deep Underground Neutrino Experiment (DUNE) is a long-baseline experiment which aims to study neutrino oscillation and astroparticle physics. It will produce vast amounts of metadata, which describe the data coming from the read-out of the primary DUNE detectors. Various databases will make up the overall DB architecture for this metadata. ProtoDUNE at CERN is the largest existing...

    Go to contribution page
  240. Roy, Avik (University of Illinois at Urbana-Champaign)
    5/9/23, 3:15 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Research in high energy physics (HEP) relies heavily on domain-specific digital content. We reflect on the interpretation of the principles of Findability, Accessibility, Interoperability, and Reusability (FAIR) in the preservation and distribution of such digital objects. As a case study, we demonstrate the implementation of an end-to-end support infrastructure for preserving and accessing Universal...

    Go to contribution page
  241. Misawa, Shigeki (Brookhaven National Laboratory), LAURET, Jerome (Brookhaven Science Associates)
    5/9/23, 3:15 PM
    Track 7 - Facilities and Virtualization
    Oral

    An all-inclusive analysis of costs for on-premises and public cloud-based solutions to handle the bulk of HEP computing requirements shows that dedicated on-premises deployments are still the most cost-effective. Since the advent of public cloud services, the HEP community has engaged in multiple proofs of concept to study the technical viability of using cloud resources; however, the...

    Go to contribution page
  242. Melo, Andrew (Vanderbilt University)
    5/9/23, 3:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    Apache Spark is a distributed computing framework which can process very large datasets using large clusters of servers. Laurelin is a Java-based implementation of ROOT I/O which allows Spark to read and write ROOT files from common HEP storage systems without a dependency on the C++ implementation of ROOT. We discuss improvements due to the migration to an Arrow-based in-memory representation...

    Go to contribution page
  243. Dr Kortelainen, Matti (Fermi National Accelerator Laboratory)
    5/9/23, 3:15 PM

    The CMS experiment started to utilize Graphics Processing Units (GPUs) to accelerate the online reconstruction and event selection running on its High Level Trigger (HLT) farm in the 2022 data taking period. The projections of the HLT farm to the High-Luminosity LHC foresee a significant use of compute accelerators in the LHC Run 4 and onwards in order to keep the cost, size, and power budget...

    Go to contribution page
  244. Migliorini, Matteo (Padova University & INFN)
    5/9/23, 3:15 PM
    Track 2 - Online Computing
    Oral

    The sensitivity of modern HEP experiments to New Physics (NP) is limited by the hardware-level triggers used to select data online, resulting in a bias in the data collected. The deployment of efficient data acquisition systems integrated with online processing pipelines is instrumental in increasing the experiments' sensitivity to the discovery of any anomaly or possible signal of NP. In...

    Go to contribution page
  245. spiga, daniele (INFN)
    5/9/23, 4:30 PM
    Track 6 - Physics Analysis Tools
    Oral

    The challenges expected for the HL-LHC era are pushing LHC experiments to re-think their computing models at many levels. The evolution toward solutions that allow an effortless interactive analysis experience is, among others, one of the topics followed closely by the CMS experiment. In this context, ROOT RDataFrame offers a high-level, lazy programming model which makes it a flexible and...

    Go to contribution page
  246. Franz, Maja (Technical University of Applied Sciences Regensburg, Germany)
    5/9/23, 4:30 PM

    Quantum Computing (QC) is a promising early-stage technology that offers novel approaches to simulation and analysis in nuclear and high energy physics (NHEP). By basing computations directly on quantum mechanical phenomena, speedups and other advantages for many computationally hard tasks are potentially achievable, albeit both the theoretical underpinning and the practical realization are...

    Go to contribution page
  247. Crooks, David (UKRI - STFC)
    5/9/23, 4:30 PM
    Track 4 - Distributed Computing
    Oral

    No single organisation has the resources to defend its services alone against most modern malicious actors and so we must protect ourselves as a community. In the face of determined and well-resourced attackers, we must actively collaborate in this effort across HEP and more broadly across Research and Education (R&E).

    Parallel efforts are necessary to appropriately respond to this...

    Go to contribution page
  248. Dr Krawczyk, Rafal (CERN)
    5/9/23, 4:30 PM
    Track 2 - Online Computing
    Oral

    The CMS experiment at CERN incorporates one of the highest throughput Data Acquisition (DAQ) systems in High-Energy Physics. Its network throughput will further increase by over an order of magnitude at the High-Luminosity LHC.
    The current Run 3 CMS Event Builder receives all the fragments of Level-1 trigger accepted events from the front-end electronics (740 data streams from the CMS...

    Go to contribution page
  249. Abudinén, Fernando (University of Warwick)
    5/9/23, 4:30 PM
    Track 3 - Offline Computing
    Oral

    EvtGen is a simulation generator specialized for decays of heavy hadrons. Since its early development in the 1990s, the generator has been used extensively and has become an essential tool for heavy-flavour physics analyses. Throughout this time, its source code has remained mostly unchanged, except for additions of new decay models. In view of the upcoming boom of multi-threaded...

    Go to contribution page
  250. Gray, Lindsey (FNAL)
    5/9/23, 4:30 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The recent release of AwkwardArray 2.0 significantly changes the way that lazy evaluation and task-graph building are handled in columnar analysis. The Dask parallel processing library is now used for these pieces of functionality with AwkwardArray, and this change affords new ways of optimizing columnar analysis and distributing it on clusters. In particular this allows optimization of a task...

    Go to contribution page
  251. Newman, Harvey (California Institute of Technology)
    5/9/23, 4:30 PM
    Track 7 - Facilities and Virtualization
    Oral

    We will present the rapid progress, vision and outlook across multiple state-of-the-art development lines within the Global Network Advancement Group and its Data Intensive Sciences and SENSE/AutoGOLE working groups, which are designed to meet the present and future needs and address the challenges of the Large Hadron Collider and other science programs with global reach. Since it was founded...

    Go to contribution page
  252. Fackeldey, Peter (RWTH Aachen University Physics Institute III A)
    5/9/23, 4:30 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Large research infrastructures, such as DESY and CERN, in the field of the exploration of the universe and matter (ErUM) are significantly driving the digital transformation of the future. The German action plan "ErUM-Data" promotes this transformation through the interdisciplinary networking and financial support of 20,000 scientists.
    The ErUM-Data-Hub (https://erumdatahub.de) serves as a...

    Go to contribution page
  253. Fischer, Benjamin (RWTH Aachen University (DE))
    5/9/23, 4:30 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The development of an LHC physics analysis involves numerous investigations that require the repeated processing of terabytes of measured and simulated data. Thus, a rapid processing turnaround is beneficial to the scientific process. We identified two bottlenecks in analysis-independent algorithms and developed the following solutions.
    First, inputs are now cached on individual SSD caches of...

    Go to contribution page
  254. Rajput, Kishansingh (Thomas Jefferson National Accelerator Facility)
    5/9/23, 4:30 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Machine learning (ML) and deep learning (DL) are powerful tools for modeling complex systems. However, most of the standard models in ML/DL do not provide a measure of confidence or uncertainties associated with their predictions. Further, these models can only be trained on available data. During operation, models may encounter data samples poorly reflected in training data. These data...

    Go to contribution page
  255. Pozo Astigarraga, Eukeni (CERN)
    5/9/23, 4:45 PM
    Track 2 - Online Computing
    Oral

    The ATLAS experiment Data Acquisition (DAQ) system will be extensively upgraded to fully exploit the High-Luminosity LHC (HL-LHC) upgrade, allowing it to record data at unprecedented rates. The detector will be read out at 1 MHz generating over 5 TB/s of data. This design poses significant challenges for the Ethernet-based network as it will be required to transport 20 times more data than...

    Go to contribution page
  256. Panta, Anil (University of Mississippi)
    5/9/23, 4:45 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    Rucio is data management software that has become a de facto standard in the HEP community and beyond. It allows the management of large volumes of data over their full lifecycle. The Belle II experiment located at KEK (Japan) recently moved to Rucio to manage its data over the coming decade (O(10) PB/year). In addition to its data management functionalities, Rucio also provides support for...

    Go to contribution page
  257. Roy, Avik (University of Illinois at Urbana-Champaign)
    5/9/23, 4:45 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    We explore interpretability of deep neural network (DNN) models designed for identifying jets coming from top quark decay in the high energy proton-proton collisions at the Large Hadron Collider (LHC). Using state-of-the-art methods of explainable AI (XAI), we identify which features play the most important roles in identifying the top jets, how and why feature importance varies across...

    Go to contribution page
  258. Ahmad, Adeel (CERN)
    5/9/23, 4:45 PM
    Track 4 - Distributed Computing
    Oral

    In 2022, CERN ran its annual phishing campaign in which 2000 users gave away their passwords (Note: this number is in line with results of campaigns at other organisations). In a real phishing incident this would have meant 2000 compromised accounts... unless they were protected by Two-Factor Authentication (2FA)! In the same year, CERN introduced 2FA for accounts with access to critical...

    Go to contribution page
  259. Krasznahorkay, Attila (CERN)
    5/9/23, 4:45 PM
    Track 3 - Offline Computing
    Oral

    The upgrade of the Large Hadron Collider (LHC) is going well; during the next decade we will face a ten-fold increase in experimental data. The application of state-of-the-art detectors and data acquisition systems requires high-performance simulation support, which is even more demanding in the case of heavy-ion collisions. Our basic aim was to develop a Monte-Carlo simulation code for heavy ion...

    Go to contribution page
  260. Rieger, Marcel (University of Hamburg)
    5/9/23, 4:45 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    In particle physics, workflow management systems are primarily used as tailored solutions in dedicated areas such as Monte Carlo production. However, physicists performing data analyses are usually required to steer their individual, complex workflows manually, frequently involving job submission in several stages and interaction with distributed storage systems by hand. This process is not...

    Go to contribution page
  261. Misa Moreira, Carmen (CERN)
    5/9/23, 4:45 PM
    Track 7 - Facilities and Virtualization
    Oral

    The NOTED (Network Optimised Transfer of Experimental Data) project has successfully demonstrated the ability to dynamically reconfigure network links to increase the effective bandwidth available for FTS-driven transfers between endpoints, such as WLCG sites, by inspecting ongoing data transfers and identifying those that are bandwidth-limited for a long period of time. Recently, the...

    Go to contribution page
  262. Eulisse, Giulio
    5/9/23, 4:45 PM
    Track 6 - Physics Analysis Tools
    Oral

    ALICE, one of the four large experiments at CERN LHC, is a detector for the physics of heavy ions. In a high interaction rate environment, the pile-up of multiple events leads to an environment that requires advanced multidimensional data analysis methods.

    Machine learning (ML) has become very popular in multidimensional data analysis in recent years. Compared to the simple, low-dimensional...

    Go to contribution page
  263. Pasquale, Andrea (Università degli Studi di Milano - INFN Sezione di Milano - Technology Innovation Institute Abu Dhabi)
    5/9/23, 4:45 PM

    Over the last 20 years, thanks to the development of quantum technologies, it has been possible to deploy quantum algorithms and applications that were previously accessible only through simulation on real quantum hardware. The devices currently available are often referred to as noisy intermediate-scale quantum (NISQ) computers, and they require calibration routines in order to obtain...

    Go to contribution page
  264. Liu, Jianli (Institute of High Energy Physics, CAS)
    5/9/23, 4:45 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    XAS (synchrotron X-ray absorption spectroscopy) uses X-ray photon energy as a variable to measure the structure of X-ray absorption coefficient that changes with energy. In spectral experiments, the determination of the composition and structure of unknown samples requires data collection first, and then data processing and analysis. It takes a lot of time and there can be no errors in the...

    Go to contribution page
  265. Rinaldi, Lorenzo
    5/9/23, 5:00 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    The organization of seminars and conferences was strongly influenced by the COVID-19 pandemic.
    In the early period of the pandemic, many events were canceled or held completely online, using video conferencing tools such as Zoom or MS Teams. Later, thanks to large-scale vaccination and immunization, it again became possible to organize large in-person events. Nevertheless, given some local...

    Go to contribution page
  266. Bocci, Andrea (CERN)
    5/9/23, 5:00 PM
    Track 2 - Online Computing
    Oral

    To achieve better computational efficiency and exploit a wider range of computing resources, the CMS software framework (CMSSW) has been extended to offload part of the physics reconstruction to NVIDIA GPUs, while the support for AMD and Intel GPUs is under development. To avoid the need to write, validate and maintain a separate implementation of the reconstruction algorithms for each...

    Go to contribution page
  267. Schreiner, Henry (Princeton University)
    5/9/23, 5:00 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    Data analysis in particle physics is socially distributed: unlike centrally developed and executed reconstruction pipelines, the analysis work performed after Analysis Object Descriptions (AODs) are made and before the final paper review—which includes particle and event selection, systematic error handling, decay chain reconstruction, histogram aggregation, fitting, statistical models, and...

    Go to contribution page
  268. Costanzo, Davide (University of Sheffield (UK))
    5/9/23, 5:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The ATLAS experiment is preparing a major change in the conditions data infrastructure in view of Run 4. In this presentation we will lay out the main motivations for the new design (called CREST for Conditions-REST) and the ongoing changes in the DB architecture, and present the developments for the deployment of the new system. The main goal is to set up a parallel infrastructure for full scale...

    Go to contribution page
  269. Kissel, Ezra (Energy Sciences Network (ESnet))
    5/9/23, 5:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    Data caches of various forms have been widely deployed in the context of commercial and research and education networks, but their common positioning at the Edge limits their utility from a network operator perspective. When deployed outside the network core, providers lack visibility to make decisions or apply traffic engineering based on data access patterns and caching node location.

    As...

    Go to contribution page
  270. Prouve, Claire (Universidade de Santiago de Compostela (ES))
    5/9/23, 5:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The task of identifying B meson flavor at the primary interaction point in the LHCb detector is crucial for measurements of mixing and time-dependent CP violation.

    Flavor tagging is usually done with a small number of expert systems that find important tracks to infer the B flavor from.

    Recent advances show that replacing all of those expert systems with one ML algorithm that considers...

    Go to contribution page
  271. Held, Alexander (University of Wisconsin–Madison)
    5/9/23, 5:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    Realistic environments for prototyping, studying and improving analysis workflows are a crucial element on the way towards user-friendly physics analysis at HL-LHC scale. The IRIS-HEP Analysis Grand Challenge (AGC) provides such an environment. It defines a scalable and modular analysis task that captures relevant workflow aspects, ranging from large-scale data processing and handling of...

    Go to contribution page
  272. Varo, Valle (DESY)
    5/9/23, 5:00 PM

    The Quantum Angle Generator (QAG) constitutes a new quantum machine learning model designed to generate accurate images on current Noisy Intermediate-Scale Quantum (NISQ) devices. Variational quantum circuits constitute the core of the QAG model, and various circuit architectures are evaluated. In combination with the so-called MERA-upsampling architecture, the QAG model achieves excellent...

    Go to contribution page
  273. Gheata, Andrei (CERN)
    5/9/23, 5:00 PM
    Track 3 - Offline Computing
    Oral

    In a context where the HEP community is striving to improve the software to cope with higher data throughput, detector simulation is adapting to benefit from new performance opportunities. Given the complexity of the particle transport modeling, new developments such as adapting to accelerator hardware represent a scalable R&D effort.

    The AdePT and Celeritas projects have already...

    Go to contribution page
  274. Dack, Tom (STFC UKRI)
    5/9/23, 5:00 PM
    Track 4 - Distributed Computing
    Oral

    Since 2017, the Worldwide LHC Computing Grid (WLCG) has been working towards enabling token-based authentication and authorization throughout its entire middleware stack. Following the initial publication of the WLCG v1.0 Token Schema in 2019, work has been done to integrate OAuth2.0 token flows across the Grid middleware. There are many complex challenges to be addressed before the WLCG can...

    Go to contribution page
  275. Gheata, Andrei (CERN)
    5/9/23, 5:15 PM
    Track 3 - Offline Computing
    Oral

    Motivated by the need to have large Monte Carlo data statistics to be able to perform the physics analysis for the coming runs of HEP experiments, particularly for HL-LHC, there are a number of efforts exploring different avenues for speeding up particle transport simulation. In particular, one of the possibilities is to re-implement the simulation code to run efficiently on GPUs. This could...

    Go to contribution page
  276. Suresh, Karthik (University of Regina), Fanelli, Cristiano (William & Mary, Jefferson Lab)
    5/9/23, 5:15 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    A workshop on Artificial Intelligence for the Electron Ion Collider (AI4EIC) was recently held at the College of William & Mary. The workshop covered all active and potential areas of applications of AI/ML for the EIC; it also had a strong outreach and educational component, with different tutorials given by experts in AI and machine learning from national labs, universities, and industry...

    Go to contribution page
  277. CHAN, Wai Yuen
    5/9/23, 5:15 PM

    In the near future, the LHC detector will deliver much more data to be processed. Therefore, new techniques are required to deal with such a large amount of data. Recent studies showed that one of the quantum computing techniques, quantum annealing (QA), can be used to perform the particle tracking with efficiency higher than 90% even in the dense environment. The algorithm starts from...

    Go to contribution page
  278. Grigoras, Costin (CERN)
    5/9/23, 5:15 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The ALICE experiment at CERN has undergone a substantial detector, readout and software upgrade for the LHC Run3. A signature part of the upgrade is the triggerless detector readout, which necessitates a real time lossy data compression from 1.1 TB/s to 100 GB/s performed on a GPU/CPU cluster of 250 nodes. To perform this compression, a significant part of the software, which traditionally is...

    Go to contribution page
  279. Guok, Chin (ESnet)
    5/9/23, 5:15 PM
    Track 7 - Facilities and Virtualization
    Oral

    The Caltech team, in collaboration with network, computer science, and HEP partners at the DOE laboratories and universities, is building intelligent network services ("The Software-defined network for End-to-end Networked Science at Exascale (SENSE) research project") to accelerate scientific discovery.
    The overarching goal of SENSE is to enable National Labs and universities to request and...

    Go to contribution page
  280. Padulano, Vincenzo Eduardo (CERN)
    5/9/23, 5:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    The growing amount of data generated by the LHC requires a shift in how HEP analysis tasks are approached. Efforts to address this computational challenge have led to the rise of a middle-man software layer, a mixture of simple, effective APIs and fast execution engines underneath. Having common, open and reproducible analysis benchmarks proves beneficial in the development of these modern...

    Go to contribution page
  281. Itoh, Ryosuke (KEK)
    5/9/23, 5:15 PM
    Track 2 - Online Computing
    Oral

    The original HLT framework used in the Belle II experiment was formerly upgraded, replacing the old IPC-based ring buffer with the ZeroMQ data transport to overcome the unexpected IPC locking problem. The new framework has been working stably in the beam run so far, but it lacks the capability to recover from a processing fault without stopping the ongoing data taking. In addition, the...

    Go to contribution page
  282. Dr Burr, Christopher (CERN)
    5/9/23, 5:15 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    In the LHCb experiment, a wide variety of Monte Carlo simulated samples need to be produced for the experiment’s physics programme. LHCb has a centralised production system for simulating, reconstructing and processing collision data, which runs on the DIRAC backend on the WLCG.
    To cope with a large set of different types of sample, requests for simulation production are based on a concept of...

    Go to contribution page
  283. Mambelli, Marco (Fermilab)
    5/9/23, 5:15 PM
    Track 4 - Distributed Computing
    Oral

    GlideinWMS is a distributed workload manager that has been used in production for many years to provision resources for experiments like CERN's CMS, many neutrino experiments, and the OSG. Its security model was based mainly on GSI (Grid Security Infrastructure), using X.509 certificate proxies and VOMS (Virtual Organization Membership Service) extensions. Even if other credentials, like ssh...

    Go to contribution page
  284. Liu, Shenghua (University of Notre Dame)
    5/9/23, 5:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    An increasingly frequent challenge faced in HEP data analysis is to characterize the agreement between a prediction that depends on a dozen or more model parameters–such as predictions coming from an effective field theory (EFT) framework–and the observed data. Traditionally, such characterizations take the form of a negative log likelihood (NLL) distribution, which can only be evaluated...

    Go to contribution page
  285. Volkel, Benedikt (CERN)
    5/9/23, 5:30 PM
    Track 3 - Offline Computing
    Oral

    Monte Carlo detector transport codes are one of the backbones in high-energy physics. They simulate the transport of a large variety of different particle types through complex detector geometries based on a multitude of physics models.

    Those simulations are usually configured or tuned through large sets of parameters. Often, tuning the physics accuracy on the one hand and optimising the...

    Go to contribution page
  286. Mascheroni, Marco (University of California San Diego)
    5/9/23, 5:30 PM
    Track 4 - Distributed Computing
    Oral

    The CMS Submission Infrastructure (SI) is the main computing resource provisioning system for CMS workloads. A number of HTCondor pools are employed to manage this infrastructure, which aggregates geographically distributed resources from the WLCG and other providers. Historically, the model of authentication among the diverse components of this infrastructure has relied on the Grid Security...

    Go to contribution page
  287. Sun, Hao-Kai (IHEP, CAS)
    5/9/23, 5:30 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    With the construction and operation of fourth-generation light sources like European Synchrotron Radiation Facility Extremely Brilliant Source (ESRF-EBS), Advanced Photon Source Upgrade (APS-U), Advanced Light Source Upgrade (ALS-U), High Energy Photon Source (HEPS), etc., several advanced biological macromolecule crystallography (MX) beamlines are or will be built and thereby the huge amount...

    Go to contribution page
  288. Lieret, Kilian (Princeton University)
    5/9/23, 5:30 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    To meet the computing challenges of upcoming experiments, software training efforts play an essential role in imparting best practices and popularizing new technologies. Because many of the taught skills are experiment-independent, the HSF/IRIS-HEP training group coordinates between different training initiatives while building a training center that provides students with various training...

    Go to contribution page
  289. Timm, S. (FNAL)
    5/9/23, 5:30 PM

    The Superconducting Quantum Materials and Systems Center (SQMS) and the Computational Science and AI Directorate (CSAID) at Fermi National Accelerator Laboratory and Rigetti Computing have teamed up to define and deliver a standard pathway for quantum computing at Rigetti from HEPCloud. HEPCloud now provides common infrastructure and interfacing for managing connectivity and providing access...

    Go to contribution page
  290. Bhatti, Zubair (New York University)
    5/9/23, 5:30 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Searches for new physics set exclusion limits in parameter spaces of typically up to 2 dimensions. However, the relevant theory parameter space is usually of a higher dimension but only a subspace is covered due to the computing time requirements of signal process simulations. An Active Learning approach is presented to address this limitation. Compared to the usual grid sampling, it reduces...

    Go to contribution page
  291. Lasorak, Pierre (Imperial College London)
    5/9/23, 5:30 PM
    Track 2 - Online Computing
    Oral

    The Deep Underground Neutrino Experiment (DUNE) is a next generation long-baseline neutrino experiment based in the USA which is expected to start taking data in 2029. DUNE aims to precisely measure neutrino oscillation parameters by detecting neutrinos from the LBNF beamline (Fermilab) at the Far Detector, 1300 kilometres away, in South Dakota. The Far Detector will consist of four cryogenic...

    Go to contribution page
  292. Misa Moreira, Carmen (CERN)
    5/9/23, 5:30 PM
    Track 7 - Facilities and Virtualization
    Oral

    A comprehensive analysis of the HEP (High Energy Physics) experiment traffic across LHCONE (Large Hadron Collider Open Network Environment) and other networks is essential for immediate network optimisation (for example by the NOTED project) and highly desirable for long-term network planning. Such an analysis requires two steps: tagging of network packets to indicate the type and owner of...

    Go to contribution page
  293. Mr Jones, Mark (Norfolk State University)
    5/9/23, 5:30 PM
    Track 6 - Physics Analysis Tools
    Oral

    PyPWA is a toolkit designed to fit (regression) parametric models to data and to generate distributions (simulation) according to a given model (function). PyPWA has been written within the Python ecosystem with the goal of performing Amplitude or Partial Wave Analysis (PWA) in nuclear and particle physics experiments. The aim of spectroscopy experiments is often the...

    Go to contribution page
  294. Gerlach, Lino (Brookhaven National Laboratory)
    5/9/23, 5:30 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The HSF Conditions Databases activity is a forum for cross-experiment discussions hoping for as broad a participation as possible. It grew out of the HSF Community White Paper work to study conditions data access, where experts from ATLAS, Belle II, and CMS converged on a common language and proposed a schema that represents best practice. The focus of the HSF work is the most difficult use...

    Go to contribution page
  295. Farkas, Máté Zoltán (CMS)
    5/9/23, 5:45 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The continuous growth in model complexity in high-energy physics (HEP) collider experiments demands increasingly time-consuming model fits. We show first results on the application of conditional invertible networks (cINNs) to this challenge. Specifically, we construct and train a cINN to learn the mapping from signal strength modifiers to observables and its inverse. The resulting network...

    Go to contribution page
  296. Burr, Chris (CERN)
    5/9/23, 5:45 PM
    Track 6 - Physics Analysis Tools
    Oral

    Most analyses in the LHCb experiment start by filtering data and simulation stored on the WLCG. Traditionally this has been achieved by submitting user jobs that each process a small fraction of the total dataset. While this has worked well, it has become increasingly complex as the LHCb datasets have grown and this model requires all analysts to understand the intricacies of the grid. This...

    Go to contribution page
  297. Dykstra, Dave
    5/9/23, 5:45 PM
    Track 7 - Facilities and Virtualization
    Oral

    Apptainer (formerly known as Singularity) has, since its beginning, implemented many of its container features with the assistance of a setuid-root program. It still supports that mode, but as of version 1.1.0 it no longer uses setuid by default. This is feasible because it can now mount squash filesystems, mount ext2/3/4 filesystems, and use overlayfs using unprivileged user namespaces and FUSE. It...

    Go to contribution page
  298. West, Maxwell
    5/9/23, 5:45 PM

    The study of the decays of $B$ mesons is a key component of modern experiments which probe heavy quark mixing and $CP$ violation, and may lead to concrete deviations from the predictions of the Standard Model [1]. Flavour tagging, the process of determining the quark flavour composition of $B$ mesons created in entangled pairs at particle accelerators, is an essential component of this...

    Go to contribution page
  299. Dr Boyer, Alexandre (CERN)
    5/9/23, 5:45 PM
    Track 4 - Distributed Computing
    Oral

    DIRAC is the interware for building and operating large scale distributed computing systems. It is adopted by multiple collaborations from various scientific domains for implementing their computing models.
    DIRAC provides a framework and a rich set of ready-to-use services for Workload, Data and Production Management tasks of small, medium and large scientific communities having different...

    Go to contribution page
  300. Couturier, Benjamin (CERN)
    5/9/23, 5:45 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    Monte Carlo simulations are a key tool for the physics program of High Energy Experiments. Their accuracy and reliability are of the utmost importance. A full suite of verifications is in place for the LHCb Simulation software to ensure the quality of the simulated samples produced.
    In this contribution we will give a short overview of the procedure and the tests in place, that exploits the...

    Go to contribution page
  301. Yarba, Julia (Fermi National Accelerator Laboratory)
    5/9/23, 5:45 PM
    Track 3 - Offline Computing
    Oral

    Geant4, the leading detector simulation toolkit used in High Energy Physics, employs a set of physics models to simulate interactions of particles with matter across a wide range of interaction energies. These models, especially the hadronic ones, rely largely on directly measured cross-sections and inclusive characteristics, and use physically motivated parameters. However, they generally aim...

    Go to contribution page
  302. Dr Fu, Shiyuan (Institute of High Energy Physics, CAS)
    5/9/23, 5:45 PM
    Track 2 - Online Computing
    Oral

    Large-scale research facilities are becoming prevalent in the modern scientific landscape. One of these facilities' primary responsibilities is to make sure that users can process and analyze measurement data for publication. To allow for barrier-less access to those highly complex experiments, almost all beamlines require fast feedback capable of manipulating and visualizing data online to...

    Go to contribution page
  303. Dr Gallas, Elizabeth (University of Oxford)
    5/9/23, 5:45 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The ATLAS EventIndex is a global catalogue of the events collected, processed or generated by the ATLAS experiment. The system was upgraded in advance of LHC Run 3, with a migration of the Run 1 and Run 2 data from HDFS MapFiles to HBase tables with a Phoenix interface. The frameworks for testing functionality and performance of the new system have been developed. There are two types of tests...

    Go to contribution page
  304. DeMuth, David (Valley City State University)
    5/9/23, 5:45 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Deep underground, the removal of rock to fashion three
    soccer-field-sized caverns is underway, as is detector prototyping. In
    2024, the first DUNE far detector will be constructed as a large cryostat,
    instrumented as a traditional tracking calorimeter but in a cold bath of
    xenon-doped liquid argon. An Epic Games Unreal Engine rendered 3D
    simulation of the underground laboratory has...

    Go to contribution page
  305. Eulisse, Giulio, Rohr, David (CERN)
    5/10/23, 9:00 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Plenary

    ALICE has upgraded many of its detectors for LHC Run 3 to operate in continuous readout mode recording Pb-Pb collisions at 50 kHz interaction rate without trigger.
    This results in the need to process data in real time at rates 50 times higher than during Run 2. In order to tackle such a challenge we introduced O2, a new computing system and the associated infrastructure. Designed and...

    Go to contribution page
  306. Lawrence, David (Jefferson Lab)
    5/10/23, 9:40 AM
    Plenary

    Streaming Readout Data Acquisition systems coupled with distributed resources spread over vast geographic distances present new challenges to the next generation of experiments. High bandwidth modern network connectivity opens the possibility to utilize large, general-use, HTC systems that are not necessarily located close to the experiment. Near real-time response rates and workflow...

    Go to contribution page
  307. Deconinck, Wouter (University of Manitoba)
    5/10/23, 10:05 AM

    As nuclear physics collaborations and experiments increase in size, the data management and software practices in this community have changed as well. Large nuclear physics experiments at Brookhaven National Lab (STAR, PHENIX, sPHENIX), at Jefferson Lab (GlueX, CLAS12, MOLLER), and at the Electron-Ion Collider (ePIC) are taking different approaches to data management, building on existing...

    Go to contribution page
  308. Elvira, Daniel (FNAL)
    5/10/23, 10:50 AM
    Plenary

    This presentation will cover the content of the report delivered by the Snowmass Computational Frontier late in 2022. A description of the frontier organization and various preparatory events, including the Seattle Community Summer Study (CSS), will be followed by a discussion on the evolution of computing hardware and the impact of newly established and emerging technologies, including...

    Go to contribution page
  309. Gao, Haiyan (Duke University)
    5/10/23, 11:15 AM
    Plenary

    The U.S. Nuclear Physics community has been conducting long-range planning (LRP) for nuclear science since the late 1970s. The process is known as the Nuclear Science Advisory Committee (NSAC) LRP, with NSAC being an advisory body jointly appointed by the U.S. Department of Energy and the U.S. National Science Foundation. The last NSAC LRP was completed in 2015 and the current NSAC LRP is ongoing...

    Go to contribution page
  310. Ramprakash, Jini (ANL)
    5/10/23, 11:40 AM
    Plenary
  311. Weatherly, Kimberly (William & Mary), Turner, Clayton (NASA Langley), Williams, Aurelia (Norfolk State University), McKinney, Ivan (Unreasonable Kids College), Schultz, A.K. (SVT Robotics)
    5/11/23, 9:00 AM
    Plenary

    Today's students are tomorrow's leaders in science, technology, engineering and math. To ensure the best minds reach their potential tomorrow, it's vital to ensure that students not only experience meaningful STEM learning today, but also have the opportunities and support to pursue careers in a STEM environment that is more welcoming, inclusive and just. This panel will feature expertise from...

    Go to contribution page
  312. Gladney, Larry (Yale)
    5/11/23, 9:45 AM

    Big science is represented by projects like those in particle physics. Big engineering is the application of engineering principles to large-scale projects that have a significant impact on society, like popular use of AI/ML (think ChatGPT and Google Bard). Both big science and big engineering are among the noblest and boldest applications of the human intellect to understanding the universe...

    Go to contribution page
  313. Campana, Simone (CERN)
    5/11/23, 10:15 AM
    Track 4 - Distributed Computing
    Plenary

    The WLCG infrastructure provides the compute power and the storage capacity for the computing needs of the experiments at the Large Hadron Collider (LHC) at CERN. The infrastructure is distributed across over 170 data centers in more than 40 countries. The amount of consumed energy in WLCG to support the scientific program of the LHC experiments and its evolution depends on different factors:...

    Go to contribution page
  314. HRIVNACOVA, Ivana (IJCLab, IN2P3/CNRS)
    5/11/23, 11:15 AM
    Track 3 - Offline Computing
    Oral

    The analysis category was introduced in Geant4 almost ten years ago (in 2014) with the aim of providing users with a lightweight analysis tool, available as part of the Geant4 installation without the need to link to an external analysis package. It helps capture statistical data in the form of histograms and n-tuples and store these in files in four different formats. It was already presented at...

    Go to contribution page
  315. Hu, Biying (Sun Yat-sen University)
    5/11/23, 11:15 AM
    Track 7 - Facilities and Virtualization
    Oral

    High energy physics experiments are pushing forward precision measurements and searches for new physics beyond the Standard Model. There is an urgent need to simulate and generate massive amounts of data to meet the requirements of physics. Making good use of the existing power of supercomputers is one of the most popular topics in high energy physics computing. Taking the BESIII experiment as an illustration, we...

    Go to contribution page
  316. Goodrich, Michael
    5/11/23, 11:15 AM

    To increase the science rate for high data rates/volumes, Thomas Jefferson National Accelerator Facility (JLab) has partnered with Energy Sciences Network (ESnet) to define an edge to data center traffic shaping/steering transport capability featuring data event-aware network shaping and forwarding.

    The keystone of this ESnet JLab FPGA Accelerated Transport (EJFAT) is the joint development...

    Go to contribution page
  317. Poat, Michael (BNL)
    5/11/23, 11:15 AM
    Track 4 - Distributed Computing
    Oral

    The Electron Ion Collider (EIC) collaboration and future experiment is a unique scientific ecosystem within Nuclear Physics, as the experiment starts right off as a cross-collaboration between Brookhaven National Lab (BNL) and Jefferson Lab (JLab). As a result, this multi-lab computing model tries its best to provide services accessible from anywhere by anyone who is part of the collaboration. While...

    Go to contribution page
  318. Ling, Jerry (Harvard University)
    5/11/23, 11:15 AM
    Track 6 - Physics Analysis Tools
    Oral

    We present tools for high-performance analysis written in pure Julia, a just-in-time (JIT) compiled dynamic programming language with a high-level syntax and performance. The packages we present center around UnROOT.jl, a pure Julia ROOT file I/O package that is optimized for speed, lazy reading, flexibility, and thread safety.

    We discuss what affects performance in Julia, the challenges,...

    Go to contribution page
  319. Jahan, Johannes (University of Houston)
    5/11/23, 11:15 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    EPOS 4 is the latest version of the high-energy collision event generator EPOS, released publicly in 2022. It was delivered with improvements in several aspects: the theoretical bases on which it relies, how they are handled technically, and the user interface and data compatibility.

    This last point is especially important, as part of a commitment to provide the widest...

    Go to contribution page
  320. Pareja, Carlos
    5/11/23, 11:15 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Machine learning (ML) has become ubiquitous in high energy physics (HEP) for many tasks, including classification, regression, reconstruction, and simulations. To facilitate development in this area, and to make such research more accessible and reproducible, we require standard, easy-to-access datasets and metrics. To this end, we develop the open source Python JetNet library with easily...

    Go to contribution page
  321. Wadenstein, Mattias (NeIC)
    5/11/23, 11:15 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The Data Lake concept has promised increased value to science and more efficient operations for storage compared to the traditional isolated storage deployments. Building on the established distributed dCache serving as the Nordic Tier-1 storage for LHC data, we have also integrated Tier-2 pledged storage in Slovenia, Sweden, and Switzerland, resulting in a coherent storage space well above...

    Go to contribution page
  322. Le Boulicaut, Elise
    5/11/23, 11:15 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Communicating the science and achievements of the ATLAS Experiment is a core objective of the ATLAS Collaboration. This talk will explore the range of communication strategies adopted in ATLAS communications, with particular focus on how these have been impacted by the COVID-19 pandemic. In particular, an overview of ATLAS’ digital communication platforms will be given – with focus on social...

    Go to contribution page
  323. Rossi, Cristian (INFN Roma1)
    5/11/23, 11:15 AM
    Track 2 - Online Computing
    Oral

    NA62 is a K meson physics experiment based on a decay-in-flight technique, with a multi-level, network-based Trigger and Data Acquisition system (TDAQ). A reorganization of both the beam line and the detector is foreseen in the coming years to complete and extend the physics reach of NA62. One of the challenging aspects of this upgrade is a significant increase (x4) in the event rate...

    Go to contribution page
  324. Fanelli, Cristiano
    5/11/23, 11:30 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The newly formed EPIC Collaboration has recently laid the foundations of its software infrastructure. Notably, several forward-looking aspects of the software are favorable for Artificial Intelligence (AI) and Machine Learning (ML) applications and utilization of heterogeneous resources. EPIC has a unique opportunity to integrate AI/ML from the beginning: the number of AI/ML activities is...

    Go to contribution page
  325. Galewsky, Ben (University of Illinois)
    5/11/23, 11:30 AM
    Track 6 - Physics Analysis Tools
    Oral

    We will describe how ServiceX, an IRIS-HEP project, generates C++ or Python code from user queries and orchestrates thousands of experiment-provided Docker containers to filter and select event data. The source datafiles are identified using Rucio. We will show how the service encapsulates best practice for using Rucio and helps inexperienced analysers get up to speed quickly. The data is...

    Go to contribution page
  326. Dr Sawkey, Daren (Varian Medical Systems)
    5/11/23, 11:30 AM
    Track 3 - Offline Computing
    Oral

    For the new Geant4 series 11.X electromagnetic (EM) physics sub-libraries were revised and reorganized in view of requirements for simulation of Phase-2 LHC experiments. EM physics simulation software takes a significant part of CPU during massive production of Monte Carlo events for LHC experiments. We present recent evolution of Geant4 EM sub-libraries for simulation of gamma, electron, and...

    Go to contribution page
  327. Ungaro, Maurizio (Jefferson Lab)
    5/11/23, 11:30 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    A mechanism to store in databases all the parameters needed to simulate the detector's response to physics interactions is presented. This includes geometry, materials, magnetic field, and electronics.

    GEMC includes a Python API to populate the databases, and the software to run the Monte-Carlo simulation. The engine is written in C++ and uses Geant4 for the passage of particles through...

    Go to contribution page
  328. Panta, Anil (University of Mississippi)
    5/11/23, 11:30 AM
    Track 4 - Distributed Computing
    Oral

    Rucio, the data management software initially developed for ATLAS, has been in use at Belle II since January 2021. After the transition to Rucio, new features and functionality were implemented in Belle II grid tools based on Rucio, to improve the experience of grid users. The container structure in the Rucio File Catalog enabled us to define collections of arbitrary datasets, allowing the...

    Go to contribution page
  329. Dr Josep, Flix (CIEMAT - PIC)
    5/11/23, 11:30 AM
    Track 7 - Facilities and Virtualization
    Oral

    The CMS experiment is working to integrate an increasing number of High Performance Computing (HPC) resources into its distributed computing infrastructure. The case of the Barcelona Supercomputing Center (BSC) is particularly challenging as severe network restrictions prevent the use of CMS standard computing solutions. The CIEMAT CMS group has performed significant work in order to overcome...

    Go to contribution page
  330. Ciangottini, Diego (INFN Perugia)
    5/11/23, 11:30 AM

    Field Programmable Gate Arrays (FPGAs) are playing an increasingly important role in the sampling and data processing industry due to their intrinsically highly parallel architecture, low power consumption, and flexibility to execute custom algorithms. In particular, the use of FPGAs to perform machine learning inference is growing thanks to the development of high-level synthesis...

    Go to contribution page
  331. Sun, Haokai (Institute of High Energy Physics, CAS)
    5/11/23, 11:30 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    China’s High Energy Photon Source (HEPS), the first national high-energy synchrotron radiation light source and soon one of the world’s brightest fourth-generation synchrotron radiation facilities, is under intensive construction in Beijing’s Huairou District and will be completed in 2025.

    To make sure that the huge amount of data collected at HEPS is accurate, available and...

    Go to contribution page
  332. Yacoob, Sahal (University of Cape Town / CERN)
    5/11/23, 11:30 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    The International Particle Physics Outreach Group (IPPOG) is a network of scientists, science educators and communication specialists working across the globe in informal science education and public engagement for particle physics. The primary methodology adopted by IPPOG includes the direct participation of scientists active in current research with education and communication specialists,...

    Go to contribution page
  333. Park, Seokhee (KEK)
    5/11/23, 11:30 AM
    Track 2 - Online Computing
    Oral

    The backend of the Belle II data acquisition system consists of a high-level trigger system, online storage, and an express-reconstruction system for online data processing. The high-level trigger system was updated from the old ring buffer and TCP/IP sockets to the ZeroMQ networking library, and the new system has been operated successfully. However, the online storage and express-reconstruction...

    Go to contribution page
  334. Schreiner, Henry (Princeton University)
    5/11/23, 11:45 AM
    Track 6 - Physics Analysis Tools
    Oral

    Awkward Arrays is a library for performing NumPy-like computations on nested, variable-sized data, enabling array-oriented programming on arbitrary data structures in Python. However, imperative (procedural) solutions can sometimes be easier to write or faster to run. Performant imperative programming requires compilation; JIT-compilation makes it convenient to compile in an interactive Python...

    Go to contribution page
  335. Hufnagel, Dirk (Fermilab)
    5/11/23, 11:45 AM
    Track 7 - Facilities and Virtualization
    Oral

    With advances in the CMS CMSSW framework to support GPUs, culminating in the deployment of GPUs in the Run-3 HLT, CMS is starting to look at integrating GPU resources into CMS Offline Computing as well. At US HPC facilities a number of very large HPC systems with GPUs have either just become available or will become available soon, offering opportunities for CMS if we are able to make use of...

    Go to contribution page
  336. Balcas, Justas (California Institute of Technology)
    5/11/23, 11:45 AM
    Track 4 - Distributed Computing
    Oral

    A critical challenge of performing data transfers or remote reads is to be as fast and efficient as possible while, at the same time, keeping the usage of system resources as low as possible. Ideally, the software that manages these data transfers should be able to organize them so that they can run up to the hardware limits. Significant portions of LHC analysis use the same datasets,...

    Go to contribution page
  337. Moneta, Lorenzo (CERN)
    5/11/23, 11:45 AM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The recent developments in ROOT/TMVA focus on fast machine learning inference, which enables analysts to deploy their machine learning models rapidly on large scale datasets. A new tool, SOFIE, has recently been developed to generate C++ code for the evaluation of deep learning models trained with external tools such as TensorFlow or PyTorch.
    While Python-based deep...

    Go to contribution page
  338. Hernandez, Fabio (IN2P3 / CNRS computing center)
    5/11/23, 11:45 AM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The Vera C. Rubin Observatory is preparing for execution of the most ambitious astronomical survey ever attempted, the Legacy Survey of Space and Time (LSST). Currently in its final phase of construction in the Andes mountains in Chile and due to start operations in late 2024 for 10 years, its 8.4-meter telescope will nightly scan the southern sky and collect images of the entire visible sky...

    Go to contribution page
  339. Savard, Claire (University of Colorado at Boulder)
    5/11/23, 11:45 AM
    Track 2 - Online Computing
    Oral

    The High-Luminosity LHC will open an unprecedented window on the weak-scale nature of the universe, providing high-precision measurements of the standard model as well as searches for new physics beyond the standard model. Such precision measurements and searches require information-rich datasets with a statistical power that matches the high-luminosity provided by the Phase-2 upgrade of the...

    Go to contribution page
  340. McCormack, William (MIT)
    5/11/23, 11:45 AM

    Computing demands for large scientific experiments, such as the CMS experiment at CERN, will increase dramatically in the next decades. To complement the future performance increases of software running on CPUs, explorations of coprocessor usage in data processing hold great potential and interest. We explore the novel approach of Services for Optimized Network Inference on Coprocessors...

    Go to contribution page
  341. Dr Fang, Wenxing (IHEP)
    5/11/23, 11:45 AM
    Track 3 - Offline Computing
    Oral

    The Circular Electron Positron Collider (CEPC) [1] is one of the future experiments aiming to study the Higgs boson’s properties precisely. For this purpose, excellent track reconstruction and particle identification (PID) performance are required. For example, the tracking efficiency should be around 100%, the momentum resolution should be less than 0.1%, and the kaon and pion should have 2 sigma...

    Go to contribution page
  342. Cordero, Danelix (CROEM High School (Departamento de Educación de Puerto Rico))
    5/11/23, 11:45 AM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    Sudhir Malik, Peter Elmer, Adam LaMee, Ken Cecire

    The NSF-funded IRIS-HEP "Training, Education & Outreach" program and QuarkNet are partnering to enable and expand software training for high school teachers, with the goal of tapping, growing and diversifying the talent pipeline from K-12 students for future cyberinfrastructure. IRIS-HEP (https://iris-hep.org/) is a software institute that aims to...

    Go to contribution page
  343. Mazurek, Michał (CERN)
    5/11/23, 11:45 AM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The LHCb software has undergone a major upgrade in view of data taking with higher luminosity in Run 3 of the LHC at CERN.
    The LHCb simulation framework, Gauss, had to be adapted to follow the changes in modern technologies of the underlying experiment core software and to introduce new simulation techniques to cope with the increase of the required amount of simulated data. Additional...

    Go to contribution page
  344. Fu, Shiyuan (IHEP)
    5/11/23, 12:00 PM
    Track 2 - Online Computing
    Oral

    The High Energy Photon Source (HEPS) is expected to generate a huge amount of data, which puts extreme pressure on data I/O in computing tasks. Meanwhile, inefficient data I/O significantly affects computing performance.

    Firstly, the data reading mode and limitations of computing resources are taken into account, and we propose a method for automatic tuning of storage parameters, such as data block...

    Go to contribution page
  345. Barbone, Marco (Imperial College London)
    5/11/23, 12:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Neural Networks (NN) are often trained offline on large datasets and deployed on specialized hardware for inference, with a strict separation between training and inference. However, in many realistic applications the training environment differs from the real world or data arrive in a streaming fashion and are continuously changing. In these scenarios, the ability to continuously train and...

    Go to contribution page
  346. Ms Suiu, Alice Florenta
    5/11/23, 12:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    ALICE is one of the four large experiments at the CERN LHC designed to study the structure and origins of matter in collisions of heavy ions (and protons) at ultra-relativistic energies. The experiment measures the particles produced as a result of collisions in its center so that it can reconstruct and study the evolution of the system produced during these collisions. To perform these...

    Go to contribution page
  347. Dr Kortelainen, Matti (Fermi National Accelerator Laboratory)
    5/11/23, 12:00 PM

    In the past years the landscape of tools for expressing parallel algorithms in a portable way across various compute accelerators has continued to evolve significantly. There are many technologies on the market that provide portability between CPU, GPUs from several vendors, and in some cases even FPGAs. These technologies include C++ libraries such as Alpaka and Kokkos, compiler directives...

    Go to contribution page
  348. Mazurek, Michał (CERN)
    5/11/23, 12:00 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    Gaussino is a new experiment-independent simulation framework based on the Gaudi data processing framework. It provides generic core components and interfaces to build a complete simulation application: generation, detector simulation, geometry, monitoring, and saving of the simulated data. Thanks to its highly configurable and extendable components Gaussino can be used both as a toolkit and a...

    Go to contribution page
  349. Dr Hurtado Anampa, Kenyi (University of Notre Dame)
    5/11/23, 12:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    The NSF-funded Scalable CyberInfrastructure for Artificial Intelligence and Likelihood Free Inference (SCAILFIN) project has developed and deployed artificial intelligence (AI) and likelihood-free inference (LFI) techniques and software using scalable cyberinfrastructure (CI) built on top of existing CI elements. Specifically, the project has extended the CERN-based REANA framework, a...

    Go to contribution page
  350. Kalliokoski, Matti (Helsinki Institute of Physics)
    5/11/23, 12:00 PM
    Track 3 - Offline Computing
    Oral

    MoEDAL (the Monopole and Exotics Detector at the LHC) searches directly for magnetic monopoles at Interaction Point 8 of the Large Hadron Collider (LHC). As an upgrade of the experiment, an additional detector, MAPP (MoEDAL Apparatus for Penetrating Particles), extends the physics reach by providing sensitivity to milli-charged and long-lived exotic particles. The MAPP detectors are scintillator...

    Go to contribution page
  351. Ifrim, Ioana (Princeton University)
    5/11/23, 12:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    In particle physics, data analysis frequently needs variable-length, nested data structures such as arbitrary numbers of particles per event and combinatorial operations to search for particle decay. Arrays of these data types are provided by the Awkward Array library.

    The previous version of this library was implemented in C++, but this impeded its ability to grow. Thus, driven by this...

    Go to contribution page
  352. Collier, Ian (UKRI-STFC)
    5/11/23, 12:00 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    UKRI/STFC’s Scientific Computing Department (SCD) runs a vibrant range of computing-related public engagement activities. We benefit from the work done by the National Labs public engagement team to develop a well-articulated PE strategy, and an accompanying evaluation framework, including the idea of defining formal generic learning outcomes (GLOs).

    This paper presents how this combination...

    Go to contribution page
  353. Barreiro Megino, Fernando (Unive)
    5/11/23, 12:00 PM
    Track 4 - Distributed Computing
    Oral

    In recent years, advanced and complex analysis workflows have gained increasing importance in the ATLAS experiment at CERN, one of the large scientific experiments at the Large Hadron Collider (LHC). Support for such workflows has allowed users to exploit remote computing resources and service providers distributed worldwide, overcoming limitations on local resources and services. The spectrum...

    Go to contribution page
  354. Le Boulicaut, Elise
    5/11/23, 12:15 PM
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Oral

    The Virtual Visit service run by the ATLAS Collaboration has been active since 2010. The ATLAS Collaboration has used this popular and effective method to bring the excitement of scientific exploration and discovery into classrooms and other public places around the world. The programme, which uses a combination of video conferencing, webcasts, and video recording to communicate with remote...

    Go to contribution page
  355. Choi, KyungEon (University of Texas at Austin)
    5/11/23, 12:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    Recent developments of HEP software allow novel approaches to physics analysis workflows. The novel data delivery system, ServiceX, can be very effective when accessing a fraction of large datasets at remote grid sites. ServiceX can deliver user-selected columns with filtering and run at scale. We will introduce the ServiceX data management package, ServiceX DataBinder, for easy manipulations...

    Go to contribution page
  356. Li, Haoyang (UCSD)
    5/11/23, 12:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The findable, accessible, interoperable, and reusable (FAIR) data principles have provided a framework for examining, evaluating, and improving how we share data with the aim of facilitating scientific discovery. Efforts have been made to generalize these principles to research software and other digital products. Artificial intelligence (AI) models---algorithms that have been trained on data...

    Go to contribution page
  357. Bassi, Giovanni (SNS and INFN Pisa)
    5/11/23, 12:15 PM

    We report the implementation details, commissioning results, and physics performances of a two-dimensional cluster finder for reconstructing hit positions in the new vertex pixel detector (VELO) that is part of the LHCb Upgrade. The associated custom VHDL firmware has been deployed to the existing FPGA cards that perform the readout of the VELO and fully commissioned during the start of LHCb...

    Go to contribution page
  358. Murray, Steven (CERN)
    5/11/23, 12:15 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The File Transfer System (FTS) is a software system responsible for queuing, scheduling, dispatching and retrying file transfer requests. It is used by three of the LHC experiments, namely ATLAS, CMS and LHCb, as well as non-LHC experiments including AMS, DUNE and NA62. FTS is critical to the success of many experiments and the service must remain available and performant during the entire...

    Go to contribution page
  359. Graf, Norman (SLAC National Accelerator Laboratory)
    5/11/23, 12:15 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    LCIO is a persistency framework and event data model originally developed to foster closer collaboration among the international groups conducting simulation studies for future linear colliders. In the twenty years since its introduction at CHEP 2003 it has formed the backbone for ILC and CLIC physics and detector studies. It has also been successfully employed to study and develop other...

    Go to contribution page
  360. Cifra, Pierfrancesco (Nikhef National institute for subatomic physics (PL) and CERN)
    5/11/23, 12:15 PM
    Track 7 - Facilities and Virtualization
    Oral

    LHCb (Large Hadron Collider beauty) is one of the four large particle physics experiments aimed at studying differences between particles and anti-particles and very rare decays in the charm and beauty sector of the standard model at the LHC. The Experiment Control System (ECS) is in charge of the configuration, control, and monitoring of the various subdetectors as well as all areas of the...

    Go to contribution page
  361. Mr Khan, Raees (University of Pittsburgh)
    5/11/23, 12:15 PM
    Track 3 - Offline Computing
    Oral

    FullSimLight is a lightweight, Geant4-based command line
    simulation utility intended for studies of simulation performance. It
    is part of the GeoModel toolkit (geomodel.web.cern.ch) which has been
    stable for more than one year. The FullSimLight component
    has recently undergone renewed development aimed at extending its
    functionality. It has been endowed with a GUI for fast,...

    Go to contribution page
  362. Dr Pérez-Calero Yzquierdo, Antonio (CIEMAT - PIC)
    5/11/23, 12:15 PM
    Track 4 - Distributed Computing
    Oral

    The computing resources supporting the LHC experiments research programmes are still dominated by x86 processors deployed at WLCG sites. This will however evolve in the coming years, as a growing number of HPC and Cloud facilities will be employed by the collaborations in order to process the vast amounts of data to be collected in the LHC Run 3 and into the HL-LHC phase. Compute power in...

    Go to contribution page
  363. Simelevicius, Dainius (Vilnius University/CERN)
    5/11/23, 12:15 PM
    Track 2 - Online Computing
    Oral

    The CMS data acquisition (DAQ) is implemented as a service-oriented architecture where DAQ applications, as well as general applications such as monitoring and error reporting, are run as self-contained services. The task of deployment and operation of services is achieved by using several heterogeneous facilities, custom configuration data and scripts in several languages. Deployment of all...

    Go to contribution page
  364. Rossi, Cristian (INFN Roma1)
    5/11/23, 12:30 PM

    High Energy Physics (HEP) Trigger and Data Acquisition systems (TDAQs) need ever increasing throughput and real-time data analytics capabilities either to improve particle identification accuracy and further suppress background events in trigger systems or to perform an efficient online data reduction for trigger-less ones.
    As for the requirements imposed by HEP TDAQs applications in the...

    Go to contribution page
  365. Wang, Rui (Argonne national laboratory)
    5/11/23, 12:30 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    Modern HEP workflows must manage increasingly large and complex data collections. HPC facilities may be employed to help meet these workflows' growing data processing needs. However, a better understanding of the I/O patterns and underlying bottlenecks of these workflows is necessary to meet the performance expectations of HPC systems.
    Darshan is a lightweight I/O characterization tool that...

    Go to contribution page
  366. Srimanobhas, Phat
    5/11/23, 12:30 PM
    Track 3 - Offline Computing
    Oral

    In this contribution we report the status of the CMS Geant4 simulation and the prospects for Run-3 and Phase-2.
    First, we report on our experience during the start of Run-3 with Geant4 10.7.2, the common software package DD4hep for geometry description, and the VecGeom run-time geometry library. In addition, the FTFP_BERT_EMM Physics List and the CMS configuration for tracking in magnetic field have...

    Go to contribution page
  367. Ms Long, Shiting (Forschungszentrum Jülich)
    5/11/23, 12:30 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    HPC systems are increasingly often used for addressing various challenges in high-energy physics. But often the data infrastructures used in the latter area are not well integrated with infrastructures that include HPC resources. Here we will focus on a specific infrastructure, namely Fenix, which is based on a consortium of 6 leading European supercomputing centres. The Fenix sites are...

    Go to contribution page
  368. Canal, Philippe
    5/11/23, 12:30 PM
    Track 6 - Physics Analysis Tools
    Oral

    Cling is a clang/LLVM-based, high-performance C++ interpreter originating from HEP. In ROOT, cling serves as the basis for language interoperability, provides reflection data to ROOT's I/O system, and enables RDataFrame's dynamic type-safe interfaces.

    Cling regularly moves to more recent LLVM versions to bring new features and support for new language standards. The recent LLVM 13...

    Go to contribution page
  369. Ebert, Marcus (University of Victoria)
    5/11/23, 12:30 PM
    Track 4 - Distributed Computing
    Oral

    Cloudscheduler is a system that manages resources of local and remote compute clouds and makes those resources available to HTCondor pools. It examines the resource needs of idle jobs, then starts virtual machines (VMs) sized to suit those resource needs on allowed clouds with available resources. Using YAML files, cloudscheduler then provisions the VMs during the boot process with all necessary...

    Go to contribution page
  370. Brummer, Philipp (CERN)
    5/11/23, 12:30 PM
    Track 2 - Online Computing
    Oral

    The CMS experiment data acquisition (DAQ) collects data for events accepted by the Level-1 trigger from the different detector systems and assembles them in an event builder prior to making them available for further selection in the High Level Trigger, and finally storing the selected ones for offline analysis. In addition to the central DAQ providing global acquisition functionality, several...

    Go to contribution page
  371. Glushkov, Ivan (The University of Texas at Arlington)
    5/11/23, 12:30 PM
    Track 7 - Facilities and Virtualization
    Oral

    The ATLAS Trigger and Data Acquisition (TDAQ) High Level Trigger (HLT) computing farm contains 120,000 cores. These resources are critical for online selection and collection of collision data in the ATLAS experiment during LHC operation. Since 2013, during longer periods of LHC inactivity, these resources have been used for offline event simulation via the "Simulation at Point One" project...

    Go to contribution page
  372. Mr Tsoi, Ho Fung (University of Wisconsin Madison)
    5/11/23, 12:30 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The high-energy physics community is investigating the feasibility of deploying more machine-learning-based solutions on FPGAs to meet modern physics experiments' sensitivity and latency demands. In this contribution, we introduce a novel end-to-end procedure that utilises a forgotten method in machine learning, i.e. symbolic regression (SR). It searches equation space to discover algebraic...

    Go to contribution page
  373. Furukawa, Marin (University of Tokyo)
    5/11/23, 2:00 PM
    Track 2 - Online Computing
    Oral

    The Liquid Argon Calorimeters are employed by ATLAS for all electromagnetic calorimetry in the pseudo-rapidity region |η| < 3.2, and for hadronic and forward calorimetry in the region from |η| = 1.5 to |η| = 4.9. They also provide inputs to the first level of the ATLAS trigger. After a successful period of data taking during the LHC Run-2 between 2015 and 2018, the ATLAS detector entered into the...

    Go to contribution page
  374. Krumnack, Nils (Iowa State University (US))
    5/11/23, 2:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    During Run 2 the ATLAS experiment employed a large number of different user frameworks to perform the final corrections of its event data. For Run 3 a common framework was developed that incorporates the lessons learned from existing frameworks. Besides providing analysis standardization it also incorporates optimizations that lead to a substantial reduction in computing needs during analysis.

    Go to contribution page
  375. Huang, Qiulan (Brookhaven National Laboratory)
    5/11/23, 2:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The rapid growth of scientific data and the computational needs of BNL-supported science programs will bring the Scientific Data and Computing Center (SDCC) to the Exabyte scale in the next few years. The SDCC Storage team is responsible for the symbiotic development and operations of storage services for all BNL experiment data, in particular for the data generated by the ATLAS experiment...

    Go to contribution page
  376. van Gemmeren, Peter (Argonne National Laboratory)
    5/11/23, 2:00 PM
    Track 3 - Offline Computing
    Oral

    For HEP event processing, data is typically stored in column-wise synchronized containers, such as, most prominently, ROOT’s TTree, which have been used for several decades and by now store over 1 exabyte. These containers can combine row-wise association capabilities needed by most HEP event processing frameworks (e.g. Athena for ATLAS) with column-wise storage, which typically results in better...

    Go to contribution page
  377. Aaij, Roel (Nikhef National institute for subatomic physics (NL))
    5/11/23, 2:00 PM

    The first stage of the LHCb High Level Trigger is implemented as a GPU application. In 2023 it will run on 400 NVIDIA GPUs and its goal is to reduce the rate of incoming data from 5 TB/s to approximately 100 GB/s. A broad range of reconstruction algorithms is implemented as approximately 70 kernels. Machine Learning algorithms are attractive to further extend the physics reach of the...

    Go to contribution page
  378. Mascheroni, Marco (University of California San Diego)
    5/11/23, 2:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    The CMSWEB cluster is pivotal to the activities of the Compact Muon Solenoid (CMS) experiment, as it hosts critical services required for the operational needs of the CMS experiment. The security of these services and the corresponding data is crucial to CMS. Any malicious attack can compromise the availability of our services. Therefore, it is important to construct a robust security...

    Go to contribution page
  379. Dr Karavakis, Edward (Brookhaven National Laboratory)
    5/11/23, 2:00 PM
    Track 4 - Distributed Computing
    Oral

    The Vera C. Rubin Observatory will produce an unprecedented astronomical data set for studies of the deep and dynamic universe. Its Legacy Survey of Space and Time (LSST) will image the entire southern sky every three days and produce tens of petabytes of raw image data and associated calibration data. More than 20 terabytes of data must be processed and stored every night for ten...

    Go to contribution page
  380. Voigt, Johann (TU Dresden)
    5/11/23, 2:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The Large Hadron Collider (LHC) at CERN is the largest and most powerful particle collider today. The Phase-II Upgrade of the LHC will increase the instantaneous luminosity by a factor of 7 leading to the High Luminosity LHC (HL-LHC). At the HL-LHC, the number of proton-proton collisions in one bunch crossing (called pileup) increases significantly, putting more stringent requirements on the...

    Go to contribution page
  381. Knoepfel, Kyle (Fermi National Accelerator Laboratory)
    5/11/23, 2:00 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    HEP data-processing frameworks are essential ingredients in getting from raw data to physics results. But they are often tricky to use well, and they present a significant learning barrier for the beginning HEP physicist. In addition, existing frameworks typically support rigid, collider-based data models, which do not map well to neutrino-physics experiments like DUNE. Neutrino physicists...

    Go to contribution page
  382. Mr Pham, Tuan Minh (University of Wisconsin-Madison)
    5/11/23, 2:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Reliably simulating detector response to hadrons is crucial for almost all physics programs at the Large Hadron Collider. The core component of such simulation is the modeling of hadronic interactions. Unfortunately, there is no first-principle theory guidance. The current state-of-the-art simulation tool, Geant4, exploits phenomenology-inspired parametric models, each simulating a specific...

    Go to contribution page
  383. Beermann, Thomas (DESY)
    5/11/23, 2:15 PM
    Track 7 - Facilities and Virtualization
    Oral

    Helmholtz Federated IT Services (HIFIS) provides shared IT services across all fields and centres in the Helmholtz Association. HIFIS is a joint platform in which most of the research centres in the Helmholtz Association collaborate and offer cloud and fundamental backbone services free of charge to scientists in Helmholtz and their partners. Furthermore, HIFIS provides a federated...

    Go to contribution page
  384. Delaney, Blaise (Massachusetts Institute of Technology)
    5/11/23, 2:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The data-taking conditions expected of Run 3 pose unprecedented challenges for the DAQ systems of the LHCb experiment at the LHC. The LHCb collaboration is pioneering the adoption of a fully-software trigger to cope with the expected increase in luminosity and, thus, event rate. The upgraded trigger system has required advances in the use of hardware architectures, software and algorithms....

    Go to contribution page
  385. Chapeland, Sylvain (CERN)
    5/11/23, 2:15 PM
    Track 2 - Online Computing
    Oral

    ALICE (A Large Ion Collider Experiment) is a heavy-ion detector studying the physics of strongly interacting matter and the quark-gluon plasma at the CERN LHC (Large Hadron Collider). During the second long shut-down of the LHC, the ALICE detector was upgraded to cope with an interaction rate of 50 kHz in Pb-Pb collisions, producing in the online computing system (O2) a sustained input...

    Go to contribution page
  386. Chuchuk, Olga (CERN, University of Cote d'Azur)
    5/11/23, 2:15 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    In the HEP community, the prediction of Data Popularity is a topic that has been approached for many years. Nonetheless, while facing increasing data storage challenges, especially in the HL-LHC era, we are still in need of better predictive models to answer the questions of whether particular data should be kept, replicated, or deleted.

    The usage of caches proved to be a convenient...

    Go to contribution page
  387. Pasquali, Dominic (UC Santa Cruz / CERN)
    5/11/23, 2:15 PM

    In mainstream machine learning, transformers are gaining widespread usage. As Vision Transformers rise in popularity in computer vision, they now aim to tackle a wide variety of machine learning applications. In particular, transformers for High Energy Physics (HEP) experiments continue to be investigated for tasks including jet tagging, particle reconstruction, and pile-up mitigation.

    In a...

    Go to contribution page
  388. Mr Buss, Thorsten (Universität Hamburg)
    5/11/23, 2:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The full simulation of particle colliders incurs a significant computational cost. Among the most resource-intensive steps are detector simulations. It is expected that future developments, such as higher collider luminosities and highly granular calorimeters, will increase the computational resource requirement for simulation beyond availability. One possible solution is generative neural...

    Go to contribution page
  389. Dr Marcon, Caterina (INFN Milano)
    5/11/23, 2:15 PM
    Track 3 - Offline Computing
    Oral

    The increased footprint foreseen for Run-3 and HL-LHC data will soon expose
    the limits of currently available storage and CPU resources. Data formats
    are already optimized according to the processing chain for which they are
    designed. ATLAS events are stored in ROOT-based reconstruction output files
    called Analysis Object Data (AOD), which are then processed within the
    derivation...

    Go to contribution page
  390. Schaarschmidt, Jana (University of Washington (US))
    5/11/23, 2:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    ATLAS is one of the main experiments at the Large Hadron Collider, with a diverse physics program covering precision measurements as well as new physics searches in countless final states, carried out by more than 2600 active authors. The High Luminosity LHC (HL-LHC) era brings unprecedented computing challenges that call for novel approaches to reduce the amount of data and MC that is stored,...

    Go to contribution page
  391. Kuhr, Thomas (LMU Munich)
    5/11/23, 2:15 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    The Belle II software was developed as closed source. As several HEP experiments released their code to the public, the topic of open source software was also discussed within the Belle II collaboration. A task force analyzed advantages and disadvantages and proposed a policy which was adopted by the collaboration in 2020. The Belle II offline software was then released under an open source...

    Go to contribution page
  392. Hernandez, Fabio (IN2P3 / CNRS computing center)
    5/11/23, 2:15 PM
    Track 4 - Distributed Computing
    Oral

    The Vera C. Rubin Observatory, currently under construction in Chile, will start performing the Legacy Survey of Space and Time (LSST) in late 2024 and continue for 10 years. Its 8.4-meter telescope will survey the southern sky in less than 4 nights in six optical bands, and repeatedly generate about 2000 exposures per night, corresponding to a data volume of about 20 TB every night. Three data facilities are...

    Go to contribution page
  393. Papadopoulos, Sokratis (CERN)
    5/11/23, 2:30 PM
    Track 7 - Facilities and Virtualization
    Oral

    The centralised Elasticsearch service has already been running at CERN for over 6 years, providing the search and analytics engine for numerous CERN users. The service has been based on the open-source version of Elasticsearch, surrounded by a set of external open-source plugins offering security, multi-tenancy, extra visualization types and more. The evaluation of OpenDistro for...

    Go to contribution page
  394. Krumnack, Nils (Iowa State University (US))
    5/11/23, 2:30 PM
    Track 6 - Physics Analysis Tools
    Oral

    Current analysis at ATLAS always involves a step of producing an analysis-specific n-tuple that incorporates the final step of calibrations as well as systematic variations. The goal for Run 4 is to make these analysis-specific corrections fast enough that they can be applied "on-the-fly" without the need for an intermediate n-tuple. The main complications for this are that some of these...

    Go to contribution page
  395. Ventura Gonzalez, Salvador (CEA-SACLAY)
    5/11/23, 2:30 PM
    Track 2 - Online Computing
    Oral

    A new era of hadron collisions will start around 2029 with the High-Luminosity LHC, which will allow the collection of ten times more data than has been collected during 10 years of operation at the LHC. This will be achieved by higher instantaneous luminosity at the price of a higher number of collisions per bunch crossing.

    In order to withstand the high expected radiation doses and the harsher...

    Go to contribution page
  396. Ganis, Gerardo (CERN), Sailer, Andre (CERN)
    5/11/23, 2:30 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    Detector studies for future experiments rely on advanced software
    tools to estimate performance and optimize their design and technology
    choices. The Key4hep project provides a flexible turnkey solution for
    the full experiment life-cycle based on established community tools
    such as ROOT, Geant4, DD4hep, Gaudi, podio and spack. Members of the
    CEPC, CLIC, EIC, FCC, and ILC communities have...

    Go to contribution page
  397. CHENG, Yaodong (IHEP, CAS)
    5/11/23, 2:30 PM
    Track 4 - Distributed Computing
    Oral

    The Large High Altitude Air Shower Observatory (LHAASO) is a large-scale astrophysics experiment led by China. The offline data processing was highly dependent on the Institute of High Energy Physics (IHEP) local cluster and the local file system.
    As the LHAASO experimental cooperation groups’ resources are located geographically and most of them have the characteristics of limited scale, low...

    Go to contribution page
  398. Mr Garrido Bear, Borja (CERN)
    5/11/23, 2:30 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    Complete and reliable monitoring of the WLCG data transfers is an important condition for effective computing operations of the LHC experiments. WLCG data challenges organised in 2021 and 2022 highlighted the need for improvements in the monitoring of data traffic on the WLCG infrastructure. In particular, it concerns the implementation of the monitoring of the remote data access via the...

    Go to contribution page
  399. Dr Faucci Giannelli, Michele (INFN Roma Tor Vergata)
    5/11/23, 2:30 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    For High Energy Physics (HEP) experiments, the calorimeter is a key detector to measure the energy of particles. Particles interact with the material of the calorimeter, creating cascades of secondary particles, the so-called showers. Description of the showering process relies on simulation methods that precisely describe all particle interactions with matter. Constrained by the complexity of...

    Go to contribution page
  400. Brown, Christopher (Imperial College (GB))
    5/11/23, 2:30 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The High-Luminosity LHC upgrade of the CMS experiment will utilise a large number of Machine Learning (ML) based algorithms in its hardware-based trigger. These ML algorithms will facilitate the selection of potentially interesting events for storage and offline analysis. Strict latency and resource requirements limit the size and complexity of these models due to their use in a high-speed...

    Go to contribution page
  401. VALLECORSA, SOFIA (CERN)
    5/11/23, 2:30 PM

    A new theoretical framework in Quantum Machine Learning (QML) allows the performance of quantum and classical ML models to be compared on supervised learning tasks. We assess the performance of a quantum and a classical support vector machine on a High Energy Physics dataset: the Higgs tt̄H(bb̄) decay dataset, grounding our study in a new theoretical framework based on three metrics: the geometry...

    Go to contribution page
  402. Esseiva, Julien (Lawrence Berkeley National Laboratory)
    5/11/23, 2:30 PM
    Track 3 - Offline Computing
    Oral

    With the increased data volumes expected to be delivered by the HL-LHC, it becomes critical for the ATLAS experiment to maximize the utilization of available computing resources ranging from conventional GRID clusters to supercomputers and cloud computing platforms. To be able to run its data processing applications on these resources, the ATLAS software framework must be capable of...

    Go to contribution page
  403. Arora, Aashay (UCSD)
    5/11/23, 2:45 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    Due to the increased network traffic expected during the HL-LHC era, the T2 sites in the USA will be required to have 400 Gbps of available bandwidth to their storage solution.
    With the above in mind, we are pursuing a scale test of XRootD software when used to perform Third Party Copy transfers using the HTTP protocol. Our main objective is to understand the possible limitations in...

    Go to contribution page
  404. Jain, Milan (PNNL)
    5/11/23, 2:45 PM
    Track 7 - Facilities and Virtualization
    Oral

    The LCAPE project develops artificial intelligence to improve operations in the FNAL control room by reducing the time to identify the cause of outages, improving the reproducibility of labeling them, predicting their duration, and forecasting their occurrence.
    We present our solution for incorporating information from ~2.5k monitored devices to distinguish between dozens of different causes...

    Go to contribution page
  405. Dossett, David (University of Melbourne)
    5/11/23, 2:45 PM
    Track 3 - Offline Computing
    Oral

    Since March 2019 the Belle II detector has collected data from e+ e- collisions at the SuperKEKB collider. For Belle II analyses to be competitive it is crucial that calibration constants are calculated promptly so that the reconstructed datasets can be provided to analysts. A subset of calibration constants also benefits from being re-derived during yearly recalibration campaigns to give...

    Go to contribution page
  406. Amram, Oz (Fermilab)
    5/11/23, 2:45 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Simulation is a crucial part of all aspects of collider data analysis. However, the computing challenges of the High Luminosity era will require simulation to use a smaller fraction of computing resources, at the same time as more complex detectors are introduced, requiring more detailed simulation. This motivates the use of machine learning (ML) models as surrogates to replace full...

    Go to contribution page
  407. Hoya, Joaquin (CERN)
    5/11/23, 2:45 PM
    Track 2 - Online Computing
    Oral

    Over the next decade, the ATLAS detector will be required to operate in an increasingly harsh collision environment. To maintain physics performance, the detector will undergo a series of upgrades during major shutdowns. A key goal of these upgrades is to improve the capacity and flexibility of the detector readout system. To this end, the Front-End Link eXchange (FELIX) system was developed...

    Go to contribution page
  408. VALLECORSA, SOFIA (CERN)
    5/11/23, 2:45 PM

    Free energy-based reinforcement learning (FERL) with clamped quantum Boltzmann machines (QBM) was shown to significantly improve the learning efficiency compared to classical Q-learning with the restriction, however, to discrete state-action space environments. We extended FERL to continuous state-action space environments by developing a hybrid actor-critic scheme combining a classical...

    Go to contribution page
  409. Haide, Isabel (Karlsruhe Institute of Technology)
    5/11/23, 2:45 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    For the Belle II experiment, the electromagnetic calorimeter (ECL) plays a crucial role in both the trigger decisions and the offline analysis.
    The performance of existing clustering algorithms degrades with rising backgrounds that are expected for the increasing luminosity in Belle II. In offline analyses, this mostly impacts the energy resolution for low-energy photons; for the trigger,...

    Go to contribution page
  410. Madlener, Thomas (DESY)
    5/11/23, 2:45 PM
    Track 6 - Physics Analysis Tools
    Oral

    A performant and easy-to-use event data model (EDM) is a key component of any HEP software stack. The podio EDM toolkit provides a user-friendly way of generating such a performant implementation in C++ from a high-level description in YAML format. Finalizing a few important developments, we release v1.0 of podio, a stable release with backward compatibility for datafiles written with podio...

    Go to contribution page
  411. Faure, Alice (LUPM)
    5/11/23, 2:45 PM
    Track 4 - Distributed Computing
    Oral

    The Cherenkov Telescope Array Observatory (CTAO) is the next generation ground-based observatory for gamma-ray astronomy at very high energies. It will consist of tens of Cherenkov telescopes, spread between two array sites: one in the Northern hemisphere in La Palma (Spain), and one in the Southern hemisphere in Paranal (Chile). Currently under construction, CTAO will start scientific...

    Go to contribution page
  412. Dr Salzburger, Andreas
    5/11/23, 2:45 PM
    Track 5 - Sustainable and Collaborative Software Engineering
    Oral

    Open Data Detector (ODD) is a detector for algorithm research and development. The tracking system is an evolution of the detector used in the successful Tracking Machine Learning Challenge. It offers a more realistic design, with a support structure, cables, and cooling pipes. The ODD has been extended with granular calorimetry and can be completed in the future with a muon system. The magnetic field...

    Go to contribution page
  413. Mr Eschle, Jonas (University of Zürich)
    5/11/23, 3:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    A decade from now, Upgrade II of the LHCb experiment will face an instantaneous luminosity ten times higher than in the current Run 3 conditions. This will bring LHCb to a new era, with huge event sizes and typically several signal heavy-hadron decays per event. The trigger scope will shift from selecting interesting events to selecting interesting parts of multi-signal events. To allow for an...

  414. Strobl, Melvin (Karlsruhe Institute of Technology)
    5/11/23, 3:00 PM

    With the emergence of the research field Quantum Machine Learning, interest in finding advantageous real-world applications is growing as well.
    However, challenges concerning the number of available qubits on Noisy Intermediate Scale Quantum (NISQ) devices and accuracy losses due to hardware imperfections remain and limit the applicability of such approaches in real-world scenarios.
    For...

  415. Mr Jones, Ben (CERN)
    5/11/23, 3:00 PM
    Track 7 - Facilities and Virtualization
    Oral

    The Batch@CERN team manages over 250k CPU cores for batch processing of LHC data with our HTCondor cluster comprising ~5k nodes. We will present a lifecycle management solution for our systems with our in-house developed state-manager daemon BrainSlug and how it handles draining, rebooting, interventions and other actions on the worker nodes, with Rundeck as our human-interaction endpoint and using...

  416. Tadel, Matevz (UCSD)
    5/11/23, 3:00 PM
    Track 3 - Offline Computing
    Oral

    REve, the new generation of the ROOT event-display module, uses a web server-client model to guarantee exact data translation from the experiments' data analysis frameworks to users' browsers. Data is then displayed in various views, including high-precision 2D and 3D graphics views, currently driven by the THREE.js rendering engine based on WebGL technology.

    RenderCore, a computer graphics...

  417. Timm, Steven (Fermi National Accelerator Laboratory)
    5/11/23, 3:00 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    In preparation for the second runs of the ProtoDUNE detectors at CERN (NP02 and NP04), DUNE has established a new data pipeline for bringing the data from the EHN-1 experimental hall at CERN to primary tape storage at Fermilab and CERN, and then spreading it out to a distributed disk data store at many locations around the world. This system includes a new Ingest Daemon and a new Declaration...

  418. Gyurjyan, Vardan (Jefferson Lab)
    5/11/23, 3:00 PM
    Track 2 - Online Computing
    Oral

    The volume and complexity of data produced at HEP and NP research facilities have grown exponentially. There is an increased demand for new approaches to process data in near-real-time. In addition, existing data processing architectures need to be reevaluated to see if they are adaptable to new technologies. A unified programming model for event processing and distribution that can exploit...

  419. Storetvedt, Maksim (CERN)
    5/11/23, 3:00 PM
    Track 4 - Distributed Computing
    Oral

    In preparation for LHC Runs 3 and 4, the ALICE Collaboration has moved to a new Grid middleware, JAliEn, and workflow management system. The migration was dictated by the substantially higher requirements on the Grid infrastructure in terms of payload complexity, increased number of jobs and managed data volume, all of which required a complete rewrite of the middleware using modern software...

  420. Lersch, Daniel (Florida State University)
    5/11/23, 3:00 PM
    Track 6 - Physics Analysis Tools
    Oral

    In the last two decades, there have been major breakthroughs in Quantum Chromodynamics (QCD), the theory of the strong interaction of quarks and gluons, as well as major advances in the accelerator and detector technologies that allow us to map the spatial distribution and motion of the quarks and gluons in terms of quantum correlation functions (QCF). This field of research broadly known as...

  421. Raikwar, Piyush (CERN)
    5/11/23, 3:00 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    Recently, transformers have proven to be a generalized architecture for various data modalities, i.e., ranging from text (BERT, GPT3), time series (PatchTST) to images (ViT) and even a combination of them (Dall-E 2, OpenAI Whisper). Additionally, when given enough data, transformers can learn better representations than other deep learning models thanks to the absence of inductive bias, better...

  422. Barbone, Marco (Imperial College London)
    5/11/23, 3:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    The search for exotic long-lived particles (LLP) is a key area of the current LHC physics programme and is expected to remain so into the High-Luminosity (HL)-LHC era. As in many areas of the LHC physics programme, Machine Learning algorithms play a crucial role in this area, in particular Deep Neural Networks (DNN), which are able to use large numbers of low-level features to achieve enhanced...

  423. Bertran Ferrer, Marta (CERN)
    5/11/23, 3:15 PM
    Track 4 - Distributed Computing
    Oral

    The ALICE Grid is designed to perform comprehensive real-time monitoring of both jobs and execution nodes in order to maintain a continuous and consistent status of the Grid infrastructure. An extensive database of historical data is available and is periodically analyzed to tune the workflow and data management to optimal performance levels. This data, when evaluated in real time, has the...

  424. Syed, S. (FNAL)
    5/11/23, 3:15 PM
    Track 1 - Data and Metadata Organization, Management and Access
    Oral

    The LArSoft/art framework is used at Fermilab’s liquid argon time projection chamber experiments such as ICARUS to run traditional production workflows in a grid environment. It has become increasingly important to utilize HPC facilities for experimental data processing tasks. As part of the SciDAC-4 HEP Data Analytics on HPC and HEP Event Reconstruction with Cutting Edge Computing...

  425. Kelsey, David (UKRI-STFC)
    5/11/23, 3:15 PM
    Track 7 - Facilities and Virtualization
    Oral

    The transition of WLCG storage services to dual-stack IPv4/IPv6 is nearing completion after more than 5 years, thus enabling the use of IPv6-only CPU resources as agreed by the WLCG Management Board and presented by us at earlier CHEP conferences. Much of the data is transferred by the LHC experiments over IPv6. All Tier-1 storage and over 90% of Tier-2 storage is now IPv6-enabled, yet we...

  426. Garg, Rocky (Stanford University)
    5/11/23, 3:15 PM
    Track 3 - Offline Computing
    Oral

    Particle tracking is among the most sophisticated and complex parts of the full event reconstruction chain. A number of reconstruction algorithms work in sequence to build these trajectories from detector hits. Each of these algorithms uses many configuration parameters that need to be fine-tuned to properly account for the detector/experimental setup, the available CPU budget and the desired...

  427. Wolf, Moritz (Hamburg University)
    5/11/23, 3:15 PM
    Track 9 - Artificial Intelligence and Machine Learning
    Oral

    At the CMS experiment, a growing reliance on the fast Monte Carlo application (FastSim) will accompany the high luminosity and detector granularity expected in Phase 2. The FastSim chain is roughly 10 times faster than the application based on the GEANT4 detector simulation and full reconstruction referred to as FullSim. However, this advantage comes at the price of decreased accuracy in some...

  428. de Boer, Remco (Ruhr University Bochum)
    5/11/23, 3:15 PM
    Track 6 - Physics Analysis Tools
    Oral

    In an ideal world, we describe our models with recognizable mathematical expressions and directly fit those models to large data samples with high performance. It turns out that this can be done with a CAS, using its symbolic expression trees as templates for computational back-ends like JAX. The CAS can in fact further simplify the expression tree, which can result in speed-ups in the numerical...
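
    The idea of using a symbolic expression tree as a template for a numerical function can be sketched in a few lines. The classes below are hypothetical stand-ins written for illustration only; they are not the CAS or the JAX back-end the abstract refers to, and they cover just two textbook simplifications (dropping a factor of 1 and a summand of 0).

```python
from dataclasses import dataclass
import math


@dataclass(frozen=True)
class Sym:
    name: str


@dataclass(frozen=True)
class Num:
    value: float


@dataclass(frozen=True)
class Op:
    kind: str    # "add", "mul" or "exp" (toy operator set, an assumption)
    args: tuple


def simplify(node):
    """Rewrite bottom-up: drop factors of 1 and summands of 0."""
    if not isinstance(node, Op):
        return node
    args = tuple(simplify(a) for a in node.args)
    if node.kind == "mul":
        args = tuple(a for a in args if a != Num(1.0)) or (Num(1.0),)
    if node.kind == "add":
        args = tuple(a for a in args if a != Num(0.0)) or (Num(0.0),)
    if node.kind in ("add", "mul") and len(args) == 1:
        return args[0]
    return Op(node.kind, args)


def compile_tree(node):
    """Turn the tree into a plain Python callable taking keyword arguments."""
    if isinstance(node, Sym):
        return lambda **env: env[node.name]
    if isinstance(node, Num):
        return lambda **env: node.value
    funcs = [compile_tree(a) for a in node.args]
    if node.kind == "add":
        return lambda **env: sum(f(**env) for f in funcs)
    if node.kind == "mul":
        return lambda **env: math.prod(f(**env) for f in funcs)
    if node.kind == "exp":
        return lambda **env: math.exp(funcs[0](**env))
    raise ValueError(f"unknown operator: {node.kind}")


# Toy model 1 * a * exp(-1 * k * x) + 0, written with deliberate redundancy.
decay = Op("exp", (Op("mul", (Num(-1.0), Sym("k"), Sym("x"))),))
model = Op("add", (Op("mul", (Num(1.0), Sym("a"), decay)), Num(0.0)))

simplified = simplify(model)        # a * exp(-1 * k * x)
evaluate = compile_tree(simplified)
print(evaluate(a=2.0, k=0.5, x=1.0))
```

    The simplified tree has fewer nodes to evaluate per data point, which is the kind of saving the abstract alludes to; a real CAS applies far more rewrite rules than this sketch.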

  429. Mr Heredge, Jamie (University of Melbourne)
    5/11/23, 3:15 PM

    In the search for advantage in Quantum Machine Learning, appropriate encoding of the underlying classical data into a quantum state is a critical step. Our previous work [1] implemented an encoding method inspired by underlying physics and achieved AUC scores higher than those of classical techniques for a reduced dataset of B Meson Continuum Suppression data. A particular problem faced by...

  430. Dinardo, Mauro (Università degli Studi di Milano - Bicocca)
    5/11/23, 3:15 PM
    Track 2 - Online Computing
    Oral

    To improve the potential for discoveries at the LHC, a significant luminosity increase of the accelerator (HL-LHC) is foreseen in the late 2020s, to achieve a peak luminosity of 7.5x10^34 cm^-2 s^-1, in the ultimate performance scenario. HL-LHC is expected to run with a bunch-crossing separation of 25 ns and a maximum average of 200 events (pile-up) per crossing. To maintain or even improve...
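
    The two quoted numbers imply a simple rate estimate, sketched below. It assumes that every 25 ns slot holds a colliding bunch pair, which gives an upper bound; the actual filling scheme is not given in the abstract.

```python
BUNCH_SPACING_NS = 25.0   # bunch-crossing separation quoted above
PILEUP = 200              # maximum average events per crossing quoted above

# Assumption: every 25 ns slot is filled, so this is an upper bound.
crossing_rate_hz = 1e9 / BUNCH_SPACING_NS
interaction_rate_hz = crossing_rate_hz * PILEUP

print(f"crossing rate:    {crossing_rate_hz:.1e} Hz")
print(f"interaction rate: {interaction_rate_hz:.1e} Hz")
```

    Under that assumption the crossing rate is 40 MHz and the interaction rate is 8 x 10^9 per second.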

  431. Monzani, Maria Elena (SLAC)
    5/11/23, 4:30 PM
    Plenary

    The nature and origin of dark matter are among the most compelling mysteries of contemporary science. There is strong evidence for dark matter from its role in shaping the galaxies and galaxy clusters that we observe in the universe. Still, physicists have tried to detect dark matter particles for over three decades with little success.

    This talk will describe the leading effort in that...

  432. Thayer, Jana (SLAC)
    5/11/23, 5:00 PM
    Plenary

    The increasing volumes of data produced at light sources such as the Linac Coherent Light Source (LCLS) enable the direct observation of materials and molecular assemblies at the length and timescales of molecular and atomic motion. This exponential increase in the scale and speed of data production is prohibitive to traditional analysis workflows that rely on scientists tuning parameters...

  433. Bolton, Rosie (SKA Observatory)
    5/11/23, 5:30 PM
    Plenary

    The astronomy world is moving towards exascale observatories and experiments, with data distribution and access challenges that are comparable with - and on similar timescales to - those of HL-LHC. I will present some of these use cases and show progress towards prototyping work in the Square Kilometre Array (SKA) Regional Centre Network, and present the conceptual architectural view of the...

  434. Britton, Thomas (JLab), Childers, Taylor (Argonne National Laboratory)
    5/11/23, 6:00 PM
    Poster
  435. Barisits, Martin (CERN)
    5/12/23, 9:00 AM
  436. Yamada, Satoru (KEK)
    5/12/23, 9:15 AM
  437. Bandieramonte, Marilena (University of Pittsburgh)
    5/12/23, 9:30 AM
  438. Ellis, Katy (STFC-RAL)
    5/12/23, 9:45 AM
  439. Sexton, Elizabeth (FNAL)
    5/12/23, 10:00 AM
  440. Hageboeck, Stephan (CERN)
    5/12/23, 10:15 AM
  441. Martinez Outschoorn, Verena Ingrid (University of Massachusetts Amherst)
    5/12/23, 11:00 AM
  442. Hernandez Villanueva, Michel (DESY)
    5/12/23, 11:15 AM
  443. VALLECORSA, SOFIA (CERN)
    5/12/23, 11:30 AM
  444. Timm, Steven (Fermi National Accelerator Laboratory)
    5/12/23, 11:45 AM
  445. 5/12/23, 12:00 PM
  446. Jones, Roger (Lancaster University)
    Poster
    Poster

    When a new long-term storage facility was needed at the Lancaster WLCG Tier-2 Site, an architecture was chosen involving CephFS as a failure-tolerant back-end volume, and load-balanced XRootD as an endpoint exposing the volume via the HTTPS/DAVS protocols increasingly favoured by the WLCG and other users. This allows operations to continue in the face of disc/node failures with minimal...

  447. Giannini, Leonardo (UCSD)
    Poster
    Poster

    The CMS tracking follows an iterative approach. Tracks are reconstructed in multiple passes starting from the ones that are easiest to find and moving to the ones with more complex topologies (lower transverse momentum, high displacement). A track classification is applied after each iteration using multivariate analysis and several selection requirements are defined: loose, tight, and high...
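
    The multi-pass structure described above can be illustrated with a toy loop. Everything in this sketch is hypothetical: the pass names, the thresholds and the candidate objects are invented for illustration, and removing accepted candidates from the pool between passes is an assumption of the sketch, not a description of the CMS code.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    label: str
    pt: float            # transverse momentum in GeV (toy value)
    displacement: float  # distance from the beam line in cm (toy value)


# Hypothetical passes, ordered from easiest to hardest topology:
# (name, minimum pt, maximum displacement).
ITERATIONS = [
    ("prompt_high_pt", 1.0, 0.1),
    ("prompt_low_pt", 0.2, 0.1),
    ("displaced", 0.2, 10.0),
]


def iterative_tracking(candidates):
    """Run each pass on what earlier passes left behind."""
    remaining = list(candidates)
    found = {}
    for name, min_pt, max_displacement in ITERATIONS:
        accepted = [
            c for c in remaining
            if c.pt >= min_pt and c.displacement <= max_displacement
        ]
        found[name] = [c.label for c in accepted]
        # Later passes only see candidates not yet accepted.
        remaining = [c for c in remaining if c not in accepted]
    return found, [c.label for c in remaining]


candidates = [
    Candidate("A", 5.0, 0.01),
    Candidate("B", 0.5, 0.05),
    Candidate("C", 2.0, 4.0),
    Candidate("D", 0.1, 0.02),
]
found, unassigned = iterative_tracking(candidates)
print(found, unassigned)
```

    Each pass works on a smaller pool than the previous one, so the looser and more expensive selections run on fewer inputs.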

  448. Cavallini, Viola (INFN and University of Ferrara)
    Poster
    Poster

    Timepix4 is a hybrid pixel detector readout ASIC developed by the Medipix4 Collaboration. It consists of a matrix of about 230 k pixels with 55 micron pitch, each equipped with an amplifier, a discriminator and a time-to-digital converter with 195 ps bin size that allows time-of-arrival and time-over-threshold to be measured. It is equipped with two different types of links: slow control links, for the...

  449. Duyang, Hongyue (Shandong University)
    Poster
    Poster

    The Jiangmen Underground Neutrino Observatory (JUNO) experiment is designed to measure the neutrino mass ordering (NMO) using a 20-kton liquid scintillator (LS) detector. Besides the precise measurement of the reactor neutrino oscillation spectrum, an atmospheric neutrino oscillation measurement in JUNO offers independent sensitivity for NMO, which can potentially increase JUNO’s total...

  450. Wang, Tianle (Brookhaven National Lab)
    Poster
    Poster

    A random number generator is an important component of many scientific projects. Many of those projects are written using programming models (like OpenMP and SYCL) to target different architectures. However, some of the programming models do not provide a random number generator. In this talk we introduce our random number generator wrapper. It is a header-only library that supports...

  451. Vianello, Enrico (INFN)
    Poster
    Poster

    Currently, the StoRM storage manager relies on the SRM specification to recall files from tape. Although SRM has served us well for many years, its complexity has pushed the WLCG community to adopt a simpler approach, more in line with modern web technologies.
    The WLCG tape REST API offers a common HTTP interface allowing clients to manage disk residency of tape-stored files and observe the...

  452. Dack, Tom (STFC UKRI)
    Poster
    Poster

    Since 2020, UKRI/STFC’s Scientific Computing Department (SCD) has developed several remote-first public engagement activities, drawing on its long and rich history of delivering face-to-face public engagement and outreach, as part of the wider STFC programme.

    With COVID-19 restrictions lifted in the UK, STFC has been able to resume in-person public engagement, both on site and in public...

  453. Fu, Shiyuan (Institute of High Energy Physics, CAS)
    Poster
    Poster

    Delphes is a C++ framework to perform a fast multipurpose detector response simulation. The Circular Electron Positron Collider (CEPC) experiment runs fast simulation with a modified Delphes based on its own scientific objectives. The CEPC fast simulation with Delphes is a High Throughput Computing (HTC) application with small input and output files. Besides, to compile and run Delphes, only...

  454. Fu, Shiyuan (Institute of High Energy Physics, CAS)
    Poster
    Poster

    Slurm REST APIs have been available since version 20.02. With those REST APIs one can interact with the slurmctld and slurmdbd daemons in a RESTful way. As a result, job submission and cluster status queries can be performed from a web system. To take advantage of the Slurm REST APIs, a web workbench system has been developed for the Slurm cluster at IHEP.
    The workbench system consists of four subsystems:...
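
    A cluster status query of the kind mentioned above can be sketched with the Python standard library. This is a minimal illustration, not the IHEP workbench code: the host, port, API version string, user name and token are placeholders, and the version segment of the path depends on the installed Slurm release. The request is only built here, not sent.

```python
from urllib.request import Request


def build_jobs_query(base_url, api_version, user, token):
    """Build (but do not send) a GET request for the slurmrestd job list."""
    request = Request(f"{base_url}/slurm/{api_version}/jobs", method="GET")
    # slurmrestd authenticates via these two headers when JWT auth is used.
    request.add_header("X-SLURM-USER-NAME", user)
    request.add_header("X-SLURM-USER-TOKEN", token)
    request.add_header("Accept", "application/json")
    return request


# Placeholder values (assumptions): adjust to the local slurmrestd setup.
request = build_jobs_query(
    "http://localhost:6820", "v0.0.39", "alice", "example-jwt"
)
print(request.get_method(), request.full_url)
```

    Against a running slurmrestd instance, the request could be sent with `urllib.request.urlopen(request)` and the JSON reply parsed with `json.load`.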

  455. Chudoba, Jiri
    Poster
    Poster

    The ATLAS experiment at the LHC utilizes complex multicomponent distributed systems for processing (PanDA WMS) and managing (Rucio) data. The complexity of the relationships between components, the amount of data being processed and the continuous development of new functionality of the critical systems are the main challenges to consider when creating monitoring and accounting tools able...

  456. Posada Trobo, Ismael (CERN)
    Poster
    Poster

    The web hosting infrastructure at CERN provides a range of solutions fulfilling the diverse needs of the community, from simple/static sites to content management systems (Drupal) to complex web applications (Platform-as-a-Service).
    An ambitious plan to modernize the web hosting infrastructure started in 2019, aiming at consolidating the various hosting technologies on a shared multi-tenant...

  457. Mr Jones, Ben (CERN)
    Poster
    Poster

    LHC@home was launched as a BOINC project in 2004 as an outreach project for CERN's 50th anniversary. Initially focused on the accelerator physics simulation code SixTrack, the project was expanded in 2011 to run other physics simulation codes on Linux thanks to virtualisation. Later on, the experiment and theory applications running on the LHC@home platform evolved to use containers...

  458. David, Claire (York University (CA) / FNAL (US))
    Track 8 - Collaboration, Reinterpretation, Outreach and Education
    Poster

    Documentation on all things computing is vital for an evolving collaboration of scientists, technicians, and students. Using the MediaWiki software as the framework, a searchable knowledge base is provided via a secure web interface to the large cadre of colleagues working on the DUNE far and near detectors. Organization information, links relevant to working groups, operations and...

  459. McKee, Shawn (University of Michigan Physics)
    Poster
    Poster

    WLCG relies on the network as a critical part of its infrastructure and therefore needs to guarantee effective network usage and prompt detection and resolution of any network issues, including connection failures, congestion and traffic routing. The OSG Networking Area, in partnership with the WLCG Throughput working group, has created a monitoring infrastructure that gathers metrics from the...

  460. Dr Giommi, Luca (INFN CNAF)
    Poster
    Poster

    Anomaly detection in data center IT and physical infrastructures is challenging due to the amount of heterogeneous data to be analyzed. These data include, among others, CPU and memory consumption, network traffic, cooling and electrical states. Defining a solution that identifies unexpected anomalies early is particularly important to prevent data losses, breakdown of the system, and any...

  461. Anderson, Tyler (Stanford)
    Poster
    Poster

    Characterizing backgrounds is an extremely important task for any dark matter search, and almost every new detector encounters unexpected backgrounds. Such was the case in LZ's first science run, in which several detector effects contributed novel backgrounds. Where the new background events exhibited abnormal PMT waveforms, these features allowed them to be identified using pulse shape...

  462. Byrne, Thomas (UKRI - STFC)
    Poster
    Poster

    ANTARES (A New Tape ARchive for STFC) is the new tape archive service at RAL Tier-1 that went into production on 4 March 2022. The service is built around the EOS/CTA technologies developed at CERN. EOS is the user-facing service that manages the incoming namespace requests and a thin SSD buffer, and CTA is deployed as the tape back-end system. In this talk, we describe the setup of...

  463. Chen, Zhengyuan (Institute of High Energy Physics Chinese Academy of Sciences)
    Poster
    Poster

    Particle identification is an important ingredient to particle physics experiments. Distinguishing the charged hadrons (pions, kaons, protons and their antiparticles) is often crucial, in particular for hadronic decays which could be studied with an efficient particle identification to obtain a desirable signal-to-background ratio. An optimal performance of particle identification in a large...

  464. JIANG, Zhixing
    Poster
    Poster

    During the upcoming High-Luminosity Large Hadron Collider (HL-LHC) upgrade of ATLAS, a new global trigger subsystem will be installed into the L0 Trigger. New and improved hardware and algorithms will be deployed during the upgrade to increase the performance of the trigger system. The global trigger subsystem consists of various components, including the FPGA-based Global Event Processor...

  465. Smirnova, Oxana (Lund University)
    Poster
    Poster

    NorduGrid ARC is widely used today in the Worldwide LHC Computing Grid as one of the two recommended middlewares connecting the grid sites. It has served the WLCG and the High Energy Physics communities very well since its birth in 2002. Now, though, ARC aims to reach more communities outside of HEP, for instance in the fields of Bioinformatics, Astrophysics and Climate Research, to...

  466. Garg, Rocky (Stanford University)
    Poster
    Poster

    The ATLAS software tutorial is a centrally organized suite of educational materials that helps to prepare newcomers for work in ATLAS. The broad objective is to familiarize participants with the basic skills needed to accomplish data analysis tasks, with a strong focus on software tools. In this talk, we will outline the recent changes to the ATLAS software tutorial to follow a project-based...

  467. Bourilkov, Dimitri (University of Florida), Kim, Bockjoo (University of Florida)
    Poster
    Poster

    Modern large distributed computing systems produce large amounts of monitoring data. In order for these systems to operate smoothly, under-performing or failing components have to be identified quickly, and preferably automatically, enabling the system managers to react accordingly.
    In this contribution, we analyze job and data transfer data collected in the running of the LHC computing...

  468. Whitehouse, Dan (Imperial College)
    Poster
    Poster

    Imperial College London hosts a large Tier-2 WLCG grid site based around an HTCondor batch system; additionally it provides cloud computing facilities using OpenStack to non-WLCG activities. These cloud resources are open to opportunistic usage, provided the impact on the primary cloud users remains low.
    In common with most Tier 2 sites we see constant job pressure from the WLCG VOs, while the...

  469. Moneta, Lorenzo (CERN)
    Poster
    Poster

    Through its TMVA package, ROOT provides and connects to machine learning tools for data analysis at HEP experiments and beyond. In addition, through its powerful I/O system and RDataFrame analysis tools, ROOT provides the capability to efficiently select and query input data from large data sets as typically used in HEP analysis. At the same time, several Machine Learning tools exist...

  470. Raduta, George (CERN)
    Poster
    Poster

    The ALICE experiment at CERN's Large Hadron Collider is designed to study the physics of strongly interacting matter at extreme energy densities. After a major upgrade, the new ALICE Online-Offline computational system expects to read an estimated throughput of 3.5 TB/s of raw data and store and index 900 Gb/s of reconstructed data. The complexity of this endeavour implies the need for a...

  471. Ren, Yihui (Brookhaven National Laboratory)
    Poster
    Poster

    Many large-scale physics experiments, such as ATLAS at the Large Hadron Collider, the Deep Underground Neutrino Experiment and sPHENIX at the Relativistic Heavy Ion Collider, rely on accurate simulations to inform data analysis and derive scientific results. Their inevitable inaccuracies may be detected and corrected using heuristics in a conventional analysis workflow.
    However, residual errors...

  472. Currie, Robert (University of Edinburgh)
    Poster
    Poster

    Software development projects at Edinburgh identified a desire to build and manage our own monitoring platform. This better allows us to support the developing and varied physics and computing interests of our Experimental Particle Physics group. This production platform enables oversight of international experimental data management, local software development projects and active monitoring...

  473. Gonzalez, Hugo (CERN)
    Poster
    Poster

    The CERN IT Storage group ensures the coherent design, development, operation and evolution of storage and data management services at CERN for all aspects of physics, user and project data and general needs of the Laboratory. CERNBox is one of the services that actively contributes to this objective.

    CERNBox is a cloud collaboration platform providing storage space (18PB) to users (37K...

  474. Jones, Christopher (Fermilab)
    Poster
    Poster

    Today the LHC offline computing relies heavily on CPU resources, despite the interest in compute accelerators, such as GPUs, for the longer term future. The number of cores per CPU socket has continued to increase steadily, reaching the levels of 64 cores (128 threads) with recent AMD EPYC processors, and 128 cores on Ampere Altra Max ARM processors. Over the course of the past decade, the CMS...

  475. Gonzalez De La Hoz, Santiago (IFIC - Univ. of Valencia and CSIC (ES))
    Poster
    Poster

    The ATLAS Spanish Tier-1 and Tier-2s have more than 18 years of experience in the deployment and development of LHC computing components and their successful operations. The sites are actively participating in, and even coordinating, R&D computing activities in LHC Run 3 and developing the computing models needed in the HL-LHC period.

    In this contribution, we present details on the...

  476. King, Sophie
    Poster
    Poster

    Hyper-Kamiokande is a next generation underground water Cherenkov detector, with the primary goal of constraining CP-violation in the lepton sector. It has a rich science programme that includes neutrino oscillation studies, neutrino astrophysics and searches for BSM physics including proton decay. Based on the experience of its predecessor Super-Kamiokande, it has a total (fiducial) volume...

  477. Ratnikov, Fedor (HSE University)
    Poster
    Poster

    High energy physics experiments rely heavily on MC simulation of the data used to extract physics results. However, detailed simulation often requires a tremendous amount of computational resources.

    Using Generative Adversarial Networks and other deep learning generative techniques can drastically speed up the computationally heavy simulations like a simulation of the calorimeter...

  478. Schnepf, Matthias (Karlsruhe Institute of Technology)
    Poster
    Poster

    Most common CPU architectures provide simultaneous multithreading (SMT). With SMT, the operating system sees two logical cores per physical core and can schedule two processes to one physical CPU core. This overbooking of physical cores enables better usage of parallel pipelines and doubled components within a CPU core. On systems with several applications running in parallel, such as...
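
    The relation between logical and physical cores can be made concrete with a short sketch. It assumes two hardware threads per physical core, as described above; the helper name is hypothetical, and the count reported by the operating system on a given machine may differ if SMT is disabled.

```python
import os


def physical_core_estimate(logical_cores, threads_per_core=2):
    """Estimate physical cores from the logical cores the OS reports."""
    return logical_cores // threads_per_core


# os.cpu_count() reports logical cores and may return None.
logical = os.cpu_count() or 1
print(f"logical cores seen by the OS: {logical}")
print(f"estimated physical cores:     {physical_core_estimate(logical)}")
```

    A batch system that offers one job slot per logical core therefore schedules up to two processes onto each physical core.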

  479. Dossett, David (University of Melbourne)
    Poster
    Poster

    Until now the grid storage at Melbourne was provided by the DPM storage system which is now reaching the end of support, as well as the end of the disk lifetimes. Continuing to provide grid storage for the ATLAS and Belle II experiments requires that we move to a new solution; one that can be supported long term with minimal manpower by taking advantage of existing resources at Melbourne. Over...

  480. Paparrigopoulos, Panos (CERN)
    Poster
    Poster

    WLCG is a large heterogeneous computing infrastructure which provides resources for the LHC experiments. The Computing Resource Information Catalogue (CRIC) has been designed as the source for the WLCG topology information. CRIC aims to describe WLCG distributed sites, their services and how the provided resources are used by the LHC experiments. The CRIC instance dedicated to WLCG has become...

  481. Chudoba, Jiri
    Poster
    Poster

    Czech e-infrastructure project e-INFRA CZ is the only project for ICT services defined on the Road Map of the Czech Republic for Large Infrastructures for Research, Experimental Development and Innovations. It was created by 3 national e-infrastructures: CESNET, CERIT-SC and IT4Innovations. The project provides ICT services mostly for Czech universities and research institutions. High energy...

  482. Mrs Sahakyan, Marina (DESY)
    Poster
    Poster

    The ever-increasing amount of data that is produced by modern scientific facilities like EuXFEL or LHC puts high pressure on the data management infrastructure at the laboratories. This includes poorly shareable resources of archival storage, typically tape libraries. To achieve maximal efficiency of the available tape resources, a deep integration between hardware and software components...

  483. Gavalian, Gagik (Jefferson Lab)
    Poster
    Poster

    Modern Nuclear Physics experimental setups run experiments with higher beam intensity, resulting in increased noise in detector components used for particle track reconstruction. Increased uncorrelated signals (noise) result in decreased particle reconstruction efficiency.
    In this work, we investigate the usage of Machine Learning, specifically Convolutional Neural Network Auto-Encoders...

  484. Ratnikov, Fedor (HSE University)
    Poster
    Poster

    The aim of the LHCb Upgrade II at the LHC is to operate at a luminosity of 1.5 x 10^34 cm^-2 s^-1 to collect a data set of 300 fb^-1. This will require a substantial modification of the current LHCb ECAL due to high radiation doses in the central region and increased particle densities.
    Advanced detector R&D for both new and ongoing experiments in HEP requires performing computationally intensive...

  485. Salt Cairols, José Francisco (Instituto de Física Corpuscular), Gonzalez De La Hoz, Santiago (IFIC - Univ. of Valencia and CSIC (ES))
    Poster
    Poster

    ML/DL techniques have shown their power in the improvement of several studies and tasks in HEP, especially in physics analysis. Our approach has been to take a number of the ML/DL tools provided by several Open Source platforms and apply them to several classification problems, for instance, to the ttbar resonance extraction in LHC experiments.
    A comparison has been made...

  486. Scham, Moritz (Deutsches Elektronen-Synchrotron (DESY))
    Poster
    Poster

    In High Energy Physics, detailed and time-consuming simulations are used for particle interactions with detectors. To bypass these simulations with a generative model, the generation of large point clouds in a short time is required, while the complex dependencies between the particles must be correctly modeled. Particle showers are inherently tree-based processes, as each particle is produced...

  487. Mr Da Costa Cardoso, Renato Paulo (CERN)
    Poster
    Poster

    With the extended usage of machine learning models, more and more complex algorithms are being studied. On the one hand, the development and optimisation processes become more challenging; on the other hand, studies of model generalisation and re-usability become interesting. In this context, efficient, flexible ways to track continuous changes during development, as well as relevant...

  488. Appleyard, Rob (UKRI STFC)
    Poster
    Poster

    The RAL Scientific Computing Department provides support for several large experimental facilities. These include, among others, the ISIS neutron spallation source, the Diamond X-Ray Synchrotron, the Rosalind Franklin Institute, and the RAL Central Laser Facility. We use a number of Ceph storage clusters to support the diverse requirements of these users.

    These include Deneb, a...

  489. Mr Lambert, Fabian (CNRS/LPSC), Dr Odier, Jérôme (CNRS/LPSC)
    Poster
    Poster

    ATLAS Metadata Interface (AMI) is a generic ecosystem for metadata aggregation, transformation and cataloging. Benefiting from more than 20 years of feedback in the LHC context, the second major version was released in 2018. This poster describes how a renewed architecture and integration with modern technologies ease the usage and deployment of a complete AMI stack. It describes how to deploy...

  490. Gallas, Elizabeth (Oxford)
    Poster
    Poster

    The ATLAS EventIndex is the global catalogue of all ATLAS real and simulated events. During the LHC long shutdown between Run 2 (2015-2018) and Run 3 (2022-2025) all its components were substantially revised and a new system was deployed for the start of Run 3 in Spring 2022. The new core storage system, based on HBase tables with a Phoenix interface rather than HDFS MapFiles, allows much...

  491. Fu, Shiyuan (IHEP)
    Poster
    Poster

    Based on a Kubernetes (k8s) cluster, the High Energy Photon Source (HEPS) computing platform creates a container computing environment to provide analysis services for users. The computing platform provides a container-based data analysis environment built on JupyterLab, with the JupyterHub web page as the entry point. The platform uses CVMFS to store the software library, and the container environment accesses the...

  492. Watts, Gordon (University of Washington)
    Poster
    Poster

    Differentiable Programming could open even more doors in HEP analysis and computing to Artificial Intelligence/Machine Learning. Current common uses of AI/ML in HEP are deep learning networks – providing us with sophisticated ways of separating signal from background, classifying physics, etc. This is only one part of a full analysis – normally skims are made to reduce dataset sizes by...

  493. Chen, Yifan (SLAC National Accelerator Laboratory)
    Poster
    Poster

    Liquid argon time projection chambers (TPCs) are widely used in particle detection. High quality physics simulators have been developed for such detectors in a variety of experiments, and the resulting simulations are used to aid in reconstruction and analysis of collected data. However, the degree to which these simulations are reflective of real data is limited by the knowledge of the...

  494. Ratnikov, Fedor (HSE University)
    Poster
    Poster

    High-precision modeling of systems is one of the main areas of industrial data analysis today. Models of the systems, their digital twins, are used to predict their behavior under various conditions. We have developed a digital twin of a data storage system using generative models of machine learning. The system consists of several types of components: HDD and SSD disks, disk pools with...

  495. Liu, Jianli (Institute of High Energy Physics Chinese Academy of Sciences)
    Track 11 - Heterogeneous Computing and Accelerators
    Poster

    As a conventional ptychography iterative reconstruction method, Difference Map (DM) is suitable for computing large datasets of X-ray diffraction patterns on the multi-GPU heterogeneous system at HEPS (High Energy Photon Source). However, it assumes that the GPU memory is large enough to run CUDA FFT/IFFT on all the patterns transferred from RAM to the GPU. Meanwhile, the intermediate data during...

  496. Schnepf, Matthias (Karlsruhe Institute of Technology)
    Poster
    Poster

    Providing a high-performance and reliable tape storage system is GridKa's top priority. The GridKa tape storage system was recently migrated from IBM SP to the High Performance Storage System (HPSS) for LHC and non-LHC HEP experiments. These are two different tape backends, and each has its own design and specifics that need to be studied and understood very deeply. Taking into account the features...

  497. Chudasama, Ruchi
    Poster
    Poster

    Deep learning techniques have been proven to provide excellent performance for a variety of high energy physics applications, such as particle identification, event reconstruction and trigger operations. Recently, we developed an end-to-end deep learning approach to identify various particles using low-level detector information from high energy collisions. These models will be incorporated in...

  498. Dr Simili, Emanuele (University of Glasgow)
    Poster
    Poster

    We present a case for ARM chips as an alternative to standard x86 CPUs at WLCG sites, as we observed better performance and lower power consumption in a number of benchmarks based on actual HEP workloads, from ATLAS simulation and reconstruction tasks to the most recent HEP-Score containerised jobs.

    To support our case, we present novel measurements on the performance and energy...

  499. Prof. Vilasís-Cardona, Xavier (La Salle URL)
    Poster
    Poster

    Starting this year, the upgraded LHCb detector is collecting data with a pure software trigger. In its first stage, reducing the rate from 30 MHz to about 1 MHz, GPUs are used to reconstruct and trigger on B and D meson topologies and high-pT objects in the event. In its second stage, a CPU farm is used to reconstruct the full event and perform candidate selections, which are persisted for...

  500. Misa Moreira, Carmen (CERN)
    Poster
    Poster

    The LHCOPN network, which links CERN to all the WLCG Tier1s,
    and the LHCONE network, which connects WLCG Tier1s and Tier2s, have
    successfully provided the necessary bandwidth for the distribution of the data
    generated by the LHC experiments during the first two runs of the LHC accelerator.
    We give here an overview of the most significant achievements and
    the current state of the two...

  501. Misawa, Shigeki (Brookhaven National Laboratory)
    Poster
    Poster

    Increases in data volumes are forcing high-energy and nuclear physics experiments to store more frequently accessed data on tape. Extracting the maximum performance from tape drives is critical to make this viable from a data availability and system cost standpoint. The nature of data ingest and retrieval in an experimental physics environment makes achieving high access performance difficult...

  502. Currie, Robert (University of Edinburgh)
    Poster
    Poster

    XRootD servers are commonplace in many parts of HEP data management and are a key component of data access and management strategies in both the WLCG and OSG. Deployments of XRootD instances across the UK have demonstrated the versatility and expandability of this data management software. As we become more reliant on these services, there is a requirement to collect low-level metrics to...

  503. Carnesale, Maria
    Poster
    Poster

    Track finding in high-density environments is a key challenge for experiments at modern
    accelerators. In this presentation we describe the performance obtained running
    machine learning models studied for the ATLAS Muon High Level Trigger. These models
    are designed for hit position reconstruction and track pattern recognition with a
    tracking detector, on a commercially available Xilinx...

  504. Bilodeau, Zoë (Skidmore College)
    Poster
    Poster

    In experiments with a noble liquid time-projection chamber there are arrays of photosensors positioned to allow for inference of the locations of interactions within the detector. If there is a gap in data left by a broken or saturated photosensor, inference of the position is less precise and less accurate. As it is not practical to repair or replace photosensors once the experiment has...

  505. Fang, Wenxing
    Poster
    Poster

    The Jiangmen Underground Neutrino Observatory (JUNO) experiment will start data taking in 2023. It aims to determine the neutrino mass ordering (NMO) with 3 sigma significance within 6 years. To achieve this goal, the backgrounds should be suppressed as much as possible. The cosmic-ray muon is one of the most important backgrounds for the NMO study and should be treated carefully. Due to the...

  506. Kahn, Abraham
    Poster
    Poster

    The upgrade of the ATLAS Trigger and Data Acquisition system must cope with
    the high pileup environment of the High-Luminosity Large Hadron Collider (HL-LHC).
    The new Event Filter Tracking system is a flexible, heterogeneous commercial system
    consisting of CPU cores and possibly accelerators (e.g., FPGAs or GPGPUs) to perform
    the compute-intensive Inner Tracker charged particle...

  507. Jiang, Yi (University of Chinese Academy of Sciences)
    Poster
    Poster

    TF-PWA is a general framework for partial wave analysis developed based on TensorFlow 2. Partial wave analysis is a powerful method to determine the 4-momentum distribution of multi-body decay final states and to extract the internal information of interest. Based on a simple topological representation, TF-PWA can deal with most of the processes of partial wave analysis automatically and...

  508. Dr Fornari, Federico (INFN-CNAF)
    Poster
    Poster

    INFN-CNAF is one of the Worldwide LHC Computing Grid (WLCG) Tier-1 data centers, providing computing, networking and storage resources to a wide variety of scientific collaborations, not limited to the four LHC experiments. The INFN-CNAF data center will move to a new location next year. At the same time, the requirements from our experiments and users are becoming increasingly challenging and...

  509. Ratnikov, Fedor (HSE University)
    Poster
    Poster

    The compact, highly granular time-of-flight neutron detector is designed for the fixed-target BM@N experiment at Nuclotron (JINR). This detector aims to measure the anisotropy of azimuthal neutron flows, which are sensitive to the equation of state of dense nuclear matter. A graph neural network (GNN) method is proposed to reconstruct fast neutrons with energies up to a few GeV produced in...

  510. Reuter, Lea (Karlsruhe Institute of Technology)
    Poster
    Poster

    Decay products of long-lived particles are an important signature in dark sector searches in collider experiments. The current Belle II tracking algorithm is optimized for tracks originating from the interaction point at the cost of a lower track finding efficiency for displaced tracks. This is especially the case for low-momentum displaced tracks that are crucial for dark sector searches in...

  511. Bagnasco, Stefano (Istituto Nazionale di Fisica Nucleare)
    Poster
    Poster

    Multi-messenger astrophysics provides valuable insights into the properties of the physical Universe. These insights arise from the complementary information carried by photons, gravitational waves, neutrinos and cosmic rays about individual cosmic sources and source populations.
    When a gravitational wave (GW) candidate is identified by the LIGO, Virgo and KAGRA (LVK) observatory network, an...

  512. Sanchez, Luis (Rice University)
    Poster
    Poster

    Many data-intensive experiments need to track large amounts of data, large code bases to process the data, and configuration data (or metadata). All of these datasets have unique requirements for version control. Furthermore, these datasets must have rules and requirements about how they can interact with the code. For this it is crucial to make the storage of data, the metadata in particular...

  513. Salt Cairols, José Francisco (Instituto de Física Corpuscular)
    Poster
    Poster

    The ATLAS EventIndex is the global catalogue of all ATLAS real and simulated events. During the LHC long shutdown between Run 2 (2015-2018) and Run 3 (2022-2025) its components were substantially revised, and a new system has been deployed for the start of Run 3 in Spring 2022. The new core storage system is based on HBase tables with a Phoenix interface. It allows faster data ingestion rates...

  514. Menéndez Borge, Gonzalo (CERN)
    Poster
    Poster

    When dealing with benchmarking, result collection and sharing is as essential as the production of the data itself. A central source of information allows for data comparison, visualization, and analysis, which the community both contributes to and profits from.

    While this is the case in other fields, in the High Energy Physics (HEP) benchmarking community both script and result sharing...

  515. Fu, Shiyuan (IHEP)
    Poster
    Poster

    Quantum computing technology is developing rapidly and has attracted increasing attention and interest from governments and institutions around the world. Although still in the NISQ era, quantum computing has begun to play a notable role in more and more fields such as logistics scheduling, quantum finance, and quantum chemistry, thanks to computing power far superior to that of classical computers. And...

  516. Suehara, Taikan (Kyushu University)
    Poster
    Poster

    There is emerging interest in e+e- Higgs factories in the context of the FCC-ee feasibility study in Europe, the ILC in Japan, and new proposals (C3, HELEN) in the US. As the original developers of LCFIPlus, a widely used software package for jet analysis including flavor tagging for e+e- linear collider studies (published in NIMA and arXiv:1506.08371), we are now developing DNN-based flavor tagging...

  517. Hess, Bryan (Jefferson Lab)
    Poster
    Poster

    Jefferson Lab has developed and operated Jasmine, its tape storage system, since 1999. Over the life of the system, design iterations have benefitted from advances in disk and network performance. Now, as large-scale data processing is increasingly performed on distributed infrastructure using community supported software tools, we detail how Jasmine’s software strategy has been refocused....

  518. Dr Pérez-Calero Yzquierdo, Antonio (CIEMAT - PIC)
    Poster
    Poster

    The computing resource needs of LHC experiments, such as CMS, are expected to continue growing significantly over the next decade, during the Run 3 and especially the HL-LHC era. Additionally, the landscape of available resources will evolve, as HPC (and Cloud) resources will provide a comparable, or even dominant, fraction of the total capacity, in contrast with the current situation,...

  519. Schultz, David (IceCube, University of Wisconsin-Madison), Sfiligoi, Igor
    Poster
    Poster

    The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope located at the geographic South Pole. Understanding detector systematic effects is a continuous process. This requires the Monte Carlo simulation to be updated periodically to quantify potential changes and improvements in science results with more detailed modeling of the systematic effects. IceCube’s largest systematic...

  520. Schultz, David (IceCube, University of Wisconsin-Madison)
    Poster
    Poster

    The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope located at the geographic South Pole. IceCube recently completed a multi-year user management migration from a manual LDAP-based process to an automated and integrated system built on Keycloak. A custom interface allows lead scientists to register and edit their institution’s user accounts, including group memberships...

  521. Haslbeck, Florian (University of Oxford, CERN)
    Poster
    Poster

    The complexity of the ATLAS detector and its infrastructure requires an excellent understanding of the interdependencies between the components when identifying points of failure (POF). The ATLAS Technical Coordination Expert System features a graph-based inference engine that identifies a POF provided a list of faulty elements or Detector Safety System (DSS) alarms. However, the current...

  522. Martyniak, Janusz (Imperial College London, UK)
    Poster
    Poster

    DIRAC is a widely used “framework for distributed computing”. It works by building a layer between the users and the resources offering a common interface to a number of heterogeneous providers. DIRAC, like many other workload management systems, uses pilot jobs to check and configure the worker-node environment before fetching a user payload. The pilot also records a number of different...

  523. Li, Ivy (Rice University)
    Poster
    Poster

    XENONnT is a dual-phase time projection chamber with a target of 6 tonnes of ultra-pure liquid xenon which aims to search for dark matter via its scattering process. In XENONnT, incoming particles scatter and excite the target xenon atoms, resulting in an initial scintillation signal and an ionization reaction. The freed electrons from the ionization cause a second, larger ionization signal at...

  524. Sinisi, Francesco (INFN-CNAF)
    Poster
    Poster

    Having a long tradition in state-of-the-art distributed IT technologies, from the first small clusters to Grid and Cloud-based computing, in the last couple of years INFN made available to its users INFN Cloud: an easy to use, distributed, user-centric cloud infrastructure and services portfolio targeted to scientific communities.

    Given the distributed nature of the infrastructure, the...

  525. Reme-Ness, Haakon André (Western Norway University of Applied Sciences)
    Poster
    Poster

    This contribution introduces the job optimizer service for the
    next-generation ALICE grid middleware, JAliEn (Java ALICE Environment).
    It is a continuous service running on central machines and is essentially
    responsible for splitting jobs into subjobs, to then be distributed and
    executed on the ALICE grid. There are several ways of creating subjobs
    based on various strategies relevant...

  526. Michelotto, Diego (INFN)
    Poster
    Poster

    INFN-CNAF is one of the Worldwide LHC Computing Grid (WLCG) Tier-1 data centers, providing computing, networking and storage resources also to a wide variety of scientific collaborations, ranging from physics to bioinformatics and industrial engineering.
    Due to the massive adoption of containerised services and the increasing efficiency for application deployment of the recent virtualization...

  527. Gavalian, Gagik (Jefferson Lab)
    Poster
    Poster

    With ever-increasing data rates in modern physics experiments there is a need to
    partially analyze incoming data in real time. Increasing data volumes create a need
    to filter data to write out only events that are useful for physics analysis.
    In this work we present a Level-3 trigger developed using Artificial Intelligence
    to select electron trigger events at data acquisition level....

  528. Qi, Binbin (University of Science and Technology of China)
    Poster
    Poster

    The Super Tau-Charm Facility (STCF) is a future electron-positron collider proposed in China. It has a peak luminosity above $0.5 \times 10^{35}$ cm$^{-2}$ s$^{-1}$ and a center-of-mass energy ranging from 2 to 7 GeV. A time-of-flight detector based on the detection of internally reflected Cherenkov light (DTOF) is proposed for the endcap particle identification (PID) at the STCF. In this contribution, we...

  529. Dubey, Shawn (University of Hawaii at Manoa)
    Poster
    Poster

    In this work, we report the status of a neural network regression model trained to extract new physics (NP) parameters in Monte Carlo (MC) data. We utilize a new EvtGen NP MC generator to generate $B \rightarrow K^{*} \ell^{+} \ell^{-}$ events according to the deviation of the Wilson Coefficient $C_{9}$ from its SM value, $\delta C_{9}$. We train a convolutional neural network regression...

  530. Pardi, Silvio (INFN)
    Poster
    Poster

    The usage of the WebDAV protocol has become more and more popular within the physics experiments using grid middleware in the last decade, and today it represents a valid alternative to GridFTP, which is currently supported at a best-effort level after the retirement of the Globus Toolkit.
    The Belle II experiment established the adoption of the WebDAV protocol as the main protocol for data access and...

  531. Mr Niewenhuis, Dante (University of Amsterdam)
    Poster
    Poster

    The ROOT RNTuple I/O subsystem has been designed to address performance bottlenecks and shortcomings of ROOT's current state-of-the-art TTree I/O subsystem. RNTuple provides a backwards-incompatible redesign of the TTree binary format and API that evolves the ROOT event data I/O for the challenges of the upcoming decades. It has been engineered for high performance on modern storage hardware, a...

  532. Wiebalck, Arne (CERN)
    Poster
    Poster

    More than 500 servers are actively managed by the Windows Infrastructure team on the CERN site. They run critical services for the laboratory, from controlling some of the accelerator systems to directory services, storage and databases. Having visibility of their state at all times is critical for the smooth operation of their services. This presentation will describe the implementation of an...

  533. Israel, Scott (Boston University)
    Poster
    Poster

    The Mu2e experiment will search for the neutrino-less conversion of muons to electrons in muonic aluminum. The Mu2e tracker detector measures the momentum of signal and background particles traveling down the beamline. One major background source is the $\mu\rightarrow e \bar{\nu} \nu$ decay of muons in orbit (DIO) process. Due to resolution and algorithm errors during reconstruction, these...

  534. Ratnikov, Fedor (HSE University)
    Poster
    Poster

    SHiP (Search for Hidden Particles) and the associated CERN SPS Beam Dump Facility is a new general-purpose experiment proposed at the SPS to search for "hidden" particles which are predicted by many recently elaborated models of Hidden Sector of the Standard Model. The experiment searches for very weakly interacting long-lived particles and crucially depends on effective background...

  535. Hrivnac, Julius (IJCLab)
    Poster
    Poster

    Particle Physics experiments and Astronomy telescopes store large amounts of data; most of those data are usually stored in simple files in various formats. Databases are used only to a limited extent, while database technology has made tremendous progress in recent years, offering a large spectrum of applications in all possible domains. Those possibilities are still largely...

  536. Zhai, Yuncong (Shandong University)
    Poster
    Poster

    Particle identification (PID) is one of the most commonly used tools for physics analysis in collider physics experiments. To achieve good PID performance, information given by multiple sub-detectors is usually combined. This is particularly necessary for the discrimination of charged particles that have close masses (e.g. muon and pion). However, due to the intrinsic correlations...

  537. Roy, Avik (University of Illinois at Urbana-Champaign)
    Poster
    Poster

    Reweighting Monte Carlo (MC) events for alternate benchmarks of beyond standard model (BSM) physics is an effective way to reduce the computational cost of physics searches. However, applicability of reweighting is often constrained by technical limitations. We demonstrate how pre-trained neural networks can be used to obtain fast and reliable reweighting without relying on the full MC...

  538. Liu, Zhenping (Brookhaven National Laboratory)
    Poster
    Poster

    dCache at BNL has been in production for almost two decades. For years, dCache used the default driver included with the dCache software to interface with HPSS tape storage systems. Due to the synchronous nature of this approach and the high resource demands resulting from periodic script invocations, scalability was significantly limited. During the WLCG tape challenges, bottlenecks in dCache...

  539. Jones, Ben (CERN)
    Poster
    Poster

    CERN has been providing central Windows remote desktops via the Windows Terminal Infrastructure service for a number of years, and aims to provide a similar experience for Linux graphical environments. Different communities and experiments offer a series of tools to their users with this goal in mind, but the solutions are far from ideal and generate a support overhead for their respective...

  540. Goldenberg, Steven (Thomas Jefferson National Accelerator Facility)
    Poster
    Poster

    Particle accelerators, such as the Spallation Neutron Source (SNS), require high beam availability in order to maximize scientific discovery. Recently, researchers have made significant progress utilizing machine learning (ML) models to identify anomalies, prevent damage, reduce beam loss, and tune accelerator parameters in real time. In this work, we study the use of uncertainty-aware...

  541. Caffy, Cédric (CERN)
    Poster
    Poster

    The CERN IT Storage group operates multiple distributed storage systems to support all CERN data storage requirements. The storage and distribution of physics data generated by LHC and non-LHC experiments is one of the biggest challenges the group has to take on during LHC Run-3.

    EOS, the CERN distributed disk storage system, is playing a key role in LHC data-taking. During the first ten...

  542. Mohamed, Abdulla (University of Bahrain)
    Poster
    Poster

    During the long shutdown prior to LHC Run 3, the CMS reconstruction software (CMSSW) was upgraded to offload 40% of the High Level Trigger (HLT) processing to GPUs. This upgrade accelerated the reconstruction algorithms and improved the efficiency of the HLT farm; however, it introduced new parameters to the system that had to be selected carefully to maximise performance. When offloading...

  543. Girone, Maria (CERN)
    Poster
    Poster

    In the European Center of Excellence in Exascale Computing "Research on AI- and Simulation-Based Engineering at Exascale" (CoE RAISE), researchers from science and industry develop novel, scalable Artificial Intelligence technologies towards Exascale. In this work, we leverage European High Performance Computing (HPC) and Quantum Computing resources to perform large-scale hyperparameter...

  544. QIAN, Sen (IHEP,CAS)
    Poster
    Poster

    In recent years, some materials from the elpasolite crystal family have been under development for either or both gamma ray and neutron detection. These include Cs2LiYCl6 (CLYC) and Cs2LiLaBr6 (CLLB). Since these crystals have different luminescence decay times under neutron and gamma irradiation, pulse shape discrimination (PSD) is widely used to discriminate between neutron and gamma...
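
    To illustrate how a difference in decay time can be turned into a discriminating variable, the sketch below implements the charge-comparison (tail-to-total) form of PSD. This is one common PSD technique and not necessarily the one used in this contribution; the function names, the `tail_start` index and the threshold are hypothetical, and the pulse is assumed to be baseline-subtracted.

```cpp
#include <cstddef>
#include <numeric>
#include <vector>

// Fraction of the integrated charge that arrives at or after `tail_start`.
// `pulse` is assumed to be baseline-subtracted; `tail_start` is a hypothetical
// sample index marking where the slow component starts to dominate.
double tail_to_total(const std::vector<double>& pulse, std::size_t tail_start) {
    if (tail_start >= pulse.size()) return 0.0;
    const double total = std::accumulate(pulse.begin(), pulse.end(), 0.0);
    if (total <= 0.0) return 0.0;  // empty or non-physical pulse
    const auto tail_begin = pulse.begin() + static_cast<std::ptrdiff_t>(tail_start);
    const double tail = std::accumulate(tail_begin, pulse.end(), 0.0);
    return tail / total;
}

// True when the pulse has a larger slow fraction than `threshold`.
// The threshold, and which particle type lies on which side of it, are
// assumptions that would have to be calibrated for a given crystal.
bool has_large_slow_fraction(const std::vector<double>& pulse,
                             std::size_t tail_start, double threshold) {
    return tail_to_total(pulse, tail_start) > threshold;
}
```

    A pulse that decays slowly puts more of its charge in the tail window and therefore yields a larger ratio than a fast pulse of the same total charge.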

  545. Mohammad Atif, FNU (Brookhaven National Laboratory)
    Poster
    Poster

    OpenMP is a directive-based shared-memory parallel programming model traditionally used for multicore CPUs. In its recent versions, OpenMP was extended to enable GPU computing via its “target offloading” model. The architecture-agnostic compiler directives can in principle offload to multiple types of GPUs and FPGAs, and its compiler support is under active development.

    In this work, we...
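
    As a minimal sketch of what the target offloading model looks like in source code (an illustration only, not code from this contribution), the function below offloads a simple `y = a*x + y` loop. The function name `saxpy` is chosen here for illustration, and the two vectors are assumed to have the same length. Without an offload-capable OpenMP compiler the directive is ignored or falls back to the host, and the loop still produces the same result.

```cpp
#include <cstddef>
#include <vector>

// Computes y[i] = a * x[i] + y[i] for all i.
// Assumes x and y have the same length (not checked here).
void saxpy(float a, const std::vector<float>& x, std::vector<float>& y) {
    const std::size_t n = x.size();
    const float* xp = x.data();
    float* yp = y.data();

    // `target` moves execution to a device; the map clauses copy x to the
    // device and copy y to the device and back. `teams distribute parallel for`
    // spreads the loop iterations over the device's threads.
    #pragma omp target teams distribute parallel for \
        map(to: xp[0:n]) map(tofrom: yp[0:n])
    for (std::size_t i = 0; i < n; ++i) {
        yp[i] = a * xp[i] + yp[i];
    }
}
```

    The same source can be built for different devices by changing only the compiler and its offload flags, which is the portability argument made above.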

  546. Tsulaia, Vakho (LBNL)
    Poster
    Poster

    std::execution::parallel (often shortened to std::par) is a method of executing algorithms in parallel that was introduced in the C++17 standard. It defines a number of execution policies that target various levels of parallelization using threads and vectorization. While originally designed for CPUs, backends that can execute on GPUs have recently been introduced by NVIDIA and Intel. Unlike...
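
    For readers unfamiliar with the feature, the sketch below shows how execution policies are passed to standard algorithms (an illustration only, not code from this contribution). In the C++17 standard the policy objects are spelled `std::execution::seq`, `std::execution::par` and `std::execution::par_unseq` and live in the `<execution>` header; the function names here are hypothetical. Note that some toolchains need an extra threading library at link time for the parallel policies (with GCC's libstdc++ this is typically TBB).

```cpp
#include <algorithm>
#include <execution>
#include <functional>
#include <numeric>
#include <vector>

// Scale every element in place.
// par_unseq permits the implementation to use both threads and vectorization.
void scale_in_place(std::vector<double>& values, double factor) {
    std::for_each(std::execution::par_unseq, values.begin(), values.end(),
                  [factor](double& x) { x *= factor; });
}

// Sum of squares written as a map-reduce.
// par permits the implementation to use multiple threads.
double sum_of_squares(const std::vector<double>& values) {
    return std::transform_reduce(std::execution::par,
                                 values.begin(), values.end(),
                                 0.0, std::plus<>{},
                                 [](double x) { return x * x; });
}
```

    The policy is only a permission: an implementation may still run the algorithm sequentially, so the code must be correct under both serial and parallel execution.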

  547. Cheng, Yaosong (Institute of High Energy Physics Chinese Academy of Sciences)
    Poster
    Poster

    The Wide Field-of-view Cerenkov Telescope Array (WFCTA) is an important component of the Large High Altitude Air Shower Observatory (LHAASO), which aims to measure the individual energy spectra of cosmic rays from ~30 TeV to a couple of EeV. Since the experiment started running in 2020, WFCTA simulation jobs have been running on the Intel x86 cluster last year and only 25% of the first stage...

  548. McKee, Shawn (University of Michigan Physics)
    Poster
    Poster

    During the first WLCG Network Data Challenge in the fall of 2021, some shortcomings were identified in the monitoring that impeded the ability to fully understand the results collected during the data challenge. One of the simplest missing components was site-specific network information, especially information about traffic going into and out of any of the participating sites. Without this...

  549. Hoffmann, Felix
    Poster
    Poster

    The High Luminosity LHC era at CERN will require Monte Carlo simulations to be at an even higher level of accuracy in order for them to be suited for tasks such as background subtraction and filtering of rare events. In order to be able to keep up with the required amount of computing power, volunteer computing approaches can be used to complement existing Grid infrastructure.
    In order to...

  550. Rybkin, Grigori (IJCLab Saclay)
    Poster
    Poster

    The ATLAS EventIndex system consists of the catalogue of all events collected, processed or generated by the ATLAS experiment at the CERN LHC accelerator, and all associated software tools. The new system, developed for LHC Run 3, makes use of Apache HBase - the Hadoop database - and Apache Phoenix - an SQL/relational database layer for HBase - to store and access all the event metadata. The...

  551. Parschau, Mirco
    Poster
    Poster

    The high-interaction-rate, fixed-target experiment HADES at GSI, located
    in Darmstadt, Germany, investigates collisions of heavy-ion, proton
    and secondary pion beams with a target material. Hyperons are one of the
    key observables for both heavy-ion and elementary collisions. The
    challenge is to detect displaced vertices with good accuracy without having
    a dedicated vertex detector,...

  552. Delgado-Mengual, Jordi (PIC - Port d'Informació Científica (IFAE/CIEMAT))
    Poster

    Since 2009 Port d’Informació Científica (PIC), in Barcelona, has hosted the Major Atmospheric Gamma Imaging Cherenkov (MAGIC) Data Center. At the Observatorio del Roque de Los Muchachos (ORM) in the Canary Island of La Palma (Spain), data produced from observations by the 17m diameter MAGIC telescopes are transferred to PIC on a daily basis. More than 200 TB per year are being transferred to...

  553. Fu, Shiyuan (IHEP)
    Poster

    With computing power far superior to that of classical computers, quantum computing has attracted increasing attention and interest all over the world. Many prototypes, such as IBM Osprey and USTC Jiuzhang 2.0, have been constructed to demonstrate the advantage of quantum computing, and corresponding development SDKs have been proposed to make the most of these devices.

    Qiskit is one of the most popular quantum...

  554. Heinrich, Lukas (Max-Planck-Institut für Physik)
    Poster

    In this paper we describe the development of a streamlined framework for large-scale ATLAS pMSSM reinterpretations of LHC Run 2 analyses using containerised computational workflows. The project is looking to assess the global coverage of BSM physics and requires running O(5k) computational workflows representing pMSSM model points. Following ATLAS Analysis Preservation policies, many analyses...

  555. Bhat, Preeti
    Poster

    The efficiency of high energy physics workflows relies on the ability to rapidly transfer data among the sites where the data is processed and analysed. The best data transfer tools should provide a simple and reliable solution for local, regional, national and in some cases intercontinental data transfers. This work outlines the results of data transfer tool tests using internal and external...

  556. Dr Couet, Olivier (CERN)
    Poster

    ROOT users can produce the plots they want and tune them until they are ready for publication. This tuning can be quite invasive in a ROOT macro and represent a fair amount of code. Good practice is to keep these personal tunings in a dedicated macro or in a special style. That is what experienced users do, and they get the nice results they want. New ROOT users or...

  557. Aguado Corman, Asier (CERN)
    Poster

    Researchers are now well accustomed to the convenience of Single Sign-On for their web applications. A significant proportion of researchers' and laboratory staff's daily work, however, is not performed on the web today - to support these workflows we must go beyond the well understood web flows for SAML and OAuth2 and find ways to provision tokens on the command line. This paper will showcase...

  558. Van Buren, Gene (Brookhaven National Laboratory)
    Poster

    Cutting edge research has driven scientists in many fields into the world of big data. While data storage technologies continue to evolve, the costs remain high for rapid data access on such scales and are a major factor in planning and operations. As a joint effort spanning experiment scientists, developers for ROOT, and industrial leaders in data compression, we sought to address this...

  559. Bauer, Daniela (Imperial College London)
    Poster

    DIRAC is a widely used "framework for distributed computing". It works by providing a layer between users and computing resources by offering a common interface to a number of heterogeneous resource providers. DIRAC originally provided support for dynamic workload management on a cloud via its VMDIRAC extension. When the VMDIRAC extension was envisaged, it was common to use commercial clouds...

  560. Ms Cheema, Priyanka (University of Sydney)
    Poster

    The Belle II experiment situated at the SuperKEKB energy-asymmetric $e^+e^-$ collider began operation in 2019. It has since recorded half of the data collected by its predecessor, and reached a world record instantaneous luminosity of $4.7\times 10^{34}$ cm$^{-2}$s$^{-1}$. For distinguishing decays with missing energy from background events at Belle II, the residual calorimeter energy measured...

  561. Walker, Rodney (Ludwig-Maximilians-Universitaet Muenchen)
    Poster

    The European energy crisis during 2022 prompted many computing facilities to take urgent electricity saving measures. These were in part voluntary, in response to EU and national appeals, but also served to keep within a flat energy budget as the electricity price rose. We review some of these measures and, as the situation normalises, take a longer view on how the flexibility of high throughput computing can...

  562. Sfiligoi, Igor
    Poster

    The x86_64 instruction set architecture is not a single, consistent, compatible interface to execute computer programs. Since the initial release in 1999, every new generation has added new instructions, some of which were later removed. Most of these new instructions are intended to improve the performance of those programs which explicitly take advantage of them. However, running such a...

  563. Mrnjavac, Teo (CERN)
    Poster

    The ALICE Experiment at CERN's Large Hadron Collider (LHC) has undergone a major upgrade during LHC Long Shutdown 2 in 2019-2021, which includes a new computing system called O² (Online-Offline). To ensure the efficient operation of the upgraded experiment and of its newly designed computing system, a reliable, high performance, full-featured experiment control system has also been developed...

  564. De Souza Pereira, Jomar Junior (Federal University of Rio de Janeiro, COPPE/EE/IF, Brazil)
    Poster

    With over 2000 active members from 174 institutes in 41 countries, the ALICE experiment is one of the 4 large experiments at CERN. With such numerous interactions, the experiment management needs a way to record members' participation history and their current status, such as employments, institutes, appointments, clusters and funding agencies, as well as to automatically...

  565. Pacheco Pages, Andreu (IFAE Barcelona (ES))
    Poster

    The ATLAS experiment has 18+ years of experience using workload management systems to deploy and develop workflows to process and to simulate data on the distributed computing infrastructure. Simulation, processing and analysis of LHC experiment data require the coordinated work of heterogeneous computing resources. In particular, the ATLAS experiment utilizes the resources of 250 computing...

  566. Mr Litvintsev, Dmitry (Fermilab)
    Poster

    Over the past several years, the dCache collaboration has been working on developing a feature-rich, efficient and scalable data lifecycle and QoS service. With the ever-increasing volume of data anticipated by the LHC and Intensity Frontier experiments, their reliance on tape storage and the limited amount of disk cache available to them, this effort to provide for efficient staging of large...

  567. Leduc, Julien (CERN)
    Poster

    The EOSCTA service is CERN's physics data archival storage infrastructure for Run 3. It entered production at CERN during summer 2020 and has been serving all the LHC and non-LHC data tape workflows. It is a complete redesign of the tape software, tape cache and tape workflows designed for Run 3 rates that introduces scalability changes targeting Run 4. It already set a new record of monthly...

  568. Fanzago, Federica
    Poster

    CloudVeneto is a private cloud based on OpenStack software and targeted at scientific communities. It was designed in 2013 and put into operation one year later, to support INFN projects, mainly HEP ones. Its resources are physically distributed between two sites: the Physics Department of University of Padova-INFN Padova Unit and the INFN Legnaro National Laboratories. During these 10 years...

  569. Litvintsev, Dmitry (Fermilab)
    Poster

    dCache (https://dcache.org) is a highly scalable storage system providing location-independent access to data. The data are stored across multiple data servers as complete files presented to the end-user via a single-rooted namespace. From its inception, dCache has been designed as a caching disk buffer to a tertiary tape storage system with the assumption that the latter has virtually...

  570. Smirnov, Yuri (NIU)
    Poster

    Every sub-detector in the ATLAS experiment at the LHC, including the ATLAS Calorimeter, writes conditions and calibration data into an ORACLE Database (DB). In order to provide an interface for reliable interactions with both the Conditions Online and Offline DBs, a unique semiautomatic web-based application, TileCalibWeb Robot, has been developed. TileCalibWeb is being used in LHC Run 3 by...

  571. Dr Mei, Xinxin (Jefferson Lab)
    Poster

    The development of modern heterogeneous accelerators, such as GPUs, has boosted the prosperity of artificial intelligence (AI). Recent years have seen an increasing popularity of AI for nuclear physics (AI4NP). While most AI4NP studies focus on feasibility analysis, we target their performance on modern GPUs equipped with Tensor Cores.

    We first benchmark the throughput...

  572. Gavalian, Gagik (Jefferson Lab)
    Poster

    This work shows the implementation of Artificial Intelligence models in track reconstruction software for the CLAS12 detector at Jefferson Lab. The Artificial Intelligence-based approach resulted in improved track reconstruction efficiency in high luminosity experimental conditions. The track reconstruction efficiency increased by $10-12\%$ for a single particle, and statistics in...

  573. Lavizzari, Giulia (INFN and University of Milano-Bicocca)
    Poster

    A new methodology to improve the sensitivity to new physics contributions to Standard Model processes at the LHC is presented.

    A Variational AutoEncoder trained on Standard Model processes is used to identify Effective Field Theory contributions as anomalies. While the output of the model is supposed to be very similar to the inputs for Standard Model events, it is expected to deviate...

  574. JIANG, Xiaowei
    Poster

    The token-based certification method is spreading in the distributed computing systems of high energy physics. More and more software and middleware support tokens as one of their certification methods. As an example, WLCG has upgraded all its services to support WLCG tokens. At IHEP (Institute of High Energy Physics in China), the Kerberos token has been used as the main certification method...

  575. Rinaldi, Lorenzo
    Poster

    The GRID computing paradigm adopted by the main HEP experiments is based on the distribution of experimental data on computer resources located all over the world. In general, the data distribution operation is managed centrally by services, such as the CERN File Transfer Service (FTS), which interact with the local Storage Elements. While performing bulk data transfers at such a large scale,...

  576. Dr Odier, Jérôme (CNRS/LPSC), Lambert, Fabian (CNRS/LPSC)
    Poster

    ATLAS Metadata Interface (AMI) is a generic ecosystem for metadata aggregation, transformation and cataloging. Benefiting from more than 20 years of feedback in the LHC context, the second major version was released in 2018. Each sub-system of the stack has recently been improved in order to acquire messaging/telemetry capabilities. This poster describes the whole stack monitoring with the...

  577. Gartung, Patrick (Fermilab)
    Poster

    The CMS experiment has been utilizing vectorization, or SIMD, in parts of its data processing applications for over a decade. On x86 platforms the vectorization level is still SSE3. In the past, attempts to use wider vector instruction sets such as AVX or AVX-512 have, in practice, not resulted in improvements in the overall event processing throughput, because the CPUs scale down their...

  578. Yoo, Changhyun
    Poster

    The JSNS$^2$ (J-PARC Sterile Neutrino Search at the J-PARC Spallation Neutron Source) experiment searches for neutrino oscillations at a 24 m baseline from J-PARC’s 3 GeV 1 MW proton beam incident on a mercury target at the Materials and Life science experimental Facility (MLF). The JSNS$^2$ detector consists of three cylindrical layers including an innermost neutrino target, an intermediate...

  579. Eich, Niclas (RWTH Aachen University)
    Poster

    VISPA (VISual Physics Analysis) realizes a scientific cloud enabling modern scientific data analysis in a web browser. Our local VISPA instance is backed by a small institute cluster and is dedicated to fundamental research and university education. Through hardware upgrades (732 CPU threads, 29 workstation GPUs), we have tailored the cloud services to accomplish both rapid turn-around when...

  580. Ratnikov, Fedor (HSE University)
    Poster

    Particle identification at the Super Charm-Tau factory experiment will be provided by a Focusing multilayer Aerogel Ring Imaging CHerenkov detector (FARICH). Due to hardware constraints, the detector captures a great amount of noise, which must be mitigated to reduce both the data flow and the subsequent storage space.

    In this presentation we present our approach to filtering signal hits. The approach...

  581. Hoeft, Bruno (Karlsruhe Institute of Technology (KIT))
    Poster

    The Worldwide Large Hadron Collider Computing Grid (WLCG) actively pursues the migration from IPv4 to IPv6. For this purpose, the HEPiX-IPv6 working group was founded during the fall HEPiX Conference in 2010. One of the first goals was to categorize the applications running in the WLCG into different groups: the first group was easy to define, because it comprised all...

  582. Mr Caffy, Cedric (CERN)
    Poster

    During the LHC era the XRootD framework has proven to be a critical component of numerous data management and software defined storage solutions (most importantly EOS, the CERN storage technology used for the LHC experiments), and as such grew into one of the most strategic storage technologies in the High Energy Physics (HEP) community. Over the last year significant developments in the area...

  583. Ding, Biao
    Poster

    The Beijing Spectrometer (BESIII) is a detector for hadron and tau-charm physics studies. It is located at the Beijing Electron Positron Collider (BEPCII), which runs at center-of-mass energies of 2.0-5.0 GeV. Zc(3900) was first discovered by the BESIII collaboration in 2013. This exotic resonance structure is considered to be a tetra-quark state, which scientists have long been looking for. In...
