Indico is back online after maintenance on Tuesday, April 30, 2024.
Please visit Jefferson Lab Event Policies and Guidance before planning your next event: https://www.jlab.org/conference_planning.

May 8 – 12, 2023
Norfolk Waterside Marriott
US/Eastern timezone

High-Throughput Machine Learning Inference with NVIDIA TensorRT

May 11, 2023, 2:00 PM
15m
Marriott Ballroom VII (Norfolk Waterside Marriott)

Marriott Ballroom VII

Norfolk Waterside Marriott

235 East Main Street Norfolk, VA 23510

Speaker

Aaij, Roel (Nikhef National institute for subatomic physics (NL))

Description

The first stage of the LHCb High Level Trigger is implemented as a GPU application. In 2023 it will run on 400 NVIDIA GPUs and its goal is to reduce the rate of incoming data from 5 TB/s to approximately 100 GB/s. A broad scala of reconstruction algorithms is implemented as approximately 70 kernels. Machine Learning algorithms are attractive to further extend the physics reach of the application, but inference must be integrated into the existing GPU application and be very fast to maintain the required throughput of at least 10 GB/s per GPU. We investigate the use of NVIDIA TensorRT for flexible loading of Machine Learning models and fast inference, benchmark its performance for a range of interesting Machine Learning models and compare it to hand-coded implementations where possible.

Consider for long presentation No

Primary author

Dr Sclocco, Alessio (Netherlands eScience Center)

Co-authors

Mr Ali, Suvayu (Netherlands eScience Center) Dr van den Oord, Gijs (Netherlands eScience Center) Dr Cámpora Pérez, Daniel (Maastricht University) Dr de Vries, Jacco (Nikhef and Maastricht University) Aaij, Roel (Nikhef National institute for subatomic physics (NL)) Dr van Veghel, Maarten (Nikhef)

Presentation materials