26TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY & NUCLEAR PHYSICS (CHEP2023)

Name: 26TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY & NUCLEAR PHYSICS (CHEP2023)
Start: 2023-05-08T08:00:00-04:00
End: 2023-05-12T16:00:00-04:00
Location: Norfolk Waterside Marriott

May 8 – 12, 2023

Norfolk Waterside Marriott

US/Eastern timezone

Conference Secretariat

chep2023-secretariat@jlab.org

High-Throughput Machine Learning Inference with NVIDIA TensorRT

May 11, 2023, 2:00 PM

15m

Marriott Ballroom VII (Norfolk Waterside Marriott)

Marriott Ballroom VII

Norfolk Waterside Marriott

235 East Main Street Norfolk, VA 23510

Oral Track 11 - Heterogeneous Computing and Accelerators Track X - Exascale Science, Heterogeneous Computing and Accelerators, and Quantum Computing

Aaij, Roel (Nikhef National institute for subatomic physics (NL))

The first stage of the LHCb High Level Trigger is implemented as a GPU application. In 2023 it will run on 400 NVIDIA GPUs and its goal is to reduce the rate of incoming data from 5 TB/s to approximately 100 GB/s. A broad scala of reconstruction algorithms is implemented as approximately 70 kernels. Machine Learning algorithms are attractive to further extend the physics reach of the application, but inference must be integrated into the existing GPU application and be very fast to maintain the required throughput of at least 10 GB/s per GPU. We investigate the use of NVIDIA TensorRT for flexible loading of Machine Learning models and fast inference, benchmark its performance for a range of interesting Machine Learning models and compare it to hand-coded implementations where possible.

Consider for long presentation	No

Dr Sclocco, Alessio (Netherlands eScience Center)

Mr Ali, Suvayu (Netherlands eScience Center) Dr van den Oord, Gijs (Netherlands eScience Center) Dr Cámpora Pérez, Daniel (Maastricht University) Dr de Vries, Jacco (Nikhef and Maastricht University) Aaij, Roel (Nikhef National institute for subatomic physics (NL)) Dr van Veghel, Maarten (Nikhef)

chep2023_lhcb_gpu_tensorrt_inference_mvanveghel_v3.pdf

26TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY & NUCLEAR PHYSICS (CHEP2023)

Conference Secretariat

High-Throughput Machine Learning Inference with NVIDIA TensorRT

Marriott Ballroom VII

Norfolk Waterside Marriott

Speaker

Description

Author

Co-authors

Presentation materials

Choose timezone

26TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY & NUCLEAR PHYSICS (CHEP2023)

Conference Secretariat

Speaker

Description

Author

Co-authors

Presentation materials