TY - JOUR
T1 - Parallelized Kalman-Filter-Based Reconstruction of Particle Tracks on Many-Core Processors and GPUs
AU - Cerati, Giuseppe
AU - Elmer, Peter
AU - Krutelyov, Slava
AU - Lantz, Steven
AU - Lefebvre, Matthieu
AU - Masciovecchio, Mario
AU - McDermott, Kevin
AU - Riley, Daniel
AU - Tadel, Matevž
AU - Wittich, Peter
AU - Würthwein, Frank
AU - Yagil, Avi
N1 - Publisher Copyright:
© 2017 The Authors, published by EDP Sciences.
PY - 2017/8/8
Y1 - 2017/8/8
N2 - For over a decade now, physical and energy constraints have limited clock speed improvements in commodity microprocessors. Instead, chipmakers have been pushed into producing lower-power, multi-core processors such as Graphical Processing Units (GPU), ARM CPUs, and Intel MICs. Broad-based efforts from manufacturers and developers have been devoted to making these processors user-friendly enough to perform general computations. However, extracting performance from a larger number of cores, as well as specialized vector or SIMD units, requires special care in algorithm design and code optimization. One of the most computationally challenging problems in high-energy particle experiments is finding and fitting the charged-particle tracks during event reconstruction. This is expected to become by far the dominant problem at the High-Luminosity Large Hadron Collider (HL-LHC), for example. Today the most common track finding methods are those based on the Kalman filter. Experience with Kalman techniques on real tracking detector systems has shown that they are robust and provide high physics performance. This is why they are currently in use at the LHC, both in the trigger and offine. Previously we reported on the significant parallel speedups that resulted from our investigations to adapt Kalman filters to track fitting and track building on Intel Xeon and Xeon Phi. Here, we discuss our progresses toward the understanding of these processors and the new developments to port the Kalman filter to NVIDIA GPUs.
AB - For over a decade now, physical and energy constraints have limited clock speed improvements in commodity microprocessors. Instead, chipmakers have been pushed into producing lower-power, multi-core processors such as Graphical Processing Units (GPU), ARM CPUs, and Intel MICs. Broad-based efforts from manufacturers and developers have been devoted to making these processors user-friendly enough to perform general computations. However, extracting performance from a larger number of cores, as well as specialized vector or SIMD units, requires special care in algorithm design and code optimization. One of the most computationally challenging problems in high-energy particle experiments is finding and fitting the charged-particle tracks during event reconstruction. This is expected to become by far the dominant problem at the High-Luminosity Large Hadron Collider (HL-LHC), for example. Today the most common track finding methods are those based on the Kalman filter. Experience with Kalman techniques on real tracking detector systems has shown that they are robust and provide high physics performance. This is why they are currently in use at the LHC, both in the trigger and offine. Previously we reported on the significant parallel speedups that resulted from our investigations to adapt Kalman filters to track fitting and track building on Intel Xeon and Xeon Phi. Here, we discuss our progresses toward the understanding of these processors and the new developments to port the Kalman filter to NVIDIA GPUs.
UR - http://www.scopus.com/inward/record.url?scp=85028383642&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85028383642&partnerID=8YFLogxK
U2 - 10.1051/epjconf/201715000006
DO - 10.1051/epjconf/201715000006
M3 - Conference article
AN - SCOPUS:85028383642
SN - 2101-6275
VL - 150
JO - EPJ Web of Conferences
JF - EPJ Web of Conferences
M1 - 00006
T2 - Workshop on Connecting the Dots/ Intelligent Trackers, CTD/WIT 2017
Y2 - 6 March 2017 through 9 March 2017
ER -