Migrating CUDA to SYCL Achieved Up to 1.9x Performance Improvement
… The NVIDIA Nsight Systems tool helped identify the performance regression in the SYCL binary running on an NVIDIA GPU. With this regression fixed, the SYCL binary running on an NVIDIA A100 GPU roughly matched the performance of the CUDA binary running on an NVIDIA A100 GPU. …