AMD Optimizing CPU Libraries (AOCL)
… GitHub Repo

AOCL-BLAS
- Performance improvements in S/D/ZGEMM on Zen3/4/5
- SGEMM optimizations for tiny matrices
- New Thread Control APIs with Global and thread-local variants
- Support for OpenMP 2.5 and earlier versions
- Optional support for reproducibility using compiler options

AOCL-Compression
- Refact… …