Tuning Flash Attention for Peak Performance in NVIDIA CUDA Tile | NVIDIA Technical Blog
…His current focus is on AI-driven GPU kernels and next-generation programming models for accelerated computing. His experience spans the full AI stack, from GPU kernel optimization to AI product leadership…