Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer | NVIDIA Technical Blog
…He is currently focusing on the inference side of the NVIDIA NeMo Framework. Onur holds a Ph.D. in computer engineering from the New Jersey Institute of Technology. His dissertation focused on…
