Model Quantization: Post-Training Quantization Using NVIDIA Model Optimizer | NVIDIA Technical Blog
…What is NVIDIA Model Optimizer? The NVIDIA Model Optimizer (ModelOpt) library incorporates state-of-the-art model optimization techniques to compress and accelerate AI models. These techniques include quantization, distillation, pruning, speculative…