Quantization user guide

This quantization user guide is organized into the following sections:

Quantization workflow

For overall workflow to quantize model using AIMET toolkit, see the Quantization workflow.

Debugging guidelines

For set of debugging steps to improve the performance of quantized model, see the Debugging guidelines

On-target inference

For instructions on how to deploy quantized model to different target runtimes, see the On-target inference.