Quantization user guide
This quantization user guide is organized into the following sections:
Quantization workflow
For the overall workflow for quantizing a model with the AIMET toolkit, see Quantization workflow.
Debugging guidelines
For a set of debugging steps to improve the performance of a quantized model, see Debugging guidelines.
On-target inference
For instructions on deploying a quantized model to different target runtimes, see On-target inference.