AIMET Keras Quantization APIsΒΆ

In order to make full use of AIMET Quantization features, there are several guidelines users are encouraged to follow when defining Keras models. AIMET provides APIs which can automate some of the model definition changes and checks whether AIMET Quantization features can be applied on Keras model.

Users should first invoke Model Preparer API before using any of the AIMET Quantization features.
AIMET Quantization for Keras provides the following functionality
  • Quantization Simulation: Allows ability to simulate inference and training on quantized hardware

  • Adaptive Rounding: Post-training quantization technique to optimize rounding of weight tensors

  • Cross-Layer Equalization: Post-training quantization technique to equalize layer parameters

  • BatchNorm Re-estimation: Quantization-aware training technique to counter potential instability of batchnorm statistics (i.e. running mean and variance)