AIMET ONNX Quantization APIs
AIMET Quantization for ONNX Models provides the following functionality:
- Quantization Simulation API: Simulates inference on quantized hardware
- Cross-Layer Equalization API: Post-training quantization technique that equalizes layer parameters
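
To make the two ideas above concrete, here is a minimal conceptual sketch in plain Python. It does not call AIMET and is not AIMET's implementation; the function names `fake_quantize`, `equalize_pair`, and `forward` are hypothetical and exist only for this illustration. It assumes a per-tensor asymmetric min/max encoding for the simulation step, and two consecutive fully connected layers joined by a ReLU for the equalization step.

```python
import math


def fake_quantize(values, num_bits=8):
    """Quantize then dequantize floats, simulating integer precision in float.

    Assumption: per-tensor asymmetric encoding derived from the min/max of
    `values`, with the range extended to include zero.
    """
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    levels = 2 ** num_bits - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    zero_point = round(-lo / scale)
    out = []
    for v in values:
        q = round(v / scale) + zero_point      # map to the integer grid
        q = max(0, min(levels, q))             # clamp to the representable range
        out.append((q - zero_point) * scale)   # map back to float
    return out


def equalize_pair(w1, b1, w2):
    """Rescale channel i shared by two consecutive layers so that the weight
    range of layer 1's output channel i matches layer 2's input channel i.

    w1: rows are output channels of layer 1; b1: bias of layer 1;
    w2: columns are input channels of layer 2.
    """
    new_w1, new_b1 = [], []
    new_w2 = [row[:] for row in w2]
    for i in range(len(w1)):
        r1 = max(abs(v) for v in w1[i])
        r2 = max(abs(row[i]) for row in w2)
        s = math.sqrt(r1 / r2) if r1 > 0 and r2 > 0 else 1.0
        new_w1.append([v / s for v in w1[i]])  # shrink or grow layer 1 channel
        new_b1.append(b1[i] / s)
        for row in new_w2:
            row[i] *= s                        # compensate in layer 2
    return new_w1, new_b1, new_w2


def forward(w1, b1, w2, x):
    """Reference forward pass: linear -> ReLU -> linear (no second bias)."""
    hidden = [
        max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(w1, b1)
    ]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]
```

Because ReLU satisfies relu(a / s) * s == relu(a) for any positive s, the rescaling in `equalize_pair` leaves the network output unchanged up to floating-point rounding, while giving both layers the same per-channel weight range. That is what makes a single per-tensor quantization grid less lossy after equalization.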