AIMET
Version: 2.8.0

aimet_onnx API

AIMET quantization for ONNX models provides the following modules and functions:

  • aimet_onnx.quantsim

  • aimet_onnx.apply_adaround

  • aimet_onnx.apply_seq_mse

  • aimet_onnx.quantsim.set_grouped_blockwise_quantization_for_weights

  • aimet_onnx.batch_norm_fold

  • aimet_onnx.cross_layer_equalization

  • aimet_onnx.mixed_precision

  • aimet_onnx.quant_analyzer

  • aimet_onnx.autoquant

  • aimet_onnx.layer_output_utils

Copyright © 2020, Qualcomm Innovation Center, Inc.