Batch norm re-estimation

Context

Batch norm re-estimation (BN re-estimation) uses a small subset of training data to re-estimate the statistics of the batch norm (BN) layers in a model. AIMET then folds the BN layers into the preceding convolution or linear layers.

BN re-estimation is recommended under the following conditions:

  • When batch norm folding (BNF) reduces performance

  • In models where the main issue is weight quantization

  • In quantization of depth-wise separable layers, as their batch norm statistics are sensitive to oscillations

Workflow

Prerequisites

To use BN re-estimation, you must:

  • Load a trained model

  • Create a training dataloader for the model

  • Hold off on folding the batch norm layers until after quantization aware training (QAT)

Execution

Setup

import torch
from torchvision.models import mobilenet_v2
from torch.utils.data import DataLoader
from datasets import load_dataset

# General setup that can be changed as needed
device = "cuda:0" if torch.cuda.is_available() else "cpu"
model = mobilenet_v2(pretrained=True).eval().to(device)
batch_size = 32
data = load_dataset("imagenet-1k", streaming=True, split="train")
data_loader = DataLoader(data, batch_size=batch_size, num_workers=4)
dummy_input = torch.randn(1, 3, 224, 224).to(device)
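Note that a streaming Hugging Face dataset yields dictionaries rather than the (image, label) tensor batches that the training loop below expects. A minimal collate sketch that bridges the two; the "image" and "label" keys and the preprocessing transform are assumptions about the dataset, not part of the original example:

from torchvision import transforms

# Standard ImageNet preprocessing; adjust to match the model's training recipe
preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def collate_fn(batch):
    # Each sample is assumed to be a dict with "image" (PIL) and "label" (int)
    images = torch.stack([preprocess(s["image"].convert("RGB")) for s in batch])
    labels = torch.tensor([s["label"] for s in batch])
    return images, labels

data_loader = DataLoader(data, batch_size=batch_size, num_workers=4, collate_fn=collate_fn)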


Step 1

Create the quantization simulation model (QuantizationSimModel).

When creating the QuantizationSimModel, ensure that per-channel quantization is enabled. Update the config file if needed.

from aimet_torch.quantsim import QuantizationSimModel
from aimet_common.quantsim_config.utils import get_path_for_per_channel_config

sim = QuantizationSimModel(model=model, dummy_input=dummy_input, config_file=get_path_for_per_channel_config())
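Before training, the quantizers need initial encodings from a calibration pass over a few batches. A minimal sketch, assuming the callback-style compute_encodings API and the data loader defined above (the batch count of 16 is an arbitrary choice):

def pass_calibration_data(model, num_batches):
    # Run a few batches through the model so the quantizers can observe tensor ranges
    model.eval()
    with torch.no_grad():
        for i, (images, _) in enumerate(data_loader):
            if i >= num_batches:
                break
            model(images.to(device))

sim.compute_encodings(pass_calibration_data, forward_pass_callback_args=16)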


Step 2

Perform quantization-aware training (QAT).

QAT involves training the model for a few additional epochs (usually 15-20). Pay attention to the hyperparameters used during training; QAT is typically run with a smaller learning rate than the original training.

num_epochs = 20
# Optimize the parameters of the simulated model, not the original model
optimizer = torch.optim.SGD(sim.model.parameters(), lr=0.001, momentum=0.9, weight_decay=1e-5)
loss_fn = torch.nn.CrossEntropyLoss()

sim.model.train()
for epoch in range(num_epochs):
    for images, labels in data_loader:
        images, labels = images.to(device), labels.to(device)
        output = sim.model(images)
        loss = loss_fn(output, labels)
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
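The accuracies reported before and after re-estimation come from a standard top-1 evaluation. A minimal sketch of such an evaluation loop; val_loader is a hypothetical validation data loader, not defined in this example:

def evaluate(model, loader, max_batches=None):
    # Compute top-1 accuracy over a data loader (or its first max_batches batches)
    model.eval()
    correct = total = 0
    with torch.no_grad():
        for i, (images, labels) in enumerate(loader):
            if max_batches is not None and i >= max_batches:
                break
            preds = model(images.to(device)).argmax(dim=1)
            correct += (preds == labels.to(device)).sum().item()
            total += labels.numel()
    return correct / total

accuracy = evaluate(sim.model, val_loader)  # val_loader: hypothetical validation loader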

Model accuracy before BN re-estimation: 0.0428


Step 3

Re-estimate the BN statistics and fold the BN layers.

from aimet_torch.bn_reestimation import reestimate_bn_stats
from aimet_torch.batch_norm_fold import fold_all_batch_norms_to_scale

reestimate_bn_stats(sim.model, data_loader)
fold_all_batch_norms_to_scale(sim)

Model accuracy after BN re-estimation: 0.5876
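reestimate_bn_stats returns a handle that can restore the original BN statistics (see the API below). If re-estimation hurts accuracy, the change can be rolled back before folding; a short sketch:

handle = reestimate_bn_stats(sim.model, data_loader, num_batches=100)

# If the re-estimated statistics turn out to be worse, undo them before folding
# handle.remove()

fold_all_batch_norms_to_scale(sim)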


Step 4

If BN re-estimation resulted in satisfactory accuracy, export the model.

path = './'
filename = 'mobilenet'
sim.export(path=path, filename_prefix=filename, dummy_input=dummy_input.cpu())
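The export call writes a copy of the model with quantization operations removed along with a JSON file of quantization encodings; the exact artifacts (for example, an ONNX model) depend on the AIMET version and export options.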


API

Top-level API

aimet_torch.bn_reestimation.reestimate_bn_stats(model, dataloader, num_batches=100, forward_fn=None)

Reestimate BatchNorm statistics (running mean and var).

Parameters:
  • model (Module) – Model whose BN statistics are to be re-estimated.

  • dataloader (DataLoader) – Training dataset.

  • num_batches (int) – The number of batches to be used for reestimation.

  • forward_fn (Optional[Callable[[Module, Any], Any]]) – Optional adapter function that performs a forward pass given a model and an input batch yielded from the data loader.

Return type:

Handle

Returns:

Handle that undoes the effect of BN re-estimation upon handle.remove().
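When the data loader yields something other than a bare input batch, forward_fn adapts it. A minimal sketch, assuming the loader yields (images, labels) tuples as in the setup above:

def forward_fn(model, batch):
    # Unpack the batch from the data loader and run a forward pass;
    # only the inputs are needed for re-estimating BN statistics
    images, _ = batch
    model(images.to(device))

handle = reestimate_bn_stats(sim.model, data_loader, num_batches=100, forward_fn=forward_fn)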
