Batch norm re-estimation¶

Context¶

Batch norm re-estimation (BN re-estimation) uses a small subset of training data to re-estimate the statistics of the batch norm (BN) layers in a model. AIMET then folds the BN layers into the preceding convolution or linear layers.

BN re-estimation is recommended under the following conditions:

When batch norm folding (BNF) reduces performance
In models where the main issue is weight quantization
In quantization of depth-wise separable layers, as their batch norm statistics are sensitive to oscillations

API¶

PyTorch

Top-level API

aimet_torch.bn_reestimation.reestimate_bn_stats(model, dataloader, num_batches=100, forward_fn=None)[source]¶

Reestimate BatchNorm statistics (running mean and var).

Parameters:

model (Module) – Model to reestimate the BN stats.
dataloader (DataLoader) – Training dataset.
num_batches (int) – The number of batches to be used for reestimation.
forward_fn (Optional[Callable[[Module, Any], Any]]) – Optional adapter function that performs forward pass given a model and a input batch yielded from the data loader.

Return type:

Handle

Returns:

Handle that undos the effect of BN reestimation upon handle.remove().

TensorFlow

Top-level API

aimet_tensorflow.keras.bn_reestimation.reestimate_bn_stats(model, bn_re_estimation_dataset, bn_num_batches=100)[source]¶

top level api for end user directly call

Parameters:

model (Model) – tf.keras.Model
bn_re_estimation_dataset (DatasetV2) – Training dataset
bn_num_batches (int) – The number of batches to be used for reestimation

Return type:

Handle

Returns:

Handle that undos the effect of BN reestimation upon handle.remove()

aimet_tensorflow.keras.batch_norm_fold.fold_all_batch_norms_to_scale(sim)[source]¶

Fold all batch_norm layers in a model into the quantization scale parameter of the corresponding conv layers

Parameters:: sim (QuantizationSimModel) – QuantizationSimModel to be folded
Return type:: List[Tuple[QcQuantizeWrapper, QcQuantizeWrapper]]
Returns:: A list of pairs of layers [(Conv/Linear, BN layer that got folded)]

ONNX

Not supported.

Batch norm re-estimation¶

Context¶

Workflow¶

Prerequisites¶

Execution¶

Setup¶

Step 1¶

Step 2¶

Step 3¶

Step 4¶

API¶