aimet_torch.seq_mse¶
Top level APIs
- aimet_torch.seq_mse.apply_seq_mse(*args, **kwargs)[source]¶
 Sequentially minimizing activation MSE loss in layer-wise way to decide optimal param quantization encodings.
1 Disable all input/output quantizers, param quantizers of non-supported modules 2 Find and feeze optimal parameter encodings candidate for remaining supported modules 3 Re-enable disabled quantizers from step 1
Example userflow: model = Model().eval() sim = QuantizationSimModel(…) apply_seq_mse(…) sim.compute_encodings(…) [compute encodings for all activations and parameters of non-supported modules] sim.export(…)
NOTE: modules in modules_to_exclude won’t be quantized and skipped when applying sequential MSE.
- Parameters:
 sim – QuantizationSimModel object
data_loader – Data loader
num_candidates – Number of candidate encodings to evaluate for each layer
forward_fn – callback function to perform forward pass given accepts model, inputs
modules_to_exclude – List of supported type module(s) to exclude when applying Sequential MSE
checkpoints_config – Config files to split fp32/quant model by checkpoints to speedup activations sampling
Sequential MSE parameters
- class aimet_torch.seq_mse.SeqMseParams(num_batches, num_candidates=20, inp_symmetry='symqt', loss_fn='mse', forward_fn=<function default_forward_fn>)[source]¶
 Sequential MSE parameters
- Parameters:
 num_batches (
Optional[int]) – Number of batches.num_candidates (
int) – Number of candidates to perform grid search. Default 20.inp_symmetry (
str) – Input symmetry. Available options are ‘asym’, ‘symfp’ and ‘symqt’. Default ‘symqt’.loss_fn (
str) – Loss function. Available options are ‘mse’, ‘l1’ and ‘sqnr’. Default ‘mse’.forward_fn (
Callable) – Optional adapter function that performs forward pass given a model and inputs yielded from the data loader. The function expects model as first argument and inputs to model as second argument.
- forward_fn(inputs)¶
 Default forward function. :type model: :param model: pytorch model :type inputs: :param inputs: model inputs