PyTorch Eager Mode Workflow¶

The PyTorch stack supports eager mode execution on Cloud AI accelerators via the torch_qaic package.

The torch_qaic Python package lists operators supported by Cloud AI accelerators.
torch_qaic registers the accelerator with PyTorch.
The JIT runtime enables running operators on Cloud AI hardware.
The operator library is pre-coded and supports multiple NSPs.

The PyTorch eager mode stack facilitates fine-tuning on Cloud AI accelerators. The Qualcomm Efficient Transformers Library provides the necessary infrastructure for fine-tuning models. For more details, refer to Finetune Infra.

Limitations¶

x86 host support only
Python 3.10 only
Torch 2.4.1 only
Precision support: FP32, FP16, Int32, Int64 (assuming data is within Int32 range), Boolean