Getting started on AWS instances¶
Qualcomm Cloud AI 100 inference accelerator instances are now available on the AWS Cloud.
DL2q¶
DL2q is an accelerated computing instance powered by Cloud AI 100 Standard accelerator cards that can be used to cost-efficiently deploy deep learning (DL) inference workloads.
Details regarding the DL2q instance can be found here.
The latest AMI from Qualcomm is “Deep Learning Base Qualcomm AMI (Amazon Linux 2) 1.12.0.88P 1.14.0.24A SDK” ami-057a7f5a69e18e465
. Email your 12-digit AWS account number along with the intended use case (CV, NLP, GenAI etc) to qualcomm_dl2q_ami@qti.qualcomm.com
. Qualcomm will provide access the latest AMI.
We recommend users to attach 500GB of EBS GP3 storage to the instance to start with (especially for running LLMs).
Note
Users of DL2q instance may need to request AWS to increase the vCPU limit to be able to spin up the first DL2q instance from an AWS account.
SSH to the instance and add user to the qaic
group to be able to run Cloud AI SDK tools without superuser (sudo) privileges.
Users can clone the Cloud AI 100 GitHub repository containing model recipes, code samples and toolchain tutorials on the instance to get started.