Pre-requisites
System Requirements:
Supported Linux OS - Ubuntu, RHEL and AWS Linux
Installation
1. Download Apps SDK
Uninstall existing Apps SDK
sudo ./uninstall.sh
Run the install.sh script as root or with sudo to install with root permissions.
sudo ./install.sh --enable-qeff
source /opt/qti-aic/dev/python/qeff/bin/activate
On successful installation, the contents are stored to the /opt/qti-aic path under the dev and exec directories:
dev exec integrations scripts
Check the Apps SDK version with the following command
sudo /opt/qti-aic/tools/qaic-version-util --apps
Apply chmod commands
sudo chmod a+x /opt/qti-aic/dev/hexagon_tools/bin/*
sudo chmod a+x /opt/qti-aic/exec/*
2. Install Efficient-Transformers
Using GitHub Repository
Warning
Efficient Transformers have been validated to work with the same compatible SDK. Upgrading this may result in certain models becoming incompatible.
# Create Python virtual env and activate it. (Required Python 3.10)
python3.10 -m venv qeff_env
source qeff_env/bin/activate
pip install -U pip
# Clone and Install the QEfficient Repo.
pip install git+https://github.com/quic/efficient-transformers
Sanity Check
After above installation methods, you can check if QEfficient
is installed correctly by using
python -c "import QEfficient; print(QEfficient.__version__)"
If the above line executes successfully, you are good to go ahead and start deploying models on Cloud AI 100
cards using QEfficient
library.