Validated Models

Text-only Language Models

Text Generation Task

QEff Auto Class: QEFFAutoModelForCausalLM

Architecture

Model Family

Representative Models

CB Support

FalconForCausalLM

Falcon

tiiuae/falcon-40b

✔️

GemmaForCausalLM

CodeGemma

google/codegemma-2b
google/codegemma-7b

✔️

Gemma

google/gemma-2b
google/gemma-7b
google/gemma-2-2b
google/gemma-2-9b
google/gemma-2-27b

✔️

GPTBigCodeForCausalLM

Starcoder1.5

bigcode/starcoder

✔️

Starcoder2

bigcode/starcoder2-15b

✔️

GPTJForCausalLM

GPT-J

EleutherAI/gpt-j-6b

✔️

GPT2LMHeadModel

GPT-2

openai-community/gpt2

✔️

GraniteForCausalLM

Granite 3.1

ibm-granite/granite-3.1-8b-instruct
ibm-granite/granite-guardian-3.1-8b

✔️

Granite 20B

ibm-granite/granite-20b-code-base-8k
ibm-granite/granite-20b-code-instruct-8k

✔️

InternVLChatModel

Intern-VL

OpenGVLab/InternVL2_5-1B

LlamaForCausalLM

CodeLlama

codellama/CodeLlama-7b-hf
codellama/CodeLlama-13b-hf
codellama/CodeLlama-34b-hf

✔️

DeepSeek-R1-Distill-Llama

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

✔️

InceptionAI-Adapted

inceptionai/jais-adapted-7b
inceptionai/jais-adapted-13b-chat
inceptionai/jais-adapted-70b

✔️

Llama 3.3

meta-llama/Llama-3.3-70B-Instruct

✔️

Llama 3.2

meta-llama/Llama-3.2-1B
meta-llama/Llama-3.2-3B

✔️

Llama 3.1

meta-llama/Llama-3.1-8B
meta-llama/Llama-3.1-70B

✔️

Llama 3

meta-llama/Meta-Llama-3-8B
meta-llama/Meta-Llama-3-70B

✔️

Llama 2

meta-llama/Llama-2-7b-chat-hf
meta-llama/Llama-2-13b-chat-hf
meta-llama/Llama-2-70b-chat-hf

✔️

Vicuna

lmsys/vicuna-13b-delta-v0
lmsys/vicuna-13b-v1.3
lmsys/vicuna-13b-v1.5

✔️

MistralForCausalLM

Mistral

mistralai/Mistral-7B-Instruct-v0.1

✔️

MixtralForCausalLM

Codestral
Mixtral

mistralai/Codestral-22B-v0.1
mistralai/Mixtral-8x7B-v0.1

✔️

MPTForCausalLM

MPT

mosaicml/mpt-7b

✔️

Phi3ForCausalLM

Phi-3, Phi-3.5

microsoft/Phi-3-mini-4k-instruct

✔️

QwenForCausalLM

DeepSeek-R1-Distill-Qwen

DeepSeek-R1-Distill-Qwen-32B

✔️

Qwen2, Qwen2.5

Qwen/Qwen2-1.5B-Instruct

✔️

Embedding Models

Text Embedding Task

QEff Auto Class: QEFFAutoModel

Architecture

Model Family

Representative Models

BertModel

BERT-based

BAAI/bge-base-en-v1.5
BAAI/bge-large-en-v1.5
BAAI/bge-small-en-v1.5
e5-large-v2

LlamaModel

Llama-based

intfloat/e5-mistral-7b-instruct

Qwen2ForCausalLM

Qwen2

stella_en_1.5B_v5

XLMRobertaForSequenceClassification

XLM-RoBERTa

bge-reranker-v2-m3bge-reranker-v2-m3

MPNetForMaskedLM

MPNet

sentence-transformers/multi-qa-mpnet-base-cos-v1

NomicBertModel

NomicBERT

nomic-embed-text-v1.5

MistralModel

Mistral

e5-mistral-7b-instruct

Multimodal Language Models

Vision-Language Models (Text + Image Generation)

QEff Auto Class: QEFFAutoModelImageTextToText

Architecture

Model Family

Representative Models

LlavaForConditionalGeneration

LLaVA-1.5

llava-hf/llava-1.5-7b-hf

MllamaForConditionalGeneration

Llama 3.2

meta-llama/Llama-3.2-11B-Vision Instruct
meta-llama/Llama-3.2-90B-Vision

Audio Models

(Automatic Speech Recognition) - Transcription Task QEff Auto Class: QEFFAutoModelForSpeechSeq2Seq

Architecture

Model Family

Representative Models

Whisper

Whisper

openai/whisper-tiny
openai/whisper-base
openai/whisper-small
openai/whisper-medium
openai/whisper-large
openai/whisper-large-v3-turbo

Models Coming Soon

Architecture

Model Family

Representative Models

BaichuanForCausalLM

Baichuan2

baichuan-inc/Baichuan2-7B-Base

CohereForCausalLM

Command-R

CohereForAI/c4ai-command-r-v01

DbrxForCausalLM

DBRX

databricks/dbrx-base