Validated Models
Text-only Language Models
Text Generation Task
QEff Auto Class: QEFFAutoModelForCausalLM
Architecture |
Model Family |
Representative Models |
CB Support |
---|---|---|---|
FalconForCausalLM |
Falcon |
✔️ |
|
GemmaForCausalLM |
CodeGemma |
✔️ |
|
Gemma |
google/gemma-2b |
✔️ |
|
GPTBigCodeForCausalLM |
Starcoder1.5 |
✔️ |
|
Starcoder2 |
✔️ |
||
GPTJForCausalLM |
GPT-J |
✔️ |
|
GPT2LMHeadModel |
GPT-2 |
✔️ |
|
GraniteForCausalLM |
Granite 3.1 |
ibm-granite/granite-3.1-8b-instruct |
✔️ |
Granite 20B |
ibm-granite/granite-20b-code-base-8k |
✔️ |
|
InternVLChatModel |
Intern-VL |
||
LlamaForCausalLM |
CodeLlama |
codellama/CodeLlama-7b-hf |
✔️ |
DeepSeek-R1-Distill-Llama |
✔️ |
||
InceptionAI-Adapted |
inceptionai/jais-adapted-7b |
✔️ |
|
Llama 3.3 |
✔️ |
||
Llama 3.2 |
✔️ |
||
Llama 3.1 |
✔️ |
||
Llama 3 |
✔️ |
||
Llama 2 |
meta-llama/Llama-2-7b-chat-hf |
✔️ |
|
Vicuna |
lmsys/vicuna-13b-delta-v0 |
✔️ |
|
MistralForCausalLM |
Mistral |
✔️ |
|
MixtralForCausalLM |
Codestral |
✔️ |
|
MPTForCausalLM |
MPT |
✔️ |
|
Phi3ForCausalLM |
Phi-3, Phi-3.5 |
✔️ |
|
QwenForCausalLM |
DeepSeek-R1-Distill-Qwen |
✔️ |
|
Qwen2, Qwen2.5 |
✔️ |
Embedding Models
Text Embedding Task
QEff Auto Class: QEFFAutoModel
Architecture |
Model Family |
Representative Models |
---|---|---|
BertModel |
BERT-based |
BAAI/bge-base-en-v1.5 |
LlamaModel |
Llama-based |
|
Qwen2ForCausalLM |
Qwen2 |
|
XLMRobertaForSequenceClassification |
XLM-RoBERTa |
|
MPNetForMaskedLM |
MPNet |
|
NomicBertModel |
NomicBERT |
|
MistralModel |
Mistral |
Multimodal Language Models
Vision-Language Models (Text + Image Generation)
QEff Auto Class: QEFFAutoModelImageTextToText
Architecture |
Model Family |
Representative Models |
---|---|---|
LlavaForConditionalGeneration |
LLaVA-1.5 |
|
MllamaForConditionalGeneration |
Llama 3.2 |
meta-llama/Llama-3.2-11B-Vision Instruct |
Audio Models
(Automatic Speech Recognition) - Transcription Task
QEff Auto Class: QEFFAutoModelForSpeechSeq2Seq
Architecture |
Model Family |
Representative Models |
---|---|---|
Whisper |
Whisper |
openai/whisper-tiny |
Models Coming Soon
Architecture |
Model Family |
Representative Models |
---|---|---|
BaichuanForCausalLM |
Baichuan2 |
|
CohereForCausalLM |
Command-R |
|
DbrxForCausalLM |
DBRX |