Skip to content

CPU - Intel® Xeon®

Validated Hardware

Hardware
Intel® Xeon® 6 Processors
Intel® Xeon® 5 Processors

Text-only Language Models

Model Architecture Supported
unsloth/gpt-oss-20b GptOssForCausalLM
meta-llama/Llama-3.1-8B-Instruct LlamaForCausalLM
meta-llama/Llama-3.2-1B LlamaForCausalLM
meta-llama/Llama-3.2-3B-Instruct LlamaForCausalLM
meta-llama/Llama-3.3-70B-Instruct LlamaForCausalLM
RedHatAI/Meta-Llama-3.1-8B-quantized.w8a8 LlamaForCausalLM
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 LlamaForCausalLM
RedHatAI/Llama-3.2-1B-Instruct-quantized.w8a8 LlamaForCausalLM
RedHatAI/Llama-3.2-3B-Instruct-quantized.w8a8 LlamaForCausalLM
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8 LlamaForCausalLM
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 LlamaForCausalLM
AMead10/Llama-3.2-1B-Instruct-AWQ LlamaForCausalLM
AMead10/Llama-3.2-3B-Instruct-AWQ LlamaForCausalLM
TheBloke/TinyLlama-1.1B-Chat-v1.0-AWQ LlamaForCausalLM
TheBloke/TinyLlama-1.1B-Chat-v1.0-GPTQ LlamaForCausalLM
ibm-granite/granite-3.2-2b-instruct GraniteForCausalLM
Qwen/Qwen3-1.7B Qwen3ForCausalLM
Qwen/Qwen3-4B Qwen3ForCausalLM
Qwen/Qwen3-8B Qwen3ForCausalLM
Qwen/Qwen3-14B Qwen3ForCausalLM
Qwen/Qwen3-14B-AWQ Qwen3ForCausalLM
Qwen/Qwen3-30B-A3B Qwen3MoeForCausalLM
Qwen/QwQ-32B-AWQ Qwen2ForCausalLM
Qwen/Qwen1.5-0.5B-Chat-GPTQ-Int4 Qwen2ForCausalLM
RedHatAI/QwQ-32B-quantized.w8a8 Qwen2ForCausalLM
zai-org/glm-4-9b-hf GLMForCausalLM
google/gemma-7b GemmaForCausalLM
microsoft/Phi-4-reasoning Phi3ForCausalLM
TheBloke/Mistral-7B-Instruct-v0.2-AWQ MistralForCausalLM

Multimodal Language Models

Model Architecture Supported
meta-llama/Llama-4-Scout-17B-16E-Instruct Llama4ForConditionalGeneration
google/gemma-3-4b-it Gemma3ForConditionalGeneration
google/gemma-3-12b-it Gemma3ForConditionalGeneration
google/gemma-4-E4B-it Gemma4ForConditionalGeneration
google/gemma-4-E2B-it Gemma4ForConditionalGeneration
google/gemma-4-26B-A4B-it Gemma4ForConditionalGeneration
microsoft/Phi-4-multimodal-instruct Phi4MMForCausalLM
Qwen/Qwen2.5-VL-7B-Instruct Qwen2VLForConditionalGeneration
openai/whisper-large-v3 WhisperForConditionalGeneration

✅ Runs and optimized.