A powerful multimodal instruction‑tuned model from the Llama 4 family, built on a 17 B-parameter mixture‑of‑experts architecture (16 active experts, 109 B total). It natively processes text + image inputs and outputs text.

Llama 4 Scout 17B 16E Instruct
Provider: Azure_ai