Intel's latest llm-scaler-vllm v0.14.0-b8 delivers a 25% performance boost for AI inferencing on Battlemage GPUs. This update confirms support for the elusive BMG-G31 "Big Battlemage" silicon, achieving up to 1.49x faster throughput. We analyze the new features, validated models like Qwen3-VL, and what this means for the future of Intel Arc in the enterprise AI landscape.

