Deploy reasoning capabilities on Windows 11 Copilot+ PCs and edge devices with NPU acceleration
Phi-4 Reasoning models use the optimizations behind Phi Silica (the NPU-optimized Phi variant) and will run on Copilot+ PC NPUs as ONNX-optimized models. Running on the NPU gives low time-to-first-token latency and power-efficient token throughput, so the model can be invoked concurrently with other applications and can provide reasoning fully offline.
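
As a rough illustration of targeting an NPU through ONNX Runtime, the sketch below picks an execution provider in a preference order and falls back to the CPU. The preference order, the helper names `choose_providers` and `create_session`, and the model path are assumptions made for this example, not part of any published Phi-4 deployment recipe. Real NPU providers usually need vendor-specific provider options as well, and language models are often run through a higher-level generation library rather than a raw session.

```python
# Preference order is an assumption: NPU-backed providers first, then GPU, then CPU.
PREFERRED_PROVIDERS = [
    "QNNExecutionProvider",       # Qualcomm NPUs
    "VitisAIExecutionProvider",   # AMD NPUs
    "OpenVINOExecutionProvider",  # Intel hardware, when configured for the NPU device
    "DmlExecutionProvider",       # DirectML (GPU)
    "CPUExecutionProvider",       # fallback
]


def choose_providers(available):
    """Return the preferred providers that are actually available, in order."""
    chosen = [p for p in PREFERRED_PROVIDERS if p in available]
    # Keep CPU as a last resort so the returned list is never empty.
    if "CPUExecutionProvider" not in chosen:
        chosen.append("CPUExecutionProvider")
    return chosen


def create_session(model_path):
    """Create an ONNX Runtime session for a hypothetical local model file."""
    import onnxruntime as ort  # imported lazily so choose_providers works without it

    providers = choose_providers(ort.get_available_providers())
    return ort.InferenceSession(model_path, providers=providers)
```

Calling `choose_providers` with the list returned by `onnxruntime.get_available_providers()` shows which backend a session would try first on a given machine, which is a quick way to check whether the NPU path is active or the runtime has fallen back to the CPU.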

