Mistral Medium 3.5 - Inward App

Mistral Medium 3.5

A 128B model for coding, reasoning, and long tasks

Website mistral.ai

What it is

Mistral Medium 3.5 is a 128B dense model merging coding, reasoning, and instruction-following in one set of weights. 256k context, configurable reasoning effort. Open weights on HuggingFace for engineers and teams running self-hosted inference.

Intent

I need it when

Execute multi-step productivity and research workflows with tool integration

Mistral Medium 3.5 enables Work mode in Le Chat, an agentic interface that orchestrates cross-tool workflows, web research, document synthesis, and inbox triage. The model calls multiple tools in parallel, maintains session state across turns, and produces structured output for downstream consumption.

Integrate vision capabilities into agentic workflows

Mistral Medium 3.5 includes a vision encoder trained from scratch to handle variable image sizes and aspect ratios, enabling multimodal agentic tasks within a single model without separate vision components.

Build and deploy custom AI agents with strong reasoning and instruction-following

As a 128B dense model with configurable reasoning effort per request, Mistral Medium 3.5 merges instruction-following, reasoning, and coding into a single set of weights. It can be self-hosted on as few as four GPUs and is available as open weights, enabling custom agent development with full control.

Process large documents and maintain context across extended conversations

Mistral Medium 3.5 features a 256k context window, enabling it to handle long-horizon tasks and maintain state across multiple tool calls and conversation turns without losing context or task coherence.

Automate complex coding tasks and software engineering work without manual intervention

Mistral Medium 3.5 powers remote coding agents in Vibe that run asynchronously in the cloud, allowing developers to spawn long-running tasks from CLI or Le Chat and receive notifications when complete. The model scores 77.6% on SWE-Bench Verified and handles module refactors, test generation, dependency upgrades, and bug fixes reliably.

Drop

Not a fit when

User needs a lightweight model for resource-constrained environments; Mistral Medium 3.5 is a 128B dense model requiring at least four GPUs for self-hosting
User requires real-time, ultra-low-latency completions; the model is optimized for long-horizon agentic tasks, not high-frequency code completion
User needs a model with closed-source, proprietary licensing; Mistral Medium 3.5 is released as open weights under modified MIT license
User requires specialized domain expertise in non-coding areas; the model is primarily built for instruction-following, reasoning, and coding tasks
User operates in an environment where configurable reasoning effort per request is not needed; this feature adds complexity for simple use cases

Commercials

Pricing

API token-based pricing: $1.5 per million input tokens, $7.5 per million output tokens. Also available via Vibe subscription plans (Free, Pro $14.99/mo, Team $24.99/user/mo, Enterprise custom). View pricing