Mistral small 3.1 - Inward App

Mistral small 3.1

The best model in its weight class

Website mistral.ai

What it is

- We’re committed to empower the AI community with open technology. Our open models sets the bar for efficiency, and are available for free, with fully permissive license. - Our optimized commercial models are designed for performance and are available via our flexible deployment options.

Intent

I need it when

Build cost-effective AI applications with lightweight inference

Mistral Small 3.1 is a lightweight, multimodal model optimized for cost-sensitive projects. At $0.1/M input tokens and $0.3/M output tokens, it delivers state-of-the-art performance for tasks like text generation, classification, and summarization while minimizing API costs compared to larger models.

Integrate AI into production applications with minimal setup

Mistral Small 3.1 is available via Mistral's API and can be deployed through Studio, Vibe, or self-hosted infrastructure. Developers can integrate it into workflows with full observability, evals, and guardrails through Mistral Studio's unified registry and deployment portability.

Deploy multilingual and multimodal AI without vendor lock-in

Mistral Small 3.1 is released as open-weight under Apache 2.0 license for research/individual use, enabling self-hosting on any infrastructure. It supports multiple languages and modalities, allowing organizations to maintain control over their AI deployments and data.

Accelerate development of agentic AI systems and autonomous workflows

Mistral Small 3.1 powers Vibe agents and can be orchestrated through Mistral Studio for multi-step task automation, persistent memory, and tool integration. It enables developers to build autonomous agents for document analysis, research, and complex reasoning workflows.

Drop

Not a fit when

User requires on-device inference without API calls or cloud connectivity
Organization needs guaranteed SLA and support without enterprise contract negotiation
Project requires models optimized for extremely low-latency real-time applications under 50ms
User needs a fully open-source model with no commercial licensing restrictions for derivative works
Application requires specialized domain models pre-trained on proprietary data (e.g., medical, legal) without custom training

Commercials

Pricing

Pay-per-token API pricing; Vibe subscription plans (Free, Pro $14.99/mo, Team $24.99/user/mo, Enterprise custom) View pricing