Back to products
Transformers v5

Transformers v5

The backbone of modern AI, re-engineered

Overview

What it is

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Intent

I need it when

Standardize model definitions across research and production teams

Transformers acts as the model-definition framework agreed upon across the ecosystem. It centralizes model definitions so they work with multiple training frameworks (Axolotl, Unsloth, DeepSpeed), inference engines (vLLM, SGLang, TGI), and adjacent libraries.

Access and use cutting-edge transformer architectures for text, vision, audio, and video

Transformers supports state-of-the-art models across modalities including text generation, image segmentation, automatic speech recognition, document QA, and multimodal tasks. Users can explore the Timeline to discover latest architectures.

Build and deploy state-of-the-art NLP and multimodal models quickly

Transformers provides pre-trained model checkpoints (1M+ on Hub), unified model definitions across frameworks, and Pipeline API for inference with minimal code. Users can load models and run inference in seconds without building from scratch.

Fine-tune large language models efficiently with limited compute resources

Transformers integrates with PEFT for parameter-efficient fine-tuning, supports mixed precision training, torch.compile optimization, and distributed training via Accelerate. Users can adapt large models on consumer hardware.

Reduce training time and carbon footprint by leveraging pre-trained models

Transformers provides pre-trained models that reduce compute cost, time, and environmental impact compared to training from scratch. Each model is reproduced to match original performance and offers state-of-the-art results out-of-the-box.

Drop

Not a fit when

  • User needs proprietary closed-source model frameworks without open-source dependencies
  • User requires commercial support guarantees for production systems without enterprise contract
  • User needs models for non-PyTorch frameworks exclusively (Transformers is PyTorch-first)
  • User operates in restricted environments where open-source code cannot be deployed
  • User needs pre-built inference without any code implementation or customization
Commercials

Pricing

Free open-source library with optional paid compute and enterprise services View pricing