Back to products
Modal Notebooks

Modal Notebooks

High-performance, collaborative GPU notebooks in the cloud

Overview

What it is

Run or deploy machine learning models, massively parallel compute jobs, task queues, web apps, and much more, without your own infrastructure.

Intent

I need it when

Deploy and iterate on AI models with fast feedback loops and minimal cold start overhead

Modal Notebooks provide sub-second cold starts and instant autoscaling, allowing data scientists to quickly test model changes, run inference, and fine-tune models without waiting for infrastructure provisioning. The serverless model eliminates capacity planning overhead.

Access diverse GPU hardware on-demand for specialized AI workloads without long-term commitments

Modal Notebooks provide instant access to B200s, H100s, A100s, A10Gs, and other GPUs globally distributed across clouds. Users can select specific hardware per function and scale elastically, paying per-second with no reserved capacity requirements.

Run compute-intensive data analysis and machine learning experiments at scale without managing infrastructure

Modal Notebooks enable users to write Python code that automatically scales across GPUs and CPUs in the cloud. Users can execute resource-heavy workloads like model training, batch inference, and data processing with instant autoscaling from 0 to 1000+ GPUs, paying only for actual compute time used.

Run untrusted or user-generated code safely in isolated environments for AI applications

Modal Sandboxes (integrated with Notebooks) provide secure, ephemeral execution environments using gVisor containerization. This enables safe execution of coding agents, AI-generated code, and user submissions with full isolation and automatic cleanup.

Execute batch processing and parallel jobs across large datasets without writing distributed systems code

Modal Notebooks support dynamic batching, job queues, and parallel processing primitives that allow users to spawn millions of parallel jobs in seconds. Users can process large datasets with simple Python code while Modal handles orchestration and scaling.

Drop

Not a fit when

  • User requires on-premises or private cloud deployment with no public cloud infrastructure
  • User needs traditional notebook experience without serverless compute abstraction or prefers local-only execution
  • User requires guaranteed reserved capacity or fixed monthly compute allocation without autoscaling
  • User works with proprietary data that cannot be processed on shared cloud infrastructure due to compliance restrictions
  • User needs real-time interactive notebook sessions with sub-millisecond latency for every cell execution
Commercials

Pricing

USD0 / monthly View pricing