Back to products
Forge CLI

Forge CLI

Swarm agents optimize CUDA/Triton for any HF/PyTorch model Hardware • Developer Tools • Artificial Intelligence 2 109 AssemblyAI Voice Agent API One API to build production-ready voice agents API • Artificial Intelligence • Audio

Overview

What it is

RightNow AI is the #1 GPU AI code editor. It combines GPU profiling, benchmarking, AI optimization, GPU virtualization, and a full GPU emulator in one environment to help developers analyze and optimize CUDA code faster

Intent

I need it when

Reduce GPU infrastructure costs and improve model inference speed

Forge generates drop-in replacement kernels benchmarked against torch.compile(max_autotune), helping enterprises save thousands on GPU costs while improving AI model speed and efficiency across all GPU architectures.

Optimize inference workloads on datacenter GPUs with dedicated support

Forge provides enterprise-grade AI kernel optimization with dedicated infrastructure, on-premise deployment, custom SLAs, NDA and IP protection, and a dedicated support team for production inference optimization.

Benchmark and validate kernel optimizations across multiple GPU architectures

Forge CLI integrates with RightNow Editor's profiling and multi-GPU comparison capabilities to test optimized kernels against different datacenter GPU architectures and measure performance improvements.

Convert PyTorch code into optimized CUDA or Triton kernels automatically

Forge CLI is an AI-powered command-line tool that transforms any PyTorch code into optimized GPU kernels, eliminating manual kernel writing and reducing development time for GPU-accelerated inference.

Drop

Not a fit when

  • User has no NVIDIA CUDA-capable GPU or CUDA Toolkit 11.0+ installed
  • User needs to optimize for non-datacenter GPUs (Forge supports only B200, H200, H100, A100, L40S)
  • User requires transparent, publicly listed pricing rather than custom enterprise quotes
  • User needs real-time support for consumer or small-scale inference workloads without dedicated infrastructure
  • User wants a standalone CLI tool without integration into the RightNow Editor ecosystem
Commercials

Pricing

Forge is an enterprise-only AI kernel optimization service with custom pricing. It generates optimized GPU kernels for AI models and includes dedicated infrastructure, on-premise deployment options, and custom SLAs. View pricing