Caveman - Inward App

Caveman

Why use so many token when few do trick?

Website github.com

What it is

Caveman cuts ~75% of Claude's output tokens without losing technical accuracy. One-line install for Claude Code, Cursor, Windsurf, Copilot, and more. Four grunt levels, terse commits, one-line PR reviews, and input compression built in. 24.9K stars.

Intent

I need it when

Generate concise, actionable code review comments and commit messages

Caveman provides /caveman-review (one-line PR comments with specific line numbers and fixes) and /caveman-commit (conventional commits ≤50 char subjects). Developers get focused, scannable feedback instead of verbose explanations, improving code review velocity.

Compress project memory and context files to extend agent context window across sessions

Caveman includes /caveman-compress skill to rewrite memory files (CLAUDE.md, project notes) into compressed caveman-speak, cutting ~46% of input tokens. This allows agents to retain more project context across sessions without hitting token limits, enabling longer-running development workflows.

Reduce API costs and token consumption when using Claude Code or similar AI coding agents

Caveman compresses AI agent output by ~65-75% by enforcing terse, caveman-style communication while preserving technical accuracy. Users install the skill/plugin once and trigger with /caveman command, immediately cutting output tokens and associated API costs without sacrificing code quality or correctness.

Speed up AI-assisted coding workflows and reduce response latency

By reducing output verbosity, Caveman enables ~3x faster response times from AI agents. Shorter responses mean quicker feedback loops, allowing developers to iterate faster on code generation, debugging, and review tasks within their existing IDE or terminal.

Monitor and track token savings across AI coding sessions for cost visibility

Caveman's /caveman-stats command displays real session token usage, lifetime savings, and USD cost reduction. Output includes a tweetable summary and updates a statusline badge in Claude Code, giving developers immediate visibility into cost benefits and ROI of the compression skill.

Drop

Not a fit when

User needs verbose, detailed explanations and prefers comprehensive output over brevity
User requires guaranteed preservation of all technical nuance and cannot accept any compression of reasoning
User works with AI models where output token reduction is not a cost concern or priority
User needs the product to work with AI agents other than Claude Code, Codex, Gemini, Cursor, Windsurf, Cline, or Copilot (limited agent support)
User requires a commercial product with dedicated support and service-level agreements

Commercials

Pricing

Free, open-source MIT-licensed software