Back to products
Crawler.sh

Crawler.sh

Free Local AEO & SEO Spider and a Markdown content extractor

Website crawler.sh
Overview

What it is

A fast, local-first web crawler and AEO & SEO analysis tool. Crawl entire sites in seconds from the terminal or a native desktop app, run automated SEO checks, extract content as clean Markdown, and export to JSON, CSV, or Sitemap XML.

Intent

I need it when

Extract clean, structured content from websites for AI training and RAG pipelines

Crawler.sh converts any website into RAG-ready Markdown with automatic content extraction, word counts, author bylines, and excerpts. Pro plan enables bulk Content Archive export as ZIP with individual Markdown files per page, eliminating manual content preparation for LLM fine-tuning and agent context.

Crawl large websites efficiently without cloud costs or per-page fees

Crawler.sh runs locally on your machine with no cloud bill or API quota. Pro plan ($99/year) allows up to 10,000 pages per crawl with concurrent requests, automatic retry, and robots.txt compliance. Built in Rust for speed; typical crawls complete in seconds.

Audit website SEO health and identify technical issues across all pages

Crawler.sh runs 24 automated SEO checks (missing titles, duplicate meta descriptions, noindex directives, thin content, broken links, content freshness signals, etc.) across every crawled page in a single pass. Results export as CSV or TXT for team review and batch remediation.

Generate accurate, up-to-date XML sitemaps from live website crawls

Crawler.sh automatically generates W3C-compliant Sitemap XML from live crawls, keeping sitemaps accurate without manual maintenance. Available in both free and Pro tiers; export happens locally with no external dependencies.

Monitor website health and catch broken links, missing pages, and status changes before users do

Crawler.sh can be run regularly to detect broken links, missing pages, and status code changes across your site. Results export as JSON or CSV for integration into monitoring dashboards or CI/CD pipelines; runs entirely locally for privacy and speed.

Drop

Not a fit when

  • User needs real-time cloud-based crawling with webhooks or scheduled jobs (Crawler Cloud is coming soon but not yet available)
  • User requires Windows native support (currently CLI runs on WSL2 only; native Windows build in progress)
  • User needs to crawl sites protected by advanced bot detection (Cloudflare, DataDome, PerimeterX) without bypassing capability
  • User requires per-page API pricing model or pay-as-you-go billing (product uses fixed annual subscription)
  • User needs headless Chrome integration or compatibility with existing Chrome-based crawling workflows
Commercials

Pricing

Freemium with annual Pro subscription. Free tier: up to 50 pages per crawl, SEO analysis, JSON/Sitemap export. Pro tier ($99/year): up to 10,000 pages per crawl, Markdown content extraction, Content Archive export. View pricing