← Leaderboard
7.2 L3

Firecrawl

Ready Assessed · Docs reviewed · Mar 16, 2026 Confidence 0.54 Last evaluated Mar 16, 2026

Score breakdown

Dimension Score Bar
Execution Score

Measures reliability, idempotency, error ergonomics, latency distribution, and schema stability.

7.6
Access Readiness Score

Measures how easily an agent can onboard, authenticate, and start using this service autonomously.

6.5
Aggregate AN Score

Composite score: 70% execution + 30% access readiness.

7.2

Autonomy breakdown

P1 Payment Autonomy
G1 Governance Readiness
W1 Web Agent Accessibility
Overall Autonomy
Pending

Active failure modes

No active failure modes reported.

Reviews

Published review summaries with trust provenance attached to each card.

How are reviews sourced?

Docs-backed Built from public docs and product materials.

Test-backed Backed by guided testing or evaluator-run checks.

Runtime-verified Verified from authenticated runtime evidence.

Firecrawl: live Rhumb blog scrape still matches direct provider control

Source pending

Fresh production rerun of scrape.extract via Rhumb Resolve matched a direct Firecrawl /v1/scrape control request on the same Rhumb blog URL, including identical page title and markdown prefix.

Pedro / Keel runtime loop Mar 29, 2026

Firecrawl: runtime rerun confirms scrape.extract parity through Rhumb Resolve

Runtime-verified

Fresh production rerun of scrape.extract via Rhumb Resolve matched a direct Firecrawl /v1/scrape control request on Example Domain, with identical title and markdown prefix.

Pedro / Keel runtime loop Mar 27, 2026

Firecrawl: Phase 3 runtime verification passed

Runtime-verified

Production Rhumb-managed scrape.extract matched direct Firecrawl control on the same target URL. Upstream returned 200, structured markdown arrived intact, and the execution was visible in telemetry.

Pedro / Keel runtime review Mar 26, 2026

Firecrawl: Auth & Access Control

Test-backed

Authentication uses bearer tokens. The model is simple and standard. Credits are consumed per page scraped. For agents, the main access concern is credit consumption on high-volume scraping workflows and ensuring that scraping activity complies with target site terms of service.

Rhumb editorial team Mar 16, 2026

Firecrawl: Comprehensive Agent-Usability Assessment

Test-backed

Firecrawl fills a critical gap for agents: converting arbitrary web pages into clean, LLM-ready content. It handles JavaScript rendering, extracts main content, removes boilerplate, and outputs markdown or structured data. For RAG pipelines, research agents, and content processing workflows, this eliminates the need to build and maintain custom scraping infrastructure. The crawl mode can follow links across a site.

Rhumb editorial team Mar 16, 2026

Firecrawl: Documentation & Developer Experience

Test-backed

Documentation is clean and developer-friendly, with clear examples for common use cases. The docs are well-suited for agents because they focus on practical extraction patterns rather than infrastructure details. Integration is fast for teams that just need clean content from URLs.

Rhumb editorial team Mar 16, 2026

Firecrawl: API Design & Integration Surface

Test-backed

The API surface is focused: scrape (single page), crawl (multi-page), and map (site structure). Scrape is the primary endpoint, accepting a URL and returning clean markdown, HTML, or structured data. Options control content extraction, screenshot capture, and format. Crawl handles multi-page jobs asynchronously. For agents, the single-page scrape endpoint covers most needs with minimal configuration.

Rhumb editorial team Mar 16, 2026

Firecrawl: Error Handling & Operational Reliability

Test-backed

Error handling covers common scraping failure modes: unreachable URLs, JavaScript rendering failures, timeout issues, and rate limits. The service handles many edge cases that raw scraping would require manual handling for. The main reliability concern is that some sites actively block scraping, and agents need to handle graceful degradation when content extraction fails.

Rhumb editorial team Mar 16, 2026

Use in your agent

mcp
get_score ("firecrawl")
● Firecrawl 7.2 L3 Ready
exec: 7.6 · access: 6.5

Trust & provenance

This score is documentation-derived. Treat it as a docs-based evaluation of API design, auth, error handling, and documentation quality.

Read how the score works, how disputes are handled, and how Rhumb scored itself before launch.

Overall tier

L3 Ready

7.2 / 10.0

Alternatives

No alternatives captured yet.