← Leaderboard
8.8 L4

Firecrawl

Native Assessed · Docs reviewed · Mar 16, 2026 Confidence 0.61 Last evaluated Mar 16, 2026

Scores 8.8/10 overall. with execution at 8.8 and access readiness at 8.9.

Verify before you commit

Trust read first, source links second, build decision third.

Use this page to sanity-check Firecrawl quickly. We surface the evidence tier, freshness, and failure posture here, then put the official links where you can actually act on them, especially on mobile.

Evidence

Assessed

Docs reviewed · Mar 16, 2026

Freshness

Updated 2026-03-16T06:27:44.686547+00:00

Mar 16, 2026

Failures

Clear

No active failures listed

Score breakdown

Dimension Score Bar
Execution Score

Measures reliability, idempotency, error ergonomics, latency distribution, and schema stability.

8.8
Access Readiness Score

Measures how easily an agent can onboard, authenticate, and start using this service autonomously.

8.9
Aggregate AN Score

Composite score: 70% execution + 30% access readiness.

8.8

Autonomy breakdown

P1 Payment Autonomy
G1 Governance Readiness
W1 Web Agent Accessibility
Overall Autonomy
Pending

Active failure modes

No active failure modes reported.

Reviews

Published review summaries with trust provenance attached to each card.

How are reviews sourced?

Docs-backed Built from public docs and product materials.

Test-backed Backed by guided testing or evaluator-run checks.

Runtime-verified Verified from authenticated runtime evidence.

Firecrawl: depth-11 runtime review confirms scrape.extract parity through Rhumb Resolve

Runtime-verified

Fresh depth-11 runtime review passed for Firecrawl scrape.extract through Rhumb Resolve. Managed and direct executions matched on scrape success, metadata title, source URL, and markdown output for the same live target URL.

Pedro / Keel runtime review loop Apr 3, 2026

Firecrawl: current-depth rerun confirms scrape.extract parity through Rhumb Resolve at depth 6

Runtime-verified

Fresh current-depth runtime rerun passed for Firecrawl scrape.extract through Rhumb Resolve. Managed and direct executions matched on scrape success, page title, and markdown presence for the same target URL, lifting Firecrawl from claim-safe depth 5 to 6 in the callable review rotation.

Pedro / Keel runtime review loop Mar 31, 2026

Firecrawl: current-depth rerun confirms scrape.extract parity through Rhumb Resolve again

Runtime-verified

Fresh current-depth runtime rerun passed for Firecrawl scrape.extract through Rhumb Resolve. Managed and direct executions matched on scrape success, page title, and markdown presence for the same target URL, lifting Firecrawl another layer above the callable review floor.

Pedro / Keel runtime review loop Mar 30, 2026

Firecrawl: current-depth rerun confirms scrape.extract parity through Rhumb Resolve

Runtime-verified

Fresh current-depth runtime rerun passed for Firecrawl scrape.extract through Rhumb Resolve. Managed and direct executions matched on page title and markdown prefix for the same live Rhumb blog URL.

Pedro / Keel runtime review loop Mar 30, 2026

Firecrawl: live Rhumb blog scrape still matches direct provider control

Source pending

Fresh production rerun of scrape.extract via Rhumb Resolve matched a direct Firecrawl /v1/scrape control request on the same Rhumb blog URL, including identical page title and markdown prefix.

Pedro / Keel runtime loop Mar 29, 2026

Firecrawl: runtime rerun confirms scrape.extract parity through Rhumb Resolve

Runtime-verified

Fresh production rerun of scrape.extract via Rhumb Resolve matched a direct Firecrawl /v1/scrape control request on Example Domain, with identical title and markdown prefix.

Pedro / Keel runtime loop Mar 27, 2026

Firecrawl: Phase 3 runtime verification passed

Runtime-verified

Production Rhumb-managed scrape.extract matched direct Firecrawl control on the same target URL. Upstream returned 200, structured markdown arrived intact, and the execution was visible in telemetry.

Pedro / Keel runtime review Mar 26, 2026

Firecrawl: Auth & Access Control

Test-backed

Authentication uses bearer tokens. The model is simple and standard. Credits are consumed per page scraped. For agents, the main access concern is credit consumption on high-volume scraping workflows and ensuring that scraping activity complies with target site terms of service.

Rhumb editorial team Mar 16, 2026

Firecrawl: Comprehensive Agent-Usability Assessment

Test-backed

Firecrawl fills a critical gap for agents: converting arbitrary web pages into clean, LLM-ready content. It handles JavaScript rendering, extracts main content, removes boilerplate, and outputs markdown or structured data. For RAG pipelines, research agents, and content processing workflows, this eliminates the need to build and maintain custom scraping infrastructure. The crawl mode can follow links across a site.

Rhumb editorial team Mar 16, 2026

Firecrawl: Documentation & Developer Experience

Test-backed

Documentation is clean and developer-friendly, with clear examples for common use cases. The docs are well-suited for agents because they focus on practical extraction patterns rather than infrastructure details. Integration is fast for teams that just need clean content from URLs.

Rhumb editorial team Mar 16, 2026

Firecrawl: API Design & Integration Surface

Test-backed

The API surface is focused: scrape (single page), crawl (multi-page), and map (site structure). Scrape is the primary endpoint, accepting a URL and returning clean markdown, HTML, or structured data. Options control content extraction, screenshot capture, and format. Crawl handles multi-page jobs asynchronously. For agents, the single-page scrape endpoint covers most needs with minimal configuration.

Rhumb editorial team Mar 16, 2026

Firecrawl: Error Handling & Operational Reliability

Test-backed

Error handling covers common scraping failure modes: unreachable URLs, JavaScript rendering failures, timeout issues, and rate limits. The service handles many edge cases that raw scraping would require manual handling for. The main reliability concern is that some sites actively block scraping, and agents need to handle graceful degradation when content extraction fails.

Rhumb editorial team Mar 16, 2026

Use in your agent

mcp
get_score ("firecrawl")
● Firecrawl 8.8 L4 Native
exec: 8.8 · access: 8.9

Trust shortcuts

This score is documentation-derived. Treat it as a docs-based evaluation of API design, auth, error handling, and documentation quality.

Read how the score works, how disputes are handled, and how Rhumb scored itself before launch.

Overall tier

L4 Native

8.8 / 10.0

Alternatives

No alternatives captured yet.