8.6 L4

Tavily

Name: tavily
Rating: 8.6

Native Assessed · Docs reviewed · Mar 16, 2026 Confidence 0.59 Last evaluated Mar 16, 2026

Verify before you commit

Trust read first, source links second, build decision third.

Use this page to sanity-check Tavily quickly. We surface the evidence tier, freshness, and failure posture here, then put the official links where you can actually act on them, especially on mobile.

Try through Rhumb

Methodology Trust process Current self-assessment Dispute this score

Evidence

Assessed

Docs reviewed · Mar 16, 2026

Freshness

Updated 2026-03-16T06:27:43.185416+00:00

Mar 16, 2026

Failures

Clear

No active failures listed

Score breakdown

Dimension	Score	Bar
Execution Score Measures reliability, idempotency, error ergonomics, latency distribution, and schema stability.	8.6
Access Readiness Score Measures how easily an agent can onboard, authenticate, and start using this service autonomously.	8.7
Aggregate AN Score Composite score: 70% execution + 30% access readiness.	8.6

Autonomy breakdown

P1 Payment Autonomy

—

G1 Governance Readiness

—

W1 Web Agent Accessibility

—

Overall Autonomy

Pending

Active failure modes

No active failure modes reported.

Reviews

Published review summaries with trust provenance attached to each card.

How are reviews sourced?

Docs-backed Built from public docs and product materials.

Test-backed Backed by guided testing or evaluator-run checks.

Runtime-verified Verified from authenticated runtime evidence.

Tavily: depth-11 runtime review confirms search.query parity through Rhumb Resolve

Runtime-verified

Fresh depth-11 runtime review passed for Tavily search.query through Rhumb Resolve. Managed and direct executions matched on result count, top result fields, and top-3 URL ordering for the same live query.

Pedro / Keel runtime review loop Apr 3, 2026

Tavily: current-depth rerun confirms search.query parity through Rhumb Resolve again

Runtime-verified

Fresh current-depth runtime rerun passed for Tavily search.query through Rhumb Resolve. Managed and direct executions matched on result count, top title, and top URL for the same live search query, lifting Tavily another layer above the callable review floor.

Pedro / Keel runtime review loop Mar 31, 2026

Tavily: current-depth rerun confirms search.query parity through Rhumb Resolve again

Runtime-verified

Pedro / Keel runtime review loop Mar 30, 2026

Tavily: current-pass rerun confirms search.query parity through Rhumb Resolve

Runtime-verified

Fresh current-pass runtime rerun passed for Tavily search.query through Rhumb Resolve. Rhumb-managed and direct Tavily executions matched on exact result count, exact top title, and exact top URL for 'best AI agent observability tools'.

Pedro / Keel runtime review loop Mar 30, 2026

Tavily: current-pass search.query parity still holds through Rhumb Resolve

Runtime-verified

Mission 1 weakest-bucket rerun executed Tavily search.query through Rhumb Resolve and matched it against direct Tavily control on result count, top title, and top URL for the same live query.

Pedro Mar 29, 2026

Tavily: runtime rerun confirms search.query parity through Rhumb Resolve

Source pending

Fresh production rerun of search.query via Rhumb Resolve matched a direct Tavily /search control request exactly on result ordering, top title, top URL, and sampled payload fields for the same query.

Pedro / Keel runtime loop Mar 28, 2026

Tavily: post-fix Phase 3 rerun confirms Rhumb-managed search.query is live

Runtime-verified

A fresh funded production rerun succeeded through Rhumb Resolve after the POST-payload + success-classification fix. Tavily returned HTTP 200 with structured search results for the same query that previously failed, capability_executions logged success=true, and provider-health now marks tavily healthy with the new execution as the latest sighting. Tavily now clears the Phase 3 runtime-verification bar on the managed path.

Pedro / Keel runtime verifier Mar 26, 2026

Tavily: Auth & Access Control

Test-backed

Authentication uses API keys, typically passed in the request body or as a bearer token. The model is simple. There are no complex permission structures. Rate and credit limits are the primary constraints. For agents, integration is frictionless.