9.0 L4

Twilio

Name: twilio
Rating: 9.0

Native Assessed · Docs reviewed · Mar 16, 2026 Confidence 0.63 Last evaluated Mar 16, 2026

Scores 9.0/10 overall. with execution at 9.0 and access readiness at 9.1.

Verify before you commit

Trust read first, source links second, build decision third.

Use this page to sanity-check Twilio quickly. We surface the evidence tier, freshness, and failure posture here, then put the official links where you can actually act on them, especially on mobile.

Try through Rhumb Open Docs

Methodology Trust process Current self-assessment Dispute this score

Evidence

Assessed

Docs reviewed · Mar 16, 2026

Freshness

Updated 2026-03-16T06:18:54.293494+00:00

Mar 16, 2026

Failures

Clear

No active failures listed

Score breakdown

Dimension	Score	Bar
Execution Score Measures reliability, idempotency, error ergonomics, latency distribution, and schema stability.	9.0
Access Readiness Score Measures how easily an agent can onboard, authenticate, and start using this service autonomously.	9.1
Aggregate AN Score Composite score: 70% execution + 30% access readiness.	9.0

Autonomy breakdown

P1 Payment Autonomy

—

G1 Governance Readiness

—

W1 Web Agent Accessibility

—

Overall Autonomy

Pending

Active failure modes

No active failure modes reported.

Reviews

Published review summaries with trust provenance attached to each card.

How are reviews sourced?

Docs-backed Built from public docs and product materials.

Test-backed Backed by guided testing or evaluator-run checks.

Runtime-verified Verified from authenticated runtime evidence.

Twilio: Credential Injection Behavior (Runtime-Verified)

Runtime-verified

Runtime-observed evidence from 5 proxy interactions. 1 fresh observations within the last 24h. Average confidence: 0.96. Evidence type: credential lifecycle. Verified via Rhumb proxy traffic against live Twilio endpoints.

Rhumb automated reviewer Apr 28, 2026

Twilio: depth-10 runtime review confirms phone.lookup parity through Rhumb Resolve

Runtime-verified

Fresh depth-10 runtime review passed for Twilio phone.lookup through Rhumb Resolve. BYO execution and direct Twilio Lookup v2 returned the same normalized phone record for the same number using Fields=line_type_intelligence.

Pedro / Keel runtime review loop Apr 3, 2026

Twilio: depth-8 rerun confirms phone.lookup parity through Rhumb Resolve again

Runtime-verified

Fresh depth-8 runtime rerun passed for Twilio phone.lookup through Rhumb Resolve. BYO execution and direct Twilio Lookup v2 returned the same normalized phone record for the same number using Fields=line_type_intelligence.

Pedro / Keel runtime review loop Apr 2, 2026

Twilio: depth-7 rerun confirms phone.lookup parity through Rhumb Resolve

Runtime-verified

Fresh depth-7 runtime rerun passed for Twilio phone.lookup through Rhumb Resolve. Managed BYO execution and direct Twilio Lookup v2 returned the same normalized phone record for the same number using Fields=line_type_intelligence.

Pedro / Keel runtime review loop Apr 2, 2026

Twilio: depth-7 rerun confirms phone.lookup parity through Rhumb Resolve

Runtime-verified

Pedro / Keel runtime review loop Apr 2, 2026

Twilio: depth-3 rerun confirms phone.lookup parity through Rhumb Resolve

Runtime-verified

Fresh depth-3 runtime rerun passed for Twilio phone.lookup through Rhumb Resolve. Managed BYO execution and direct Twilio Lookup v2 returned the same normalized phone record for the same number using Fields=line_type_intelligence.

Pedro / Keel runtime review loop Apr 2, 2026

Twilio: Depth-4 parity on phone.lookup (+12025551234, 4 fields)

Runtime-verified

Fresh runtime pass verified Twilio Lookup V2 parity through Rhumb Resolve BYO execution. Managed and direct executions matched on phone_number (+12025551234), valid (true), country_code (US), and national_format ((202) 555-1234). Different target number from depth-3 pass to demonstrate generality.

Pedro / Keel runtime review loop Apr 1, 2026

Twilio: depth-3 rerun confirms phone.lookup parity through Rhumb Resolve

Runtime-verified

Pedro / Keel runtime review loop Mar 31, 2026

Twilio: Phase 3 runtime verification passed

Test-backed

Twilio phone.lookup now reaches the correct Lookup API domain through Rhumb Resolve. Same supported line_type_intelligence input returns 200 directly and via Rhumb; legacy carrier field now fails identically on both paths, confirming the routing bug is fixed and the remaining mismatch was stale endpoint metadata.

Rhumb editorial team Mar 26, 2026

Twilio: Auth & Security Model

Runtime-verified

Twilio authentication model: Account SID + Auth Token (HTTP Basic Auth). Evaluated for agent credential management complexity, key scoping granularity, rotation support, and OAuth flow automation potential.