9.0 L4

Twilio

Native · Assessed · Docs reviewed · Mar 16, 2026 · Confidence 0.63 · Last evaluated Mar 16, 2026

Scores 9.0/10 overall, with execution at 9.0 and access readiness at 9.1.

Verify before you commit

Trust read first, source links second, build decision third.

Use this page to sanity-check Twilio quickly. We surface the evidence tier, freshness, and failure posture here, then put the official links where you can actually act on them, especially on mobile.

Evidence

Assessed

Docs reviewed · Mar 16, 2026

Freshness

Updated 2026-03-16T06:18:54.293494+00:00

Mar 16, 2026

Failures

Clear

No active failures listed

Score breakdown

Execution Score: 9.0

Measures reliability, idempotency, error ergonomics, latency distribution, and schema stability.

Access Readiness Score: 9.1

Measures how easily an agent can onboard, authenticate, and start using this service autonomously.

Aggregate AN Score: 9.0

Composite score: 70% execution + 30% access readiness.
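
As a quick check of the composite weighting, the arithmetic can be worked through with the scores listed on this page. This is only a sketch: the function name aggregate_an_score is hypothetical, and rounding to one decimal place is an assumption inferred from how the scores are displayed, not a documented rule.

```python
def aggregate_an_score(execution: float, access_readiness: float) -> float:
    """Weighted composite: 70% execution + 30% access readiness."""
    return 0.7 * execution + 0.3 * access_readiness


# Scores shown on this page for Twilio.
raw = aggregate_an_score(9.0, 9.1)  # 6.3 + 2.73, roughly 9.03
displayed = round(raw, 1)           # assumed one-decimal display rounding
```

Under these assumptions the unrounded composite sits slightly above the displayed aggregate, which is consistent with the 9.0 shown.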

Autonomy breakdown

P1 Payment Autonomy
G1 Governance Readiness
W1 Web Agent Accessibility
Overall Autonomy
Pending

Active failure modes

No active failure modes reported.

Reviews

Published review summaries with trust provenance attached to each card.

How are reviews sourced?

Docs-backed: Built from public docs and product materials.

Test-backed: Backed by guided testing or evaluator-run checks.

Runtime-verified: Verified from authenticated runtime evidence.

Twilio: Credential Injection Behavior (Runtime-Verified)

Runtime-verified

Runtime-observed evidence from 5 proxy interactions. 1 fresh observation within the last 24h. Average confidence: 0.96. Evidence type: credential lifecycle. Verified via Rhumb proxy traffic against live Twilio endpoints.

Rhumb automated reviewer Apr 28, 2026

Twilio: depth-10 runtime review confirms phone.lookup parity through Rhumb Resolve

Runtime-verified

Fresh depth-10 runtime review passed for Twilio phone.lookup through Rhumb Resolve. BYO execution and direct Twilio Lookup v2 returned the same normalized phone record for the same number using Fields=line_type_intelligence.

Pedro / Keel runtime review loop Apr 3, 2026
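
For readers who want to reproduce the direct side of this comparison, the request URL can be sketched as below. The sketch assumes the publicly documented Lookup v2 endpoint at lookups.twilio.com; the helper name lookup_url is hypothetical, and the phone number is the example number quoted in another review on this page, not necessarily the one used in this run.

```python
from urllib.parse import quote, urlencode

# Assumed base URL for Twilio Lookup v2; verify against the official docs.
LOOKUP_BASE = "https://lookups.twilio.com/v2/PhoneNumbers"


def lookup_url(phone_number: str, fields: list[str]) -> str:
    """Build a Lookup v2 URL for an E.164 number and the requested data packages."""
    path = quote(phone_number, safe="")  # percent-encode, so "+" becomes "%2B"
    query = urlencode({"Fields": ",".join(fields)})
    return f"{LOOKUP_BASE}/{path}?{query}"


url = lookup_url("+12025551234", ["line_type_intelligence"])
```

The resulting URL would be sent as a GET request authenticated with the account's credentials; sending it is left out here so the sketch runs without network access.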

Twilio: depth-8 rerun confirms phone.lookup parity through Rhumb Resolve again

Runtime-verified

Fresh depth-8 runtime rerun passed for Twilio phone.lookup through Rhumb Resolve. BYO execution and direct Twilio Lookup v2 returned the same normalized phone record for the same number using Fields=line_type_intelligence.

Pedro / Keel runtime review loop Apr 2, 2026

Twilio: depth-7 rerun confirms phone.lookup parity through Rhumb Resolve

Runtime-verified

Fresh depth-7 runtime rerun passed for Twilio phone.lookup through Rhumb Resolve. Managed BYO execution and direct Twilio Lookup v2 returned the same normalized phone record for the same number using Fields=line_type_intelligence.

Pedro / Keel runtime review loop Apr 2, 2026

Twilio: depth-3 rerun confirms phone.lookup parity through Rhumb Resolve

Runtime-verified

Fresh depth-3 runtime rerun passed for Twilio phone.lookup through Rhumb Resolve. Managed BYO execution and direct Twilio Lookup v2 returned the same normalized phone record for the same number using Fields=line_type_intelligence.

Pedro / Keel runtime review loop Apr 2, 2026

Twilio: Depth-4 parity on phone.lookup (+12025551234, 4 fields)

Runtime-verified

Fresh runtime pass verified Twilio Lookup v2 parity through Rhumb Resolve BYO execution. Managed and direct executions matched on phone_number (+12025551234), valid (true), country_code (US), and national_format ((202) 555-1234). A different target number from the depth-3 pass was used to demonstrate generality.

Pedro / Keel runtime review loop Apr 1, 2026
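
A field-by-field parity check of the kind described in this review might look like the following. It is a minimal sketch, not the reviewers' actual harness: the function name parity_mismatches is hypothetical, and the managed record is a stand-in copy rather than a captured response. The field names and values are the ones reported above.

```python
PARITY_FIELDS = ("phone_number", "valid", "country_code", "national_format")


def parity_mismatches(managed: dict, direct: dict, fields=PARITY_FIELDS) -> dict:
    """Return {field: (managed_value, direct_value)} for every field that differs."""
    return {
        f: (managed.get(f), direct.get(f))
        for f in fields
        if managed.get(f) != direct.get(f)
    }


# Values as reported in the review above.
direct = {
    "phone_number": "+12025551234",
    "valid": True,
    "country_code": "US",
    "national_format": "(202) 555-1234",
}
managed = dict(direct)  # stand-in for the record returned via the managed path

mismatches = parity_mismatches(managed, direct)  # an empty dict means parity holds
```

Returning the differing values, rather than a bare boolean, makes a failed parity run easier to diagnose.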

Twilio: depth-3 rerun confirms phone.lookup parity through Rhumb Resolve

Runtime-verified

Fresh depth-3 runtime rerun passed for Twilio phone.lookup through Rhumb Resolve. Managed BYO execution and direct Twilio Lookup v2 returned the same normalized phone record for the same number using Fields=line_type_intelligence.

Pedro / Keel runtime review loop Mar 31, 2026

Twilio: Phase 3 runtime verification passed

Test-backed

Twilio phone.lookup now reaches the correct Lookup API domain through Rhumb Resolve. Same supported line_type_intelligence input returns 200 directly and via Rhumb; legacy carrier field now fails identically on both paths, confirming the routing bug is fixed and the remaining mismatch was stale endpoint metadata.

Rhumb editorial team Mar 26, 2026

Twilio: Auth & Security Model

Runtime-verified

Twilio authentication model: Account SID + Auth Token (HTTP Basic Auth). Evaluated for agent credential management complexity, key scoping granularity, rotation support, and OAuth flow automation potential.

Rhumb editorial team Mar 13, 2026
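
To make the authentication model concrete, the header an agent would attach under HTTP Basic Auth can be sketched as follows. The helper name basic_auth_header and the credential values are placeholders for illustration; real credentials should come from a secret store, not source code.

```python
import base64


def basic_auth_header(account_sid: str, auth_token: str) -> dict:
    """Build an HTTP Basic Authorization header from an Account SID and Auth Token."""
    raw = f"{account_sid}:{auth_token}".encode("utf-8")
    return {"Authorization": "Basic " + base64.b64encode(raw).decode("ascii")}


# Placeholder credentials for illustration only.
headers = basic_auth_header("ACXXXXXXXXXXXXXXXX", "your_auth_token")
```

Because the scheme is plain Basic Auth, credential injection reduces to setting one header, which is part of why it is simple for an agent or proxy to manage.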

Twilio — Agent-Native Service Review

Runtime-verified

Twilio enables agent-driven messaging and notification workflows across channels. Reference: https://www.twilio.com/docs/usage/api. Docs-derived review; no guide depth available at review time.

Rhumb editorial team Mar 13, 2026

Twilio: API Design & Integration

Docs-backed

Twilio provides a REST API for agent integration, with resource-oriented URLs for SMS, voice calls, and programmable messaging. Connection methods and SDK availability were assessed for autonomous operation patterns, including programmatic setup, credential injection, and multi-step workflows.

Rhumb editorial team Mar 13, 2026

Twilio: Error Handling & Reliability

Docs-backed

Twilio's error handling and reliability characteristics for agent consumers. Error responses carry HTTP status codes together with error codes and documentation URLs. Assessed for structured error responses, retry guidance, rate limit transparency, and idempotency support.

Rhumb editorial team Mar 13, 2026
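
An agent consuming these structured errors might parse them along the following lines. This is a sketch under stated assumptions: the JSON keys code, message, and more_info reflect the error body shape commonly shown in Twilio's REST documentation, the sample body is illustrative rather than a captured response, and the retry rule (429 or 5xx) is an assumed policy, not guidance from Twilio.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class ApiError:
    http_status: int
    code: Optional[int]       # provider error code, if present
    message: str
    more_info: Optional[str]  # documentation URL, if present

    @property
    def retryable(self) -> bool:
        # Assumed policy: retry on rate limiting (429) and server errors (5xx).
        return self.http_status == 429 or self.http_status >= 500


def parse_error(http_status: int, body: str) -> ApiError:
    """Parse a JSON error body, tolerating missing fields."""
    data = json.loads(body)
    return ApiError(
        http_status=http_status,
        code=data.get("code"),
        message=data.get("message", ""),
        more_info=data.get("more_info"),
    )


# Illustrative body, not a captured response.
sample = (
    '{"code": 20404, "message": "The requested resource was not found", '
    '"more_info": "https://www.twilio.com/docs/errors/20404", "status": 404}'
)
err = parse_error(404, sample)
```

Keeping the documentation URL on the parsed error lets an agent log or surface it without a second lookup.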

Twilio: Documentation & Developer Experience

Docs-backed

Evaluation of Twilio's developer experience. Official documentation is at https://www.twilio.com/docs/usage/api. API reference documentation is available, and SDKs and client libraries are provided for common languages. Assessed for setup friction, code example quality, API reference completeness, and test/sandbox environment availability.

Rhumb editorial team Mar 13, 2026

Use in your agent

mcp
get_score("twilio")
● Twilio 9.0 L4 Native
exec: 9.0 · access: 9.1

Trust shortcuts

This score is documentation-derived. Treat it as a docs-based evaluation of API design, auth, error handling, and documentation quality.

Read how the score works, how disputes are handled, and how Rhumb scored itself before launch.

Overall tier

L4 Native

9.0 / 10.0

Alternatives

No alternatives captured yet.