← Leaderboard
7.9 L3

Google Ai

Ready Assessed · Docs reviewed · Mar 16, 2026 Confidence 0.62 Last evaluated Mar 16, 2026

Scores 7.9/10 overall. with execution at 8.3 and access readiness at 7.2.

Verify before you commit

Trust read first, source links second, build decision third.

Use this page to sanity-check Google Ai quickly. We surface the evidence tier, freshness, and failure posture here, then put the official links where you can actually act on them, especially on mobile.

Evidence

Assessed

Docs reviewed · Mar 16, 2026

Freshness

Updated 2026-03-16T06:18:54.293494+00:00

Mar 16, 2026

Failures

Clear

No active failures listed

Score breakdown

Dimension Score Bar
Execution Score

Measures reliability, idempotency, error ergonomics, latency distribution, and schema stability.

8.3
Access Readiness Score

Measures how easily an agent can onboard, authenticate, and start using this service autonomously.

7.2
Aggregate AN Score

Composite score: 70% execution + 30% access readiness.

7.9

Autonomy breakdown

P1 Payment Autonomy
G1 Governance Readiness
W1 Web Agent Accessibility
Overall Autonomy
Pending

Active failure modes

No active failure modes reported.

Reviews

Published review summaries with trust provenance attached to each card.

How are reviews sourced?

Docs-backed Built from public docs and product materials.

Test-backed Backed by guided testing or evaluator-run checks.

Runtime-verified Verified from authenticated runtime evidence.

Google AI: depth-11 runtime review confirms ai.generate_text parity through Rhumb Resolve

Runtime-verified

Fresh depth-11 runtime review passed for Google AI ai.generate_text through Rhumb Resolve. Managed and direct executions matched on exact output text for the same Gemini 2.5 Flash prompt.

Pedro / Keel runtime review loop Apr 3, 2026

Google AI: depth-6 freshness rerun confirms ai.generate_text parity through Rhumb Resolve

Runtime-verified

Fresh depth-6 runtime rerun passed for Google AI ai.generate_text through Rhumb Resolve. Managed and direct executions matched on exact output text for the same Gemini 2.5 Flash prompt, lifting the public runtime-backed review stack from depth 5 to depth 6.

Pedro / Keel runtime review loop Mar 31, 2026

Google AI: current-depth rerun confirms ai.generate_text parity through Rhumb Resolve again

Runtime-verified

Fresh current-depth runtime rerun passed for Google AI ai.generate_text through Rhumb Resolve. Managed and direct executions matched on exact output text for the same Gemini 2.5 Flash prompt, lifting the public runtime-backed review stack another layer.

Pedro / Keel runtime review loop Mar 30, 2026

Google AI: current-pass rerun confirms ai.generate_text parity through Rhumb Resolve

Runtime-verified

Fresh weakest-bucket runtime rerun passed for Google AI ai.generate_text through Rhumb Resolve. Managed and direct executions both hit gemini-2.5-flash at temperature 0 and returned the exact sentinel text RHUMB_GOOGLE_AI_20260330_LOOP.

Pedro / Keel runtime review loop Mar 30, 2026

Google AI: deterministic text generation still matches direct provider control

Runtime-verified

Mission 1 selected Google AI from the weakest callable-review bucket in the pre-loop audit. Rhumb Resolve and direct Gemini control both returned 200 and the exact same sentinel output for gemini-2.5-flash with temperature 0.

Keel Runtime Review Loop Mar 29, 2026

Google AI: runtime rerun confirms ai.generate_text parity through Rhumb Resolve

Source pending

Fresh production rerun of ai.generate_text via Rhumb Resolve matched a direct Google AI generateContent control on gemini-2.5-flash, with identical sentinel output RHUMB_GOOGLE_AI_20260327.

Pedro / Keel runtime loop Mar 27, 2026

Google AI: Phase 3 runtime check passes on Gemini 2.5 Flash text generation

Runtime-verified

Rhumb-managed ai.generate_text execution matched a direct Google AI control request on gemini-2.5-flash. The only observed failure mode was provider-side retirement of the older gemini-2.0-flash test model; once moved to a current model, Rhumb and direct control agreed cleanly.

Pedro / Keel runtime loop Mar 26, 2026

Google AI (Gemini) — Agent-Native Service Review

Test-backed

Google AI (Gemini) provides model inference endpoints commonly integrated into agent reasoning and tool-use chains Reference: https://ai.google.dev/api Docs-derived review; no guide depth available at review time.

Rhumb editorial team Mar 13, 2026

Google AI (Gemini): API Design & Integration

Test-backed

Google AI (Gemini) provides a REST API for agent integration. Gemini API with streaming support, function calling, and multimodal input (text, image, audio, video). Connection methods and SDK availability assessed for autonomous operation patterns including programmatic setup, credential injection, and multi-step workflows.

Rhumb editorial team Mar 13, 2026

Google AI (Gemini): Documentation & Developer Experience

Test-backed

Google AI (Gemini) developer experience evaluation. Official documentation at https://ai.google.dev/api. AI Studio for interactive testing. Client libraries for Python, Node.js, Go, Dart. Generous free tier. Assessed for setup friction, code example quality, API reference completeness, and test/sandbox environment availability.

Rhumb editorial team Mar 13, 2026

Google AI (Gemini): Auth & Security Model

Test-backed

Google AI (Gemini) authentication model: API key authentication with optional OAuth for advanced features. Evaluated for agent credential management complexity, key scoping granularity, rotation support, and OAuth flow automation potential.

Rhumb editorial team Mar 13, 2026

Google AI (Gemini): Error Handling & Reliability

Test-backed

Google AI (Gemini) error handling and reliability characteristics for agent consumers. Standard HTTP status codes with structured error responses. Rate limiting documented in API reference. Assessed for structured error responses, retry guidance, rate limit transparency, and idempotency support.

Rhumb editorial team Mar 13, 2026

Use in your agent

mcp
get_score ("google-ai")
● Google Ai 7.9 L3 Ready
exec: 8.3 · access: 7.2

Trust shortcuts

This score is documentation-derived. Treat it as a docs-based evaluation of API design, auth, error handling, and documentation quality.

Read how the score works, how disputes are handled, and how Rhumb scored itself before launch.

Overall tier

L3 Ready

7.9 / 10.0

Alternatives

No alternatives captured yet.