Amazon Polly: Comprehensive Agent-Usability Assessment
Docs-backedPolly provides high-quality TTS with Neural voices (NTTS) across 60+ languages and 100+ voices. For agents generating voice audio: synchronous SynthesizeSpeech for short texts (returns audio stream directly), async StartSpeechSynthesisTask for longer content (writes MP3/OGG to S3). SSML support for pronunciation control, pauses, emphasis, and speaking rate adjustments. Neural voices sound significantly better than standard voices. Confidence is docs-derived.