Give Your Agent A Voice: x402 pay-per-call speech with 20 voices, 10 personas, 31 languages, granular speed and quality controls, OpenAI-shaped requests, and batch audio
AWS Polly provider for server-side TTS with speech marks support
SchoolCity-backed provider for server-side TTS with speech marks support
Google Cloud Text-to-Speech provider for server-side TTS with speech marks support
Core interfaces and types for server-side TTS providers
TTS interfaces and types for the PIE Assessment Toolkit, with no UI dependencies
Client-side TTS provider that calls a server API for synthesis
Text-to-speech tool for the PIE assessment player with word-level highlighting
Official TypeScript/JavaScript client for the My Magic Pencil developer API — generate animated, narrated whiteboard lessons, books, and question papers; synthesize speech; build playlists; embed live plays; and export PDFs.
An easy way to run AI models in React Native with ExecuTorch
Universal, cross-platform text-to-speech SDK with multi-provider support.
Official ContentHero CLI. Generate media, run the content pipeline, and read your brand/research context from the terminal or any agent shell. Rides the @contenthero/sdk kernel.
A high-performance React Native library for text-to-speech on iOS and Android
Build web apps with lifelike AI characters. The Convai Web SDK gives you real-time voice, lipsync, emotions, and dynamic context — with first-class support for React and vanilla TypeScript.
Mastra Cloudflare AI voice integration
Official TypeScript SDK for Camb.ai - Advanced voice and audio generation APIs
OpenAI adapter for TanStack AI chat, tools, images, video, speech, transcription, realtime, and structured outputs.
Google Gemini adapter for TanStack AI chat, images, speech, audio generation, and structured outputs.
fal.ai adapter for TanStack AI image, video, audio, speech, and transcription generation.
ElevenLabs adapter for TanStack AI realtime voice, text-to-speech, transcription, music, and sound effects.
Local MCP server for the Voice API (Chatterbox TTS + Whisper STT). Runs on your machine, reads local audio files, and streams them to the HTTP API — so large files never pass through the model's context.
ElevenLabs provider for sound effect generation, text-to-speech, and audio APIs.
AI Agents discover, invoke, and settle paid LLMs, APIs, and Skills — starting with Skill Boss. No manual key setup. No prepayment. Pay-per-call via x402 or Agent Card.