$ man voice
/voice(1)
PRICE / CALL
$0.05
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
synthforgeCATEGORY
ai
STATUS
● live
NAME
voice — text-to-speech / tts / voice synthesis
SYNOPSIS
POST https://x402.org/v1/voice
Content-Type: application/json
X-PAYMENT: <signed-transferWithAuthorization>
{ ... }↳ first call →
402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.DESCRIPTION
Text-to-speech / TTS / voice synthesis. Venice TTS (Kokoro/xAI/ElevenLabs/Orpheus/MiniMax). 30+ voices, MP3/WAV/OPUS/AAC/FLAC.
INPUT — request schema
| property | type | description | req? |
|---|---|---|---|
| text | string | Max 4000 chars. | required |
| voice | string | Default 'af_sky'. | optional |
| model | string | Default 'tts-kokoro'. Other options: tts-xai-v1, tts-elevenlabs-turbo-v2-5, tts-orpheus, etc. | optional |
| speed | number | 0.25-4. Default 1. | optional |
| format | string | mp3 (default), wav, opus, aac, flac. | optional |
OUTPUT — response shape
| field | type | description |
|---|---|---|
| audio_url | string | Signed URL to the generated audio file hosted on R2 or similar object storage. |
| file_size_bytes | number | Size of the generated audio file in bytes. |
| content_type | string | MIME type of the audio response, like audio/mpeg or audio/wav. |
| format | string | Audio container/codec returned: mp3, wav, opus, aac, or flac. |
| voice | string | Voice ID used for synthesis from the 30+ Venice TTS voices. |
| model | string | TTS model that produced the audio: Kokoro, xAI, ElevenLabs, Orpheus, or MiniMax. |
| speed | number | Playback speed multiplier applied during synthesis, where 1.0 is normal pace. |
| input_chars | number | Number of characters in the input text that were synthesized. |
EXAMPLES — two ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.org/v1/voice \
-H 'Content-Type: application/json' \
-d '{ }'first response =
402 Payment Required with payment requirements; sign + retry with X-PAYMENT.EXAMPLE 2 · mcp
# install once claude mcp add x402 --command "npx x402-deployer-mcp" # then ask Claude Code: # "use the voice tool to ..."
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
- tags
- ttsspeechaudiovoiceai
- env
- VENICE_API_KEY · FAL_KEY
- methods
- POST
- cluster
- synthforge
- price
- $0.05 USDC per call
ADJACENT — other endpoints in synthforge
| endpoint | description | price |
|---|---|---|
| music-generate | Music generation / text-to-music / AI music / generative song / instrumental and vocal music. | $0.05 |
| text-to-speech | Text to speech / TTS / voice generator. | $0.05 |
| remove-bg | AI background remover / background eraser / cutout tool. | $0.08 |
| image-edit | Image edit / instruction-based image edit / text-driven photo edit / nano-banana image editor / GPT-image-2 edit. | $0.02 |
| image-inpaint | Image inpainting / mask-based image edit / fill in masked region / object replacement / face swap (mask-driven) / generative fill. | $0.02 |
| image-generate | Image generate (fast/cheap) / text-to-image / AI art. | $0.01 |
| sound-effect-generate | Sound effect generation / text-to-SFX / Foley generator / ElevenLabs sound effects / ambient audio synth. | $0.01 |
| image-generate-pro | Image generate (pro) / premium text-to-image / Flux 2 Pro / Recraft / Seedream / Qwen Image 2 Pro / xAI Grok Imagine. | $0.10 |
SEE ALSO