Name: voice
Price: 0.05 USDC
Availability: InStock

$ man voice

agentutility / synthforge / voice

PRICE / CALL

$0.05

USDC · base mainnet · scheme: exact

METHOD

POST

CLUSTER

synthforge

CATEGORY

STATUS

● live

NAME

voice — converts text to speech with 30+ voices and mp3/wav/opus/aac/flac output

SYNOPSIS

POST https://x402.agentutility.ai/voice
     Content-Type: application/json
     X-PAYMENT:    <signed-transferWithAuthorization>

     { ... }

↳ first call → 402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.

DESCRIPTION

Converts text to speech with 30+ voices and MP3/WAV/OPUS/AAC/FLAC output. Powered by Venice TTS (Kokoro/xAI/ElevenLabs/Orpheus/MiniMax). Use it as a TTS or voice synthesis API.

INPUT — request schema

property	type	description	req?
text	string	—	required
voice	string	—	optional
model	string	—	optional
speed	number	—	optional
format	string	—	optional

OUTPUT — response shape

field	type	description
audio_url	string	Signed URL to the generated audio file hosted on R2 or similar object storage.
file_size_bytes	number	Size of the generated audio file in bytes.
content_type	string	MIME type of the audio response, like audio/mpeg or audio/wav.
format	string	Audio container/codec returned: mp3, wav, opus, aac, or flac.
voice	string	Voice ID used for synthesis from the 30+ Venice TTS voices.
model	string	TTS model that produced the audio: Kokoro, xAI, ElevenLabs, Orpheus, or MiniMax.
speed	number	Playback speed multiplier applied during synthesis, where 1.0 is normal pace.
input_chars	number	Number of characters in the input text that were synthesized.

EXAMPLES — two ways to call

EXAMPLE 1 · curl

curl -X POST https://x402.agentutility.ai/voice \
  -H 'Content-Type: application/json' \
  -d '{ }'

first response = 402 Payment Required with payment requirements; sign + retry with X-PAYMENT.

EXAMPLE 2 · mcp

# Install the MCP package for this endpoint's cluster
npx -y @agentutility/mcp-<cluster>

# Required: EVM private key with USDC on Base
export X402_PRIVATE_KEY=0x...

# Then call the voice tool from your MCP-aware agent.

MCP server handles payment automatically — your coding agent just calls the tool by name.

METADATA

tags: ttsspeechaudiovoiceai
env: VENICE_API_KEY · FAL_KEY
methods: POST
cluster: synthforge
price: $0.05 USDC per call

ADJACENT — other endpoints in synthforge

endpoint	description	price
music-generate	Generates music from a text prompt via Venice using the minimax-music-v26 model.	$0.05
text-to-speech	Converts text to speech with 30+ voices and 5 audio formats.	$0.05
image-generate-pro	Premium text-to-image generation across margin-safe Venice models at a competitive $0.06/call.	$0.06
recraft	Generates SFW design and illustration images with Venice's recraft-v4 model on a dedicated endpoint.	$0.06
seedream	Generates SFW images with Venice's seedream-v4 model on a dedicated endpoint.	$0.06
flux-2-pro	Generates SFW images with Venice's flux-2-pro model on a dedicated endpoint.	$0.04
qwen-image	Generates SFW images with Venice's qwen-image model on a dedicated endpoint.	$0.04
background-remove	Removes the background from a public image URL and returns the subject with alpha transparency.	$0.08