Skip to content
clusters: prooflayer · edgemarket · edgefinance · synthforge · mediakit · wordmint · webprobe · locale · comppoint
$ man video-to-subtitles

/video-to-subtitles(1)

agentutility / mediakit / video-to-subtitles
PRICE / CALL
$0.02
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
mediakit
CATEGORY
media
STATUS
live
NAME
video-to-subtitles srt / vtt subtitle generator from video or audio
SYNOPSIS
POST https://x402.org/v1/video-to-subtitles
     Content-Type: application/json
     X-PAYMENT:    <signed-transferWithAuthorization>

     { ... }
↳ first call → 402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.
DESCRIPTION

SRT / VTT subtitle generator from video or audio. Whisper v3 powered. Word-wrapped, ready for VLC / Premiere / FFmpeg. Auto-detect language + translate-to-English.

INPUTrequest schema
propertytypedescriptionreq?
media_urlstringPublic URL of an audio or video file. mp3, mp4, mpeg, mpga, m4a, wav, webm. Max 60 minutes.required
formatstring'srt' (default) or 'vtt'.
enum: srt · vtt
optional
languagestringOptional ISO language code. Auto-detected if omitted.optional
taskstring'transcribe' (default) or 'translate' (translates to English).
enum: transcribe · translate
optional
max_chars_per_linenumberMax characters per subtitle line. Default 42. Range 16-120.optional
OUTPUTresponse shape
fieldtypedescription
subtitlesstringFull subtitle file content as a string.
formatstringEcho of the format used.
mime_typestringMIME type for the subtitle format ('application/x-subrip' or 'text/vtt').
cue_countnumberNumber of subtitle cues generated.
duration_secondsnumberSource media duration.
detected_languagesarrayLanguages auto-detected in the audio.
taskstringEcho of the task performed.
source_urlstringEcho of the input URL.
EXAMPLEStwo ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.org/v1/video-to-subtitles \
  -H 'Content-Type: application/json' \
  -d '{ }'
first response = 402 Payment Required with payment requirements; sign + retry with X-PAYMENT.
EXAMPLE 2 · mcp
# install once
claude mcp add x402 --command "npx x402-deployer-mcp"

# then ask Claude Code:
# "use the video-to-subtitles tool to ..."
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
tags
subtitlessrtvttcaptionswhispertranscription
env
FAL_KEY_TRANSCRIBE
methods
POST
cluster
mediakit
price
$0.02 USDC per call
ADJACENTother endpoints in mediakit
endpointdescriptionprice
audio-loudnormAudio loudness normalizer (EBU R128 LUFS).$0.02
csv-to-jsonlCSV to JSON / CSV to JSONL converter / data pipeline preprocessor.$0.02
image-translateImage translator: vision-OCR + Venice translate.$0.02
image-upscaleImage upscale / 2x upscaler / 4x upscaler / super-resolution / sharpen image / enlarge image without loss.$0.02
pdf-watermarkPDF watermark / image watermark / video watermark — text or image overlay on PDFs, PNG/JPG/GIF, or MP4/MOV/WEBM.$0.02
video-trimVideo trimmer / video cutter / video clip tool.$0.02
watermarkPDF / image / video watermarking — text or image overlay.$0.02
watermark-pdfAdd watermark to PDF.$0.02
SEE ALSO
agentutility(7) · mediakit(7) · x402(7) · mcp(7) · llms.txt · registry.json · bazaar.x402.org