$ man subtitles
/subtitles(1)
PRICE / CALL
$0.08
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
mediakitCATEGORY
media
STATUS
● live
NAME
subtitles — srt / vtt subtitle generator from video or audio
SYNOPSIS
POST https://x402.org/v1/subtitles
Content-Type: application/json
X-PAYMENT: <signed-transferWithAuthorization>
{ ... }↳ first call →
402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.DESCRIPTION
SRT / VTT subtitle generator from video or audio. Whisper v3. Word-wrapped, ready for VLC / Premiere / FFmpeg.
INPUT — request schema
| property | type | description | req? |
|---|---|---|---|
| media_url | string | Public URL of an audio or video file. mp3, mp4, mpeg, mpga, m4a, wav, webm. Max 60 minutes. | required |
| format | string | 'srt' (default) or 'vtt'. | optional |
| language | string | Optional ISO language code. Auto-detected if omitted. | optional |
| task | string | 'transcribe' (default) or 'translate' (translates to English). | optional |
| max_chars_per_line | number | Max characters per subtitle line. Default 42. Range 16-120. | optional |
OUTPUT — response shape
| field | type | description |
|---|---|---|
| subtitles | string | Full subtitle file content as a string. |
| format | string | Echo of the format used. |
| mime_type | string | MIME type for the subtitle format ('application/x-subrip' or 'text/vtt'). |
| cue_count | number | Number of subtitle cues generated. |
| duration_seconds | number | Source media duration. |
| detected_languages | array | Languages auto-detected in the audio. |
| task | string | Echo of the task performed. |
| source_url | string | Echo of the input URL. |
EXAMPLES — two ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.org/v1/subtitles \
-H 'Content-Type: application/json' \
-d '{ }'first response =
402 Payment Required with payment requirements; sign + retry with X-PAYMENT.EXAMPLE 2 · mcp
# install once claude mcp add x402 --command "npx x402-deployer-mcp" # then ask Claude Code: # "use the subtitles tool to ..."
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
- tags
- subtitlessrtvttcaptionswhispertranscription
- env
- FAL_KEY_TRANSCRIBE
- methods
- POST
- cluster
- mediakit
- price
- $0.08 USDC per call
ADJACENT — other endpoints in mediakit
| endpoint | description | price |
|---|---|---|
| html-to-pdf | URL to PDF / HTML to PDF / webpage screenshot to PDF. | $0.08 |
| extract-tables | Extract tables from PDF / table extractor / PDF to CSV / spreadsheet from PDF. | $0.10 |
| mp4-to-mp3 | MP4 → MP3 audio extractor. | $0.10 |
| pdf-extract-tables | PDF table extractor / table from PDF / scanned-table parsing / financial-table OCR / multi-page table consolidator / Datalab Marker tables. | $0.10 |
| pdf-to-jpg | PDF to JPG / PNG / WEBP image converter. | $0.10 |
| speaker-diarize | Speaker diarization / who-said-what transcription. | $0.10 |
| transcribe | Video / audio transcription via Whisper v3. | $0.10 |
| upscale-image | AI image upscaler / super-resolution / image enlarger. | $0.10 |
SEE ALSO