$ man video-summarize
/video-summarize(1)
NAME
video-summarize — video summarizer / podcast summarizer / lecture notes generator
SYNOPSIS
POST https://x402.org/v1/video-summarize
Content-Type: application/json
X-PAYMENT: <signed-transferWithAuthorization>
{ ... }
↳ first call → 402 Payment Required. Sign a USDC transferWithAuthorization, then retry with the X-PAYMENT header.
DESCRIPTION
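Video summarizer / podcast summarizer / lecture notes generator. A single call transcribes the media with Whisper v3 and summarizes the transcript with Mistral. Five summary styles are available: tldr, bullets, paragraph, executive, chapters. The response contains both the summary and the full transcript. Source media is limited to 60 minutes.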
INPUT — request schema
| property | type | description | req? |
|---|---|---|---|
| media_url | string | Public URL of the video or podcast audio file to transcribe and summarize (60 minute max length). | required |
| style | string | Summary format. enum: tldr · bullets · paragraph · executive · chapters | optional |
| max_words | number | Target word count cap for the generated summary. | optional |
| language | string | ISO language code hint for Whisper transcription; auto-detected if omitted. | optional |
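A minimal sketch of how a client might assemble the request body from the schema above and catch obvious mistakes before paying for a call. The helper name `build_request` and its client-side checks (rejecting an unknown style or a non-positive `max_words`) are assumptions for illustration; the endpoint's own validation rules are not documented here.

```python
# Allowed values for `style`, taken from the schema table above.
STYLES = ("tldr", "bullets", "paragraph", "executive", "chapters")


def build_request(media_url, style=None, max_words=None, language=None):
    """Return a JSON-ready dict containing only the fields that were set.

    Hypothetical helper: the checks below are client-side assumptions,
    not documented server behaviour.
    """
    if not isinstance(media_url, str) or not media_url.startswith(("http://", "https://")):
        raise ValueError("media_url must be a public http(s) URL")
    body = {"media_url": media_url}  # the only required property
    if style is not None:
        if style not in STYLES:
            raise ValueError("style must be one of " + ", ".join(STYLES))
        body["style"] = style
    if max_words is not None:
        if max_words <= 0:
            raise ValueError("max_words must be positive")
        body["max_words"] = max_words
    if language is not None:
        body["language"] = language  # ISO code hint; omit to auto-detect
    return body
```

Optional properties are left out of the body rather than sent as null, so the service's defaults (such as language auto-detection) apply.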
OUTPUT — response shape
| field | type | description |
|---|---|---|
| summary | string | Generated summary text in the requested style. |
| style | string | Summary style actually used (echoes the input style parameter). |
| transcript | string | Full Whisper v3 transcript of the source media. |
| transcript_chars | number | Character count of the returned transcript. |
| duration_seconds | number | Length of the source media in seconds. |
| detected_languages | array | Languages Whisper detected in the audio, as ISO codes. |
| summary_model | string | Mistral model identifier used to write the summary. |
| transcribe_model | string | Whisper model identifier used for transcription (v3). |
| source_url | string | Echo of the media_url that was transcribed. |
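The response fields listed above map naturally onto a typed record. The sketch below is one way a client might do that; the class name `SummaryResult`, the helper `parse_response`, and the assumption that every listed field is present in a successful response are illustrative, not part of the documented API.

```python
from dataclasses import dataclass, fields
from typing import List


@dataclass
class SummaryResult:
    # Field names and types follow the OUTPUT table above.
    summary: str
    style: str
    transcript: str
    transcript_chars: int
    duration_seconds: float
    detected_languages: List[str]
    summary_model: str
    transcribe_model: str
    source_url: str


def parse_response(payload: dict) -> SummaryResult:
    """Build a SummaryResult, failing loudly if a listed field is missing.

    Assumption: a successful response carries every field in the table.
    """
    names = [f.name for f in fields(SummaryResult)]
    missing = [n for n in names if n not in payload]
    if missing:
        raise KeyError(f"response is missing fields: {missing}")
    # Extra keys the server may add are ignored rather than rejected.
    return SummaryResult(**{n: payload[n] for n in names})
```

Ignoring unknown keys keeps the client tolerant of fields added later, while a missing field surfaces immediately instead of as a later attribute error.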
EXAMPLES — two ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.org/v1/video-summarize \
-H 'Content-Type: application/json' \
-d '{ }'
first response = 402 Payment Required with payment requirements; sign and retry with X-PAYMENT.
EXAMPLE 2 · mcp
# install once
claude mcp add x402 --command "npx x402-deployer-mcp"
# then ask Claude Code:
# "use the video-summarize tool to ..."
MCP server handles payment automatically — your coding agent just calls the tool by name.
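For clients that do not use the MCP server, the pay-then-retry handshake from the SYNOPSIS and EXAMPLE 1 can be sketched as follows. This is an illustration under stated assumptions, not the service's reference client: the function `call_with_payment` is hypothetical, the HTTP transport (`post`) and the wallet signer (`sign`) are supplied by the caller, the 402 body is assumed to be JSON describing the payment requirements, and success is assumed to be HTTP 200.

```python
import json
from typing import Callable, Dict, Tuple

# (status_code, response_body_text)
Response = Tuple[int, str]
# post(url, headers, body_text) -> Response; supplied by the caller.
PostFn = Callable[[str, Dict[str, str], str], Response]
# sign(payment_requirements) -> X-PAYMENT header value; supplied by the caller.
SignFn = Callable[[dict], str]

ENDPOINT = "https://x402.org/v1/video-summarize"


def call_with_payment(post: PostFn, sign: SignFn, body: dict) -> dict:
    """POST once; on 402, sign the returned requirements and retry once."""
    headers = {"Content-Type": "application/json"}
    text = json.dumps(body)
    status, reply = post(ENDPOINT, headers, text)
    if status == 402:
        # Assumption: the 402 body is JSON with the payment requirements.
        requirements = json.loads(reply)
        paid_headers = {**headers, "X-PAYMENT": sign(requirements)}
        status, reply = post(ENDPOINT, paid_headers, text)
    if status != 200:
        # Assumption: anything other than 200 after the retry is a failure.
        raise RuntimeError(f"request failed with HTTP {status}")
    return json.loads(reply)
```

Passing the transport and signer in as callables keeps the sketch independent of any particular HTTP library or wallet SDK, and lets the flow be exercised with stubs before any USDC is spent.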
METADATA
- tags: video · podcast · summarize · transcribe · whisper
- env: FAL_KEY_TRANSCRIBE · VENICE_API_KEY
- methods: POST
- cluster: mediakit
- price: $0.10 USDC per call
ADJACENT — other endpoints in mediakit
| endpoint | description | price |
|---|---|---|
| extract-tables | Extract tables from PDF / table extractor / PDF to CSV / spreadsheet from PDF. | $0.10 |
| mp4-to-mp3 | MP4 → MP3 audio extractor. | $0.10 |
| pdf-extract-tables | PDF table extractor / table from PDF / scanned-table parsing / financial-table OCR / multi-page table consolidator / Datalab Marker tables. | $0.10 |
| pdf-to-jpg | PDF to JPG / PNG / WEBP image converter. | $0.10 |
| speaker-diarize | Speaker diarization / who-said-what transcription. | $0.10 |
| transcribe | Video / audio transcription via Whisper v3. | $0.10 |
| upscale-image | AI image upscaler / super-resolution / image enlarger. | $0.10 |
| video-to-audio | Video → audio extractor / video to audio converter. | $0.10 |
SEE ALSO