Skip to content
clusters: prooflayer · edgemarket · edgefinance · synthforge · mediakit · wordmint · webprobe · locale · comppoint
$ man video-summarize

/video-summarize(1)

agentutility / mediakit / video-summarize
PRICE / CALL
$0.10
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
mediakit
CATEGORY
ai
STATUS
live
NAME
video-summarize video summarizer / podcast summarizer / lecture notes generator
SYNOPSIS
POST https://x402.org/v1/video-summarize
     Content-Type: application/json
     X-PAYMENT:    <signed-transferWithAuthorization>

     { ... }
↳ first call → 402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.
DESCRIPTION

Video summarizer / podcast summarizer / lecture notes generator. One call: Whisper v3 transcribes + Mistral summarizes. 5 styles: tldr, bullets, paragraph, executive, chapters. Returns summary + transcript. 60 min max.

INPUTrequest schema
propertytypedescriptionreq?
media_urlstringPublic URL of the video or podcast audio file to transcribe and summarize (60 minute max length).required
stylestringSummary format: tldr, bullets, paragraph, executive, or chapters.
enum: tldr · bullets · paragraph · executive · chapters
optional
max_wordsnumberTarget word count cap for the generated summary.optional
languagestringISO language code hint for Whisper transcription; auto-detected if omitted.optional
OUTPUTresponse shape
fieldtypedescription
summarystringGenerated summary text in the requested style.
stylestringSummary style actually used (echoes the input style parameter).
transcriptstringFull Whisper v3 transcript of the source media.
transcript_charsnumberCharacter count of the returned transcript.
duration_secondsnumberLength of the source media in seconds.
detected_languagesarrayLanguages Whisper detected in the audio, as ISO codes.
summary_modelstringMistral model identifier used to write the summary.
transcribe_modelstringWhisper model identifier used for transcription (v3).
source_urlstringEcho of the media_url that was transcribed.
EXAMPLEStwo ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.org/v1/video-summarize \
  -H 'Content-Type: application/json' \
  -d '{ }'
first response = 402 Payment Required with payment requirements; sign + retry with X-PAYMENT.
EXAMPLE 2 · mcp
# install once
claude mcp add x402 --command "npx x402-deployer-mcp"

# then ask Claude Code:
# "use the video-summarize tool to ..."
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
tags
videopodcastsummarizetranscribewhisper
env
FAL_KEY_TRANSCRIBE · VENICE_API_KEY
methods
POST
cluster
mediakit
price
$0.10 USDC per call
ADJACENTother endpoints in mediakit
endpointdescriptionprice
extract-tablesExtract tables from PDF / table extractor / PDF to CSV / spreadsheet from PDF.$0.10
mp4-to-mp3MP4 → MP3 audio extractor.$0.10
pdf-extract-tablesPDF table extractor / table from PDF / scanned-table parsing / financial-table OCR / multi-page table consolidator / Datalab Marker tables.$0.10
pdf-to-jpgPDF to JPG / PNG / WEBP image converter.$0.10
speaker-diarizeSpeaker diarization / who-said-what transcription.$0.10
transcribeVideo / audio transcription via Whisper v3.$0.10
upscale-imageAI image upscaler / super-resolution / image enlarger.$0.10
video-to-audioVideo → audio extractor / video to audio converter.$0.10
SEE ALSO
agentutility(7) · mediakit(7) · x402(7) · mcp(7) · llms.txt · registry.json · bazaar.x402.org