Skip to content
clusters: prooflayer · edgemarket · edgefinance · synthforge · mediakit · wordmint · webprobe · locale · comppoint
$ man scrape-website

/scrape-website(1)

agentutility / web-probe / scrape-website
PRICE / CALL
$0.04
USDC · base mainnet · scheme: exact
METHOD
POST
CLUSTER
webprobe
CATEGORY
utilities
STATUS
live
NAME
scrape-website web scraper / html to text / website content extractor
SYNOPSIS
POST https://x402.org/v1/scrape-website
     Content-Type: application/json
     X-PAYMENT:    <signed-transferWithAuthorization>

     { ... }
↳ first call → 402 Payment Required. Sign USDCtransferWithAuthorization, retry with theX-PAYMENT header.
DESCRIPTION

Web scraper / HTML to text / website content extractor. Returns title, OG/Twitter meta, body in text/HTML/markdown. Optional links. Cheerio-based — fast, no headless browser.

INPUTrequest schema
propertytypedescriptionreq?
urlstringPublic http/https URL to scrape.required
formatstring'text' (default), 'html', or 'markdown'.
enum: text · html · markdown
optional
include_linksbooleanInclude array of unique links found on the page (max 500). Default false.optional
user_agentstringOverride the User-Agent header.optional
OUTPUTresponse shape
fieldtypedescription
urlstringFinal URL fetched after following redirects from the requested page.
final_urlstringFinal URL after redirects.
status_codenumberHTTP status code returned by the target server for the scrape request.
titlestringPage title pulled from the <title> tag or og:title fallback.
descriptionstringMeta description from the page's description, og:description, or twitter:description tag.
canonicalstringCanonical URL declared by the page's <link rel="canonical"> tag.
langstringLanguage code from the <html lang="..."> attribute (e.g. en, es).
h1stringText content of the page's first <h1> heading.
ogobjectAll og:* meta tags.
twitterobjectAll twitter:* meta tags.
textstringBody when format=text.
htmlstringBody when format=html.
markdownstringBody when format=markdown.
formatstringBody format returned: text, html, or markdown depending on the requested format param.
linksarrayArray of {href, text} when include_links=true.
body_charsnumberCharacter count of the extracted body content in the chosen format.
EXAMPLEStwo ways to call
EXAMPLE 1 · curl
curl -X POST https://x402.org/v1/scrape-website \
  -H 'Content-Type: application/json' \
  -d '{ }'
first response = 402 Payment Required with payment requirements; sign + retry with X-PAYMENT.
EXAMPLE 2 · mcp
# install once
claude mcp add x402 --command "npx x402-deployer-mcp"

# then ask Claude Code:
# "use the scrape-website tool to ..."
MCP server handles payment automatically — your coding agent just calls the tool by name.
METADATA
tags
scrapefetchhtmlextractmetadataurl
methods
POST
cluster
webprobe
price
$0.04 USDC per call
ADJACENTother endpoints in webprobe
endpointdescriptionprice
arxiv-summarizearXiv paper summarizer / research-paper TLDR.$0.04
rss-from-anythingRSS feed generator / HTML to RSS converter.$0.04
scrapeWeb scraper / HTML extractor.$0.04
screenshotWebsite screenshot / URL to PNG/JPG.$0.04
webpage-diffWebpage change detection / website monitor / content diff.$0.04
website-screenshotURL to PNG / JPG / website screenshot tool.$0.04
arxiv-searcharXiv full-text search.$0.03
domain-suggestBrandable domain suggester / startup-name brainstormer / product-name proposer / domain candidate generator.$0.05
SEE ALSO
agentutility(7) · webprobe(7) · x402(7) · mcp(7) · llms.txt · registry.json · bazaar.x402.org