Skip to main content
The Scrapio MCP server gives AI agents direct access to the full web data surface through a single registered server — fetch, crawl, Google search, YouTube transcripts, Amazon and Walmart product data, browser automation, and async jobs.

Install

No installation required. Run it with npx:
SCRAPIO_API_KEY=sk-... npx @scrapio/mcp
The server starts on stdio by default and is ready to accept connections from your agent runtime.

Configure your agent

Edit ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):
{
  "mcpServers": {
    "scrapio": {
      "command": "npx",
      "args": ["-y", "@scrapio/mcp"],
      "env": {
        "SCRAPIO_API_KEY": "sk-..."
      }
    }
  }
}
Restart Claude Desktop. The tools appear automatically — no further setup needed.

Available tools

ToolWhat it does
fetchFetch a URL and return its content as markdown, HTML, JSON, or a screenshot
google_searchSearch Google and return structured results — organic, news, images, or shopping
youtube_transcriptGet the full transcript for a YouTube video by URL or video ID
amazon_productGet structured product data from Amazon by URL, ASIN, or search query
walmart_searchSearch Walmart and return structured product listings
interactDrive a real browser through a multi-step workflow — click, type, scroll, wait
crawlCrawl one or more seed URLs and return content for every page visited
submit_jobSubmit a long-running job asynchronously and return a job ID
get_jobCheck job status and retrieve the result when complete

Example prompts

Once the server is connected, your agent can use natural language to call these tools:
“Fetch the content of https://news.ycombinator.com and summarize the top 5 stories.”
“Search Google for ‘best TypeScript ORM 2025’ and give me the top 3 results.”
“Get the transcript of this YouTube video: https://www.youtube.com/watch?v=dQw4w9WgXcQ
“Find the current price of the Sony WH-1000XM5 headphones on Amazon.”
“Crawl https://docs.example.com up to 20 pages and extract all the API endpoint descriptions.”
“Go to https://app.example.com, click the login button, type my email and password, and return the dashboard content.”

Async jobs

Some operations — large crawls, slow pages, multi-step interactions — take longer than a single tool call can wait. Use submit_job to queue the work and get_job to poll for the result:
Agent: submit_job(kind="crawl", input={ seeds: ["https://docs.example.com"], max_pages: 50 })
→ { job_id: "abc123", status: "queued" }

Agent: get_job(job_id="abc123")
→ { status: "running", ... }

Agent: get_job(job_id="abc123")
→ { status: "completed", result: { pages: [...] } }

Environment variables

VariableRequiredDescription
SCRAPIO_API_KEYYesYour API key. Get one at scrapio.dev.
SCRAPIO_BASE_URLNoOverride the API base URL (for local development or staging).
MCP_PORTNoPort for HTTP transport mode. Defaults to 3010.

Troubleshooting

Restart your agent runtime after adding the server config — most runtimes only discover MCP servers on startup.
The env block in your agent config is not being passed to the process. Double-check that your config file uses the correct format for your runtime and that the key name matches exactly.
Your API key is set but invalid or revoked. Generate a new key in the dashboard.
Your account has no remaining credits. Add credits at app.scrapio.dev.
The first run downloads the package. Subsequent calls use the npm cache and start in under a second. For production deployments, install globally with npm install -g @scrapio/mcp and reference the binary directly.