Web Scraping Stack
Extract, crawl, and process web data at scale with headless browsers, anti-detection, and structured file output.
Token Budget
Moderate token usage. You will have good room for prompts but watch if adding more servers.
Config
Paste in ~/.claude.json
{
"mcpServers": {
"playwright-mcp": {
"command": "npx",
"args": [
"-y",
"@playwright/mcp"
]
},
"browserbase-mcp": {
"command": "npx",
"args": [
"-y",
"@browserbasehq/mcp-server-browserbase"
],
"env": {
"BROWSERBASE_API_KEY": "YOUR_BROWSERBASE_API_KEY",
"BROWSERBASE_PROJECT_ID": "YOUR_BROWSERBASE_PROJECT_ID"
}
},
"filesystem-mcp": {
"command": "npx",
"args": [
"-y",
"@modelcontextprotocol/server-filesystem",
"/path/to/allowed/directory"
]
},
"puppeteer-mcp": {
"command": "npx",
"args": [
"-y",
"@modelcontextprotocol/server-puppeteer"
]
},
"firecrawl-mcp": {
"command": "npx",
"args": [
"-y",
"firecrawl-mcp"
],
"env": {
"FIRECRAWL_API_KEY": "YOUR_FIRECRAWL_API_KEY"
}
}
}
}Included Servers
Firecrawl MCP
Mendable
Scrape and crawl websites, extract structured data, and perform batch web scraping with LLM-powered content analysis.
Crawl entire websites and extract clean, structured content from any page — handles JavaScript rendering and pagination automatically.
BrowserBase MCP
BrowserBase
Automate cloud browsers with BrowserBase and Stagehand to navigate pages, extract content, take screenshots, and interact with web elements.
Run headless browsers in the cloud with built-in anti-detection, proxy rotation, and CAPTCHA solving for scraping at scale.
Playwright MCP
Microsoft
Automate browser interactions, take screenshots, fill forms, and test web applications using Microsoft Playwright from your AI editor.
Automate complex browser interactions like login flows, infinite scrolling, and multi-step navigation to reach deeply nested content.
Puppeteer MCP
Anthropic
Automate browsers with Puppeteer. Navigate pages, take screenshots, fill forms, and generate PDFs from your AI editor.
Control Chromium directly for fine-grained scraping tasks like screenshot capture, PDF generation, and dynamic page interaction.
Filesystem MCP
Anthropic
Read, write, search, and manage files on your local filesystem with secure directory-scoped access for your AI editor.
Save scraped data as JSON, CSV, or structured files locally — the final step in any data extraction pipeline.
Recommended Skills
Skills that pair with the servers in this stack. Learn more
For Firecrawl MCP
Web scraping and content extraction workflows using Firecrawl. Includes crawl configuration and data pipeline patterns.
npx skillsadd firecrawl/cli/firecrawlFor Playwright MCP
Official Microsoft Playwright CLI skill for browser automation, E2E testing, and web scraping workflows.
npx skillsadd microsoft/playwright-cliTest generation, coverage analysis, and E2E testing patterns. Pairs with Playwright MCP for automated browser testing.