lazymac2x/web-scraper-toolkit
Universal web scraper API — extract metadata, links, headlines, images, tables from any URL. REST + MCP server.
Platform-specific configuration:
{
  "mcpServers": {
    "web-scraper-toolkit": {
      "command": "npx",
      "args": [
        "-y",
        "web-scraper-toolkit"
      ]
    }
  }
}

Add the config above to .claude/settings.json under the mcpServers key.
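If .claude/settings.json already registers other MCP servers, the new entry has to be merged in rather than pasted over the file. The following is a minimal Python sketch of that merge; the helper names `add_mcp_server` and `update_settings_file` are hypothetical and not part of this project.

```python
import json
from pathlib import Path

# The server entry from the configuration shown above.
SERVER_ENTRY = {"command": "npx", "args": ["-y", "web-scraper-toolkit"]}


def add_mcp_server(settings: dict, name: str, entry: dict) -> dict:
    """Return a copy of settings with entry registered under mcpServers[name].

    Hypothetical helper: existing servers and unrelated keys are preserved,
    and the input dict is not mutated.
    """
    merged = dict(settings)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = entry
    merged["mcpServers"] = servers
    return merged


def update_settings_file(path: Path) -> None:
    """Read the settings file if it exists, merge the entry, write it back."""
    settings = json.loads(path.read_text()) if path.exists() else {}
    updated = add_mcp_server(settings, "web-scraper-toolkit", SERVER_ENTRY)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(updated, indent=2) + "\n")
```

Calling `update_settings_file(Path(".claude/settings.json"))` from the project root would apply the change in place.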
Universal web scraping API — extract structured data from any website. Metadata, links, headlines, images, tables, text, or custom CSS selectors. REST API + MCP server.
npm install
npm start # http://localhost:3200

GET /api/v1/scrape?url=...&mode=...

| Param | Default | Options |
|-------|---------|---------|
| url | required | Any URL |
| mode | full | full, metadata, links, headlines, images, tables, text, custom |
| selector | - | CSS selector (custom mode only) |
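Because the target URL and any CSS selector travel inside a query string, characters such as `:`, `/`, `&` and `>` are safest percent-encoded. Below is a minimal Python sketch that builds a GET request URL from the parameters in the table. The function name `build_scrape_url` is hypothetical, and the client-side checks (rejecting unknown modes, requiring a selector in custom mode) are assumptions about sensible usage, not documented server behavior.

```python
from urllib.parse import urlencode

BASE_URL = "http://localhost:3200"  # address printed by `npm start`
MODES = {"full", "metadata", "links", "headlines",
         "images", "tables", "text", "custom"}


def build_scrape_url(url, mode="full", selector=None, base_url=BASE_URL):
    """Build a percent-encoded GET URL for /api/v1/scrape (hypothetical helper)."""
    if mode not in MODES:
        raise ValueError(f"unknown mode: {mode!r}")
    # Assumption: a custom scrape without a selector is a caller mistake.
    if mode == "custom" and not selector:
        raise ValueError("custom mode needs a CSS selector")
    params = {"url": url, "mode": mode}
    # The selector parameter only applies to custom mode, so drop it otherwise.
    if mode == "custom":
        params["selector"] = selector
    return f"{base_url}/api/v1/scrape?{urlencode(params)}"


print(build_scrape_url("https://news.ycombinator.com",
                       mode="custom", selector=".titleline>a"))
# http://localhost:3200/api/v1/scrape?url=https%3A%2F%2Fnews.ycombinator.com&mode=custom&selector=.titleline%3Ea
```

The resulting string can be passed to any HTTP client, for example `urllib.request.urlopen`, once the server is running.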
# Full scrape
curl "http://localhost:3200/api/v1/scrape?url=https://news.ycombinator.com"
# Just metadata (title, description, og:image)
curl "http://localhost:3200/api/v1/scrape?url=https://github.com&mode=metadata"
# Extract all links
curl "http://localhost:3200/api/v1/scrape?url=https://reddit.com&mode=links"
# Headlines only
curl "http://localhost:3200/api/v1/scrape?url=https://bbc.com&mode=headlines"
# Clean text (no HTML)
curl "http://localhost:3200/api/v1/scrape?url=https://example.com&mode=text"
# Custom CSS selector
curl "http://localhost:3200/api/v1/scrape?url=https://news.ycombinator.com&mode=custom&selector=.titleline>a"
# Tables
curl "http://localhost:3200/api/v1/scrape?url=https://en.wikipedia.org/wiki/List_of_countries&mode=tables"

POST /api/v1/scrape

curl -X POST http://localhost:3200/api/v1/scrape \
-H "Content-Type: application/json" \
-d '{"url": "https://example.com", "mode": "metadata"}'

POST /api/v1/batch

Scrape up to 10 URLs at once:
curl -X POST http://localhost:3200/api/v1/batch \
-H "Content-Type: application/json" \
-d '{"urls": ["https://github.com", "https://reddit.com"], "mode": "metadata"}'

node src/mcp-server.js

Tools: scrape_website, extract_text, batch_scrape
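Since the batch endpoint takes at most 10 URLs per request, a client with a longer list needs to split it into several calls. The sketch below shows one way to do that in Python. The helper names (`chunk_urls`, `batch_payloads`, `post_batch`) are hypothetical, and the response is returned as raw text because its format is not described here.

```python
import json
import urllib.request

BASE_URL = "http://localhost:3200"  # address printed by `npm start`
MAX_BATCH = 10  # "up to 10 URLs at once", per the batch endpoint description


def chunk_urls(urls, size=MAX_BATCH):
    """Split a list of URLs into consecutive groups of at most `size`."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def batch_payloads(urls, mode="metadata"):
    """Build one JSON request body per group of up to 10 URLs.

    The default mode mirrors the curl example above.
    """
    return [json.dumps({"urls": chunk, "mode": mode})
            for chunk in chunk_urls(list(urls))]


def post_batch(payload, base_url=BASE_URL):
    """POST one JSON body to /api/v1/batch and return the raw response text."""
    request = urllib.request.Request(
        f"{base_url}/api/v1/batch",
        data=payload.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")
```

With the server running, `[post_batch(p) for p in batch_payloads(urls)]` would send the groups one after another.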
| Mode | Returns |
|------|---------|
| full | metadata + headlines + links + images + tables |
| metadata | title, des