web-scraper-toolkit

MCP Tool

lazymac2x/web-scraper-toolkit

Universal web scraper API — extract metadata, links, headlines, images, tables from any URL. REST + MCP server.

Install

$ npx loaditout add lazymac2x/web-scraper-toolkit

Platform-specific configuration:

.claude/settings.json

{
  "mcpServers": {
    "web-scraper-toolkit": {
      "command": "npx",
      "args": [
        "-y",
        "web-scraper-toolkit"
      ]
    }
  }
}

Add the config above to .claude/settings.json under the mcpServers key.
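If .claude/settings.json already has other entries, the server block needs to be merged in rather than pasted over the file. A minimal sketch of that merge, assuming the settings file is plain JSON; the function name `add_mcp_server` and the `other-server` entry are hypothetical and only there for illustration:

```python
import json


def add_mcp_server(settings: dict) -> dict:
    """Return a copy of `settings` with the web-scraper-toolkit entry added.

    Existing top-level keys and other mcpServers entries are preserved.
    """
    merged = dict(settings)
    servers = dict(merged.get("mcpServers", {}))
    servers["web-scraper-toolkit"] = {
        "command": "npx",
        "args": ["-y", "web-scraper-toolkit"],
    }
    merged["mcpServers"] = servers
    return merged


# Hypothetical existing settings with one unrelated server already configured.
existing = {"mcpServers": {"other-server": {"command": "node", "args": ["server.js"]}}}
print(json.dumps(add_mcp_server(existing), indent=2))
```

In practice the input would come from `json.load` on the settings file and the result would be written back with `json.dump`.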

About

web-scraper-toolkit

Universal web scraping API — extract structured data from any website. Metadata, links, headlines, images, tables, text, or custom CSS selectors. REST API + MCP server.

Quick Start

npm install
npm start  # http://localhost:3200

API

GET /api/v1/scrape?url=...&mode=...

| Param | Default | Options |
|-------|---------|---------|
| url | required | Any URL |
| mode | full | full, metadata, links, headlines, images, tables, text, custom |
| selector | none | CSS selector (custom mode only) |

# Full scrape
curl "http://localhost:3200/api/v1/scrape?url=https://news.ycombinator.com"

# Just metadata (title, description, og:image)
curl "http://localhost:3200/api/v1/scrape?url=https://github.com&mode=metadata"

# Extract all links
curl "http://localhost:3200/api/v1/scrape?url=https://reddit.com&mode=links"

# Headlines only
curl "http://localhost:3200/api/v1/scrape?url=https://bbc.com&mode=headlines"

# Clean text (no HTML)
curl "http://localhost:3200/api/v1/scrape?url=https://example.com&mode=text"

# Custom CSS selector (URL-encoded: ">" becomes %3E)
curl "http://localhost:3200/api/v1/scrape?url=https://news.ycombinator.com&mode=custom&selector=.titleline%3Ea"

# Tables
curl "http://localhost:3200/api/v1/scrape?url=https://en.wikipedia.org/wiki/List_of_countries&mode=tables"
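When these GET requests are built from code, the target URL and any CSS selector should be percent-encoded so characters such as `>` or `&` do not break the query string. A small sketch using only the Python standard library; `build_scrape_url` is a hypothetical helper name, and the mode list is copied from the parameter table:

```python
from urllib.parse import urlencode

BASE_URL = "http://localhost:3200"  # default address from Quick Start
MODES = {"full", "metadata", "links", "headlines", "images", "tables", "text", "custom"}


def build_scrape_url(url, mode="full", selector=None, base_url=BASE_URL):
    """Build a GET /api/v1/scrape URL with percent-encoded query parameters."""
    if mode not in MODES:
        raise ValueError(f"unknown mode: {mode!r}")
    params = {"url": url, "mode": mode}
    if selector is not None:
        # Per the table, the selector applies to custom mode only.
        params["selector"] = selector
    return f"{base_url}/api/v1/scrape?{urlencode(params)}"


print(build_scrape_url("https://news.ycombinator.com", mode="custom", selector=".titleline>a"))
# http://localhost:3200/api/v1/scrape?url=https%3A%2F%2Fnews.ycombinator.com&mode=custom&selector=.titleline%3Ea
```

The resulting string can be passed to curl or to any HTTP client once the server is running.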

POST /api/v1/scrape

curl -X POST http://localhost:3200/api/v1/scrape \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com", "mode": "metadata"}'
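The same POST can be prepared from Python without third-party packages. This sketch only constructs the request object, mirroring the curl call above; `build_scrape_request` is a hypothetical name, and the response format is not described here, so the example does not assume one:

```python
import json
from urllib.request import Request


def build_scrape_request(url, mode="full", base_url="http://localhost:3200"):
    """Prepare a POST /api/v1/scrape request with a JSON body."""
    body = json.dumps({"url": url, "mode": mode}).encode("utf-8")
    return Request(
        f"{base_url}/api/v1/scrape",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_scrape_request("https://example.com", mode="metadata")
print(request.get_method(), request.full_url)
# To send it, pass `request` to urllib.request.urlopen while the server is running.
```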

POST /api/v1/batch

Scrape up to 10 URLs at once:

curl -X POST http://localhost:3200/api/v1/batch \
  -H "Content-Type: application/json" \
  -d '{"urls": ["https://github.com", "https://reddit.com"], "mode": "metadata"}'
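Because a batch accepts at most 10 URLs, a longer list has to be split on the client side. One way to do that is sketched below; `batch_payloads` is a hypothetical helper, the example URLs are placeholders, and how the server responds to an oversized batch is not documented here:

```python
import json

BATCH_LIMIT = 10  # "up to 10 URLs at once"


def batch_payloads(urls, mode="full", limit=BATCH_LIMIT):
    """Yield JSON request bodies for POST /api/v1/batch, `limit` URLs at a time."""
    for start in range(0, len(urls), limit):
        yield json.dumps({"urls": urls[start:start + limit], "mode": mode})


# Placeholder URLs: 23 pages split into groups of 10, 10, and 3.
urls = [f"https://example.com/page/{i}" for i in range(23)]
payloads = list(batch_payloads(urls, mode="metadata"))
print(len(payloads))  # 3
```

Each payload can then be sent as the `-d` body of the curl command above.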

MCP Server

node src/mcp-server.js

Tools: scrape_website, extract_text, batch_scrape

Modes

| Mode | Returns |
|------|---------|
| full | metadata + headlines + links + images + tables |
| metadata | title, des

Quality Signals

Installs

Last updated: 28 days ago

Security: A

README

Safety

Risk Level: medium

Data Access

read

Network Access: none

Details

Source: github-crawl

Last commit: 3/21/2026

Embed Badge

[![Loaditout](https://loaditout.ai/api/badge/lazymac2x/web-scraper-toolkit)](https://loaditout.ai/skills/lazymac2x/web-scraper-toolkit)
