vllm-mlx

MCP Tool

waybarrios/vllm-mlx

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.

Install

$ npx loaditout add waybarrios/vllm-mlx

Platform-specific configuration:

.claude/settings.json

{
  "mcpServers": {
    "vllm-mlx": {
      "command": "npx",
      "args": [
        "-y",
        "vllm-mlx"
      ]
    }
  }
}

Add the config above to .claude/settings.json under the mcpServers key.

Reviews

Loading reviews...

Quality Signals

Quality Score4500

610

Stars

Installs

Last updated5 days ago

Security: A

New

vllm-mlx

Install

Tags

Reviews

Quality Signals

Safety

Details

Embed Badge