ollama2api

MCP Tool

ssantosdanilo/ollama2api

Aggregate multiple Ollama instances into a unified OpenAI-compatible API with load balancing, health checks, auto discovery, and web management.

Install

$ npx loaditout add ssantosdanilo/ollama2api

Platform-specific configuration:

.claude/settings.json

{
  "mcpServers": {
    "ollama2api": {
      "command": "npx",
      "args": [
        "-y",
        "ollama2api"
      ]
    }
  }
}

Add the config above to .claude/settings.json under the mcpServers key.
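
If the settings file already contains other entries, the merge can be scripted instead of edited by hand. The following is a minimal sketch in Python, assuming the file is a plain JSON object; the helper names `add_mcp_server` and `update_settings_file` are hypothetical and not part of ollama2api.

```python
import copy
import json
from pathlib import Path

# Entry copied from the configuration shown above.
OLLAMA2API_ENTRY = {"command": "npx", "args": ["-y", "ollama2api"]}


def add_mcp_server(settings: dict, name: str = "ollama2api") -> dict:
    """Return a copy of settings with the entry added under mcpServers.

    Existing top-level keys and other MCP servers are preserved.
    """
    merged = dict(settings)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = copy.deepcopy(OLLAMA2API_ENTRY)
    merged["mcpServers"] = servers
    return merged


def update_settings_file(path: str = ".claude/settings.json") -> None:
    """Read the settings file if it exists, merge the entry, write it back."""
    file = Path(path)
    current = json.loads(file.read_text(encoding="utf-8")) if file.exists() else {}
    file.parent.mkdir(parents=True, exist_ok=True)
    file.write_text(
        json.dumps(add_mcp_server(current), indent=2) + "\n", encoding="utf-8"
    )
```

Calling `update_settings_file()` from the project root would leave any unrelated settings untouched and only add or overwrite the `ollama2api` entry.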

About

Ollama2API

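> Ollama backend aggregation gateway: OpenAI API compatible, multi-node load balancing, automatic discovery and management

Aggregates multiple Ollama instances into a single OpenAI-compatible API, with smart load balancing, health checks, node scan discovery, and a web admin console.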

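Features

OpenAI compatible: /v1/chat/completions and /v1/models, so ChatGPT-style front ends, Cursor, and similar tools can connect directly
Multi-node load balancing: weighted-score scheduling based on latency, success rate, and failure count
Automatic health checks: nodes are probed periodically; failing nodes are cooled down and recovered automatically
Node scan discovery: high-speed scanning with masscan, with a pure-Python fallback, to discover Ollama instances in bulk
Proxy support: optional Xray integration for reaching nodes through SOCKS5/HTTP proxies
API key management: optional authentication, with bulk key creation and usage statistics
Web admin console: node management, scan control, key management, configuration editing, log viewing
AI operations assistant: built-in AI chat for managing the system in natural language
Streaming responses: full SSE streaming output support
Zero-dependency storage: JSON file storage, no database required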
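
To make the load-balancing feature concrete, here is a minimal sketch of how a weighted score over latency, success rate, and failure count might rank nodes. The `NodeStats` fields, the weights, and the formula are illustrative assumptions; the project's actual scoring may differ.

```python
from dataclasses import dataclass


@dataclass
class NodeStats:
    """Hypothetical per-node metrics; the field names are assumptions."""
    url: str
    avg_latency_ms: float
    successes: int
    failures: int
    consecutive_failures: int = 0


def score(node: NodeStats) -> float:
    """Higher is better. The weights are arbitrary illustrative values."""
    total = node.successes + node.failures
    # Treat nodes with no history optimistically so they get a chance.
    success_rate = node.successes / total if total else 1.0
    # Maps latency to (0, 1]: 1.0 at 0 ms, 0.5 at 1000 ms.
    latency_term = 1.0 / (1.0 + node.avg_latency_ms / 1000.0)
    failure_penalty = 0.1 * node.consecutive_failures
    return 0.5 * success_rate + 0.5 * latency_term - failure_penalty


def pick_node(nodes: list[NodeStats]) -> NodeStats:
    """Return the highest-scoring node."""
    if not nodes:
        raise ValueError("no nodes available")
    return max(nodes, key=score)
```

With equal success rates, the lower-latency node wins; a run of consecutive failures can push an otherwise fast node below a slower but stable one.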

Quick start

Docker deployment (recommended)

git clone https://github.com/yourname/ollama2api.git
cd ollama2api
docker-compose up -d

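Open http://localhost:8001/admin to reach the admin console.

> Security note: change the default administrator password in the admin console immediately after the first deployment.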

Running locally

python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate
pip install -r requirements.txt
python main.py

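The service starts at http://localhost:8001.

Requirements

| Dependency | Required | Notes |
|------------|----------|-------|
| Python 3.10+ | Yes | Runtime |
| masscan | No | High-speed port scanning; falls back to pure Python when not installed |
| Xray | No | Proxy support; can be skipped if no proxy is needed |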

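Configuration

Runtime configuration is stored in data/config.json and can be changed on the fly from the admin console:

| Setting | Default | Description |
|---------|---------|-------------|
| request_timeout | 300 | Request timeout (seconds) |
| connect_timeout | 10 | Connection timeout (seconds) |
| health_check_interval | 300 | Health check interval (seconds) |
| max_retries | 3 | Maximum number of request retries |
| cooldown_threshold | 3 | Consecutive failures before a node enters cooldown |
| cooldown_duration | 300 | Cooldown duration (seconds) |
| scanner_concurrency | 50 | Scan concurrency |
| masscan_rate | 5000 | masscan packet send rate |
| cleanup_offline_hours | 24 | Threshold for automatically removing offline nodes (hours) |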
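
As an illustration of how `cooldown_threshold` and `cooldown_duration` could interact, the sketch below tracks consecutive failures for one node and takes it out of rotation for a fixed period. The `CooldownTracker` class is hypothetical and only assumes the semantics described in the table; it is not taken from the project's source.

```python
import time
from typing import Optional

# Defaults taken from the configuration table.
DEFAULT_CONFIG = {"cooldown_threshold": 3, "cooldown_duration": 300}


class CooldownTracker:
    """Hypothetical per-node failure tracker (an assumption, not project code)."""

    def __init__(self, config: Optional[dict] = None) -> None:
        cfg = {**DEFAULT_CONFIG, **(config or {})}
        self.threshold = int(cfg["cooldown_threshold"])
        self.duration = float(cfg["cooldown_duration"])
        self.failures = 0          # consecutive failures so far
        self.cooldown_until = 0.0  # timestamp when the node may be used again

    def record_failure(self, now: Optional[float] = None) -> None:
        now = time.monotonic() if now is None else now
        self.failures += 1
        if self.failures >= self.threshold:
            # Threshold reached: (re)start the cooldown window.
            self.cooldown_until = now + self.duration

    def record_success(self) -> None:
        # Any success clears the failure streak and ends the cooldown.
        self.failures = 0
        self.cooldown_until = 0.0

    def is_available(self, now: Optional[float] = None) -> bool:
        now = time.monotonic() if now is None else now
        return now >= self.cooldown_until
```

In this sketch the failure streak is kept after the cooldown expires, so a single further failure sends the node straight back into cooldown; resetting the counter on expiry would be an equally reasonable choice.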

API

Fully compatible with the OpenAI Chat Completions API:

# Chat completion (streaming)
curl http://localhost:8001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer your-api-key" \
  -d '{"model": "your-model", "messages": [{"role": "user", "content": "Hello"}], "stream": true}'

# Model list
curl http://localhost:8001/v1/models

# Health check
curl http://localhost:8001/health

> The Authorization header is not needed when no API key is configured.
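
The same streaming call can be made from Python with only the standard library. The sketch below assumes the gateway emits OpenAI-style SSE chunks (`data: {...}` lines ending with `data: [DONE]`); the function names `iter_stream_text` and `stream_chat` are hypothetical, and the base URL, model name, and key are placeholders.

```python
import json
import urllib.request
from typing import Iterable, Iterator, Optional


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield content fragments from OpenAI-style SSE lines."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators and SSE comments
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text


def stream_chat(base_url: str, model: str, prompt: str,
                api_key: Optional[str] = None) -> Iterator[str]:
    """POST a streaming chat completion and yield text as it arrives."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,
    }).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if api_key:  # the header is only needed when API keys are configured
        headers["Authorization"] = f"Bearer {api_key}"
    req = urllib.request.Request(
        f"{base_url}/v1/chat/completions", data=body, headers=headers
    )
    with urllib.request.urlopen(req) as resp:
        yield from iter_stream_text(line.decode("utf-8") for line in resp)
```

For example, `for piece in stream_chat("http://localhost:8001", "your-model", "Hello"): print(piece, end="")` would print the reply incrementally against a locally running instance.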

Batch scanning

独


Quality Signals

Last updated: 1 day ago

Security: B
