MBaranekTech/pdf-rag-mcp
MCP server for RAG over messy PDFs — semantic search, OCR, table extraction. Works with Claude Desktop, Claude Code, Cursor, and any MCP client.
Platform-specific configuration:
{
"mcpServers": {
"pdf-rag-mcp": {
"command": "npx",
"args": [
"-y",
"pdf-rag-mcp"
]
}
}
}Add the config above to .claude/settings.json under the mcpServers key.
> MCP server for RAG over messy PDFs — extract, chunk, embed, and search scanned, multi-column, and table-heavy documents.
[](https://www.python.org/downloads/) [](LICENSE) [](https://modelcontextprotocol.io)
---
<p align="center"> <br/> <em>All 6 tools running in the MCP Inspector</em> </p>
RAG (Retrieval-Augmented Generation) is a technique that makes AI assistants smarter by giving them access to your own documents. Instead of relying only on training data, the AI first *retrieves* relevant chunks from your files, then uses them as context to generate accurate, grounded answers.
Traditional AI: User Question → LLM → Answer (may hallucinate)
RAG: User Question → Search Your Docs → LLM + Context → Accurate AnswerThis MCP server is the "Search Your Docs" part — it ingests PDFs, breaks them into searchable chunks, and lets any MCP-compatible AI assistant find the right information instantly.
Most PDF tools choke on real-world documents — scanned pages, multi-column layouts, embedded tables. This MCP server handles them all:
<p align="center"> <br/> <em>Ingesting a PDF — extracts text, chunks it, generates e
Loading reviews...