by nkapila6
Provides a locally run RAG‑like web search tool that fetches recent results, generates embeddings, ranks relevance, and returns markdown excerpts to language models for up‑to‑date context.
Mcp Local Rag enables language models to perform live web searches, extract relevant content, and incorporate that context into their responses without relying on external APIs. It runs entirely on the user’s machine, making it ideal for privacy‑focused or offline workflows.
mcp-local-rag
in the MCP client settings, either using the uvx
command (Python) or Docker.mcp-local-rag
tool.Q: Do I need an API key? A: No. The tool uses DuckDuckGo for search and MediaPipe for embeddings, both of which are free and do not require authentication.
Q: Which programming language is required?
A: The server is Python‑based. Installation can be done via uvx
(recommended) or Docker.
Q: Can I customize the number of search results or the embedding model? A: Yes. The source code exposes configuration variables for result count and the embedding pipeline.
Q: Is it compatible with all LLMs? A: It works with any MCP client that supports tool calling, which includes most major chat interfaces that allow custom tools.
Q: How does it handle rate limits or bans? A: Using DuckDuckGo mitigates strict rate limiting, but heavy usage should respect typical search etiquette.
"primitive" RAG-like web search model context protocol (MCP) server that runs locally. ✨ no APIs ✨
Locate your MCP config path here or check your MCP client settings.
uvx
This is the easiest and quickest method. You need to install uv for this to work. Add this to your MCP server configuration:
{
"mcpServers": {
"mcp-local-rag":{
"command": "uvx",
"args": [
"--python=3.10",
"--from",
"git+https://github.com/nkapila6/mcp-local-rag",
"mcp-local-rag"
]
}
}
}
Ensure you have Docker installed. Add this to your MCP server configuration:
{
"mcpServers": {
"mcp-local-rag": {
"command": "docker",
"args": [
"run",
"--rm",
"-i",
"--init",
"-e",
"DOCKER_CONTAINER=true",
"ghcr.io/nkapila6/mcp-local-rag:latest"
]
}
}
}
MseeP does security audits on every MCP server, you can see the security audit of this MCP server by clicking here.
The MCP server should work with any MCP client that supports tool calling. Has been tested on the below clients.
When an LLM (like Claude) is asked a question requiring recent web information, it will trigger mcp-local-rag
.
When asked to fetch/lookup/search the web, the model prompts you to use MCP server for the chat.
In the example, have asked it about Google's latest Gemma models released yesterday. This is new info that Claude is not aware about.
mcp-local-rag
performs a live web search, extracts context, and sends it back to the model—giving it fresh knowledge:
Have ideas or want to improve this project? Issues and pull requests are welcome!
This project is licensed under the MIT License.
Please log in to share your review and rating for this MCP.
Discover more MCP servers with similar functionality and use cases
by exa-labs
Provides a Model Context Protocol server that enables AI assistants to perform real‑time web searches via the Exa AI Search API, with optional company research, LinkedIn lookup, and deep research workflows.
by perplexityai
Provides real-time web search capabilities to AI models via the Perplexity Sonar API, enabling seamless integration within the Model Context Protocol ecosystem.
by brightdata
Provides real‑time web access, bypasses geo‑restrictions, handles bot detection, and offers browser automation for LLMs and AI agents via the Model Context Protocol.
by mamertofabian
mcp-everything-search is a cross-platform MCP server that provides fast and flexible file searching capabilities. It leverages native system tools to efficiently locate files and folders across Windows, macOS, and Linux.
by kagisearch
Provides web search and video summarization capabilities via the Model Context Protocol, integrating with Claude and other AI tools.
by apify
mcp-server-rag-web-browser is an MCP server for the RAG Web Browser Actor, enabling AI agents and LLMs to perform web searches and extract information from web pages.
by fatwang2
Provides web and news search, URL crawling, sitemap extraction, reasoning, and trending tools via Search1API, exposed as an MCP server for seamless integration with clients such as LibreChat, Claude Desktop, Cursor, and other MCP‑compatible tools.
by meilisearch
Enables LLMs to manage Meilisearch indexes, perform searches, and handle documents through natural‑language conversations via a Model Context Protocol server.
by ihor-sokoliuk
mcp-searxng is an MCP (Model Context Protocol) server that integrates with SearXNG to provide web search capabilities for AI models and applications.