RAGFlow Claude MCP Server

A small Model Context Protocol (MCP) server that hooks Claude Desktop (and other MCP clients) up to a RAGFlow instance. It exposes the RAGFlow REST API as a handful of tools so the LLM can query knowledge bases and pull document chunks into its context.

This is personal-use software I wrote for my own R&D. It's not bug-free and the code is not pretty. It works for what I need.

What it does

Direct retrieval: pulls raw document chunks with similarity scores from RAGFlow's /retrieval endpoint.
Multi-KB search: a single query can hit several knowledge bases at once.
DSPy query deepening: optional iterative query refinement (uses an LLM to analyse intermediate results and rewrite the query).
~~Reranking~~ — currently broken on the RAGFlow side, see Known issues.
Tunable result control: page_size, similarity_threshold, top_k, pagination.
Document filter: limit results to one document inside a dataset (fuzzy name matching).
Dataset lookup by name (case-insensitive, fuzzy) instead of by ID.
Cloudflare Zero Trust authentication when your RAGFlow sits behind it.

Installation

Clone:

git clone https://github.com/norandom/ragflow-claude-desktop-local-mcp
cd ragflow-claude-desktop-local-mcp

Install:

# On macOS, install DSPy first to dodge build issues:
pip install git+https://github.com/stanfordnlp/dspy.git

uv install

Configure: copy the sample and fill in your RAGFlow details.
```
cp config.json.sample config.json
```
Keys:
- RAGFLOW_BASE_URL: e.g. http://your-ragflow-server:9380
- RAGFLOW_API_KEY: your RAGFlow API key
- RAGFLOW_DEFAULT_RERANK: rerank model (default rerank-multilingual-v3.0)
- CF_ACCESS_CLIENT_ID (optional): Cloudflare Zero Trust service-token ID
- CF_ACCESS_CLIENT_SECRET (optional): Cloudflare Zero Trust service-token secret
- DSPY_MODEL: DSPy LM (default openai/gpt-4o-mini)
- OPENAI_API_KEY: needed for DSPy deepening

Cloudflare Zero Trust

If your RAGFlow is behind Cloudflare Zero Trust, grab a service token from the dashboard and add it to config.json:

{
  "CF_ACCESS_CLIENT_ID": "your-client-id.access",
  "CF_ACCESS_CLIENT_SECRET": "your-client-secret"
}

When both are set, every API request goes out with the CF-Access-Client-Id and CF-Access-Client-Secret headers. No code change needed.

Claude Desktop config

{
  "mcpServers": {
    "ragflow": {
      "command": "uv",
      "args": [
        "run",
        "--directory",
        "/path/to/ragflow-claude-desktop-local-mcp",
        "ragflow-claude-mcp"
      ]
    }
  }
}

Tools

`ragflow_retrieval_by_name` (the one I use most)

Retrieve chunks across one or more datasets by name. Returns raw chunks with similarity scores.

Params:

dataset_names (required) — list, e.g. ["BASF", "Quant Literature"]
query (required)
document_name (optional) — restrict to one document; fuzzy match
top_k (optional, default 1024) — vector candidates
similarity_threshold (optional, default 0.2) — 0.0–1.0
page (optional, default 1)
page_size (optional, default 10)
use_rerank (optional, default false) — currently broken upstream, see Known issues
deepening_level (optional, default 0) — DSPy refinement, 0–3

`ragflow_retrieval`

Same shape, but takes dataset_ids: List[str] instead of names.

Multi-KB search

You can search across several knowledge bases in one call. Make sure they share an embedding model — mixing incompatible embeddings will tank the relevance scores.

Use ragflow_retrieval_by_name with dataset_names ["Finance Reports", "Legal Documents"] and query "Summarize the key financial risks and compliance requirements for new market entry."

`ragflow_list_datasets`

Lists every knowledge base on your RAGFlow instance. No params. Walks all pages internally.

`ragflow_list_documents`

Lists documents in a dataset. Walks all pages.

dataset_id (required)

`ragflow_get_chunks`

Returns chunks (with references) for one document.

dataset_id (required)
document_id (required)

`ragflow_list_sessions`

Shows active chat sessions per dataset. No params.

`ragflow_list_documents_by_name`

Lists documents in a dataset, looked up by name.

dataset_name (required)

`ragflow_reset_session`

Drops the chat session for a dataset.

dataset_id (required)

Tuning the retrieval

The retrieval tools take three knobs:

page_size — chunks per page (default 10).
similarity_threshold — drops chunks below this score (default 0.2).
top_k — pool size for the vector search before filtering (default 1024).

Some starting points that work for me:

Broader recall: page_size=15, similarity_threshold=0.15.
Tight precision: page_size=5, similarity_threshold=0.4.
Heavy research: page_size=20, similarity_threshold=0.1, deepening_level=1.
Hard queries: deepening_level=2.
Speed: keep deepening_level=0 and skip rerank.

Examples

Basic retrieval by name:

Use ragflow_retrieval_by_name with dataset_names ["BASF"] and query "What is BASF's latest income statement? Revenue, operating income, net income, and other key figures."

Restrict to one document:

Use ragflow_retrieval_by_name with dataset_names ["BASF"], document_name "annual_report_2023", and query "What were the key financial highlights for 2023?"

Document names match fuzzily — "annual" will hit annual_report_2023.pdf and annual_report_2024.pdf. When several match, the server picks the most recent and lists the alternatives in the response metadata.

DSPy deepening for a tricky query:

Use ragflow_retrieval_by_name with dataset_names ["Quant Literature"], query "what is a volatility clock", deepening_level 2.

Multi-page:

Use ragflow_retrieval_by_name with dataset_names ["BASF"], query "BASF business segments", page_size 10, page 2.

List what's available:

Use ragflow_list_datasets.

Use ragflow_list_documents_by_name with dataset_name "BASF".

Pull specific chunks:

Use ragflow_get_chunks with dataset_id "43066ee0599411f089787a39c10de57b" and document_id "d74a1c105a3311f09fc94a0fcd8b7722".

Bigger prompts

Some examples of how I drive it from Claude Desktop.

Financial deep-dive:

Help me analyse BASF's recent financials.

1. Use ragflow_retrieval_by_name to search ["BASF"] for the latest income statement
   (revenue, operating income, net income). Use page_size 15,
   similarity_threshold 0.15, deepening_level 1.

2. Then run ragflow_retrieval_by_name again for the cash flow statement,
   page_size 10, similarity_threshold 0.2.

3. Finally look for year-over-year changes with page_size 12,
   similarity_threshold 0.18.

Multilingual research:

Use ragflow_retrieval_by_name with dataset_names ["BASF"],
query "Was sind die wichtigsten Geschäftsbereiche von BASF?",
deepening_level 2.

DSPy detects the query language and refines accordingly. I've used this for German, English, and mixed-language queries. It works as long as the underlying documents have content in those languages.

Document-filtered research:

1. Use ragflow_list_documents_by_name with dataset_name "BASF" to see what's in there.
2. Use ragflow_retrieval_by_name with dataset_names ["BASF"],
   document_name "sustainability_report", query "carbon neutrality goals",
   page_size 15, deepening_level 1.
3. Follow up with document_name "annual_report_2023" and
   query "environmental investments".

Cross-KB query:

Use ragflow_retrieval_by_name with dataset_names ["BASF", "Industry Reports"],
query "chemical industry sustainability benchmarks",
page_size 12, deepening_level 1.

How DSPy deepening works

deepening_level runs an LLM-driven refinement loop on top of the retrieval:

0: no deepening (default).
1: one refinement pass.
2: two passes with gap analysis.
3: three+ passes plus result merging.

Each pass: do the search, summarise the top results, ask the LLM what's missing, generate a new query, run that. The response metadata includes the original query, every refined query, and the reasoning at each step.

DSPy needs:

DSPY_MODEL — openai/gpt-4o-mini works fine
OPENAI_API_KEY

Reranking (currently broken)

When working, reranking replaces the vector cosine score with the rerank model's score (typically 10–30% better relevance in my experience). RAGFlow has a known bug right now where use_rerank=true produces:

UnsupportedProtocol: Request URL is missing an 'http://' or 'https://' protocol

So leave use_rerank=false until the upstream issue is fixed. Standard vector retrieval works normally.

How dataset lookup works

Case-insensitive name matching.
Fuzzy match for partial names.
Datasets are cached for name lookup; cache misses trigger a refresh.
If lookup fails, the error includes the available dataset names so you know what was actually there.

Document matching

When you pass document_name:

Exact match wins, then "starts with", then "contains", then partial.
Among ties, the more recently updated document wins.
Names containing 2024, 2023, latest, current, or new get a small score bonus.
All matches are returned in the response metadata so you can re-issue with a more specific name.

Error handling

Reasonable error messages for: API errors, missing datasets, unreachable RAGFlow, broken sessions, invalid input, and config problems. Sensitive values are redacted in logs.

Environment variables

RAGFLOW_BASE_URL — overrides the config file. Default in code: http://192.168.122.93:9380 (which is my local instance).
RAGFLOW_API_KEY — required.

Development

Run the server directly:

uv run ragflow-claude-mcp

It listens on stdio, the way MCP servers do.

Dev deps:

uv install --extra dev

That gets pytest + the asyncio/mock/cov plugins.

Tests:

uv run pytest
uv run pytest --cov=src --cov-report=html --cov-report=term
uv run pytest tests/test_server.py
uv run pytest -v

Coverage is around 44% with 22/23 tests passing (one is skipped because of an intermittent CI flake). Tests cover server init, RAGFlow API integration, DSPy deepening, OpenAI/OpenRouter config branches, and config loading.

Implementation notes

The retrieval API is the only RAGFlow surface the server actually relies on. No assistant/chat dependencies, no server-side prompt config — just chunks back. Easier to reason about, easier to debug.

Troubleshooting

"Dataset not found": run ragflow_list_datasets to see what's actually there.
Connection errors: double-check RAGFLOW_BASE_URL and RAGFLOW_API_KEY.
Server won't start: did uv install actually finish?
Need raw chunks: that's ragflow_retrieval_by_name / ragflow_retrieval.
Stuck session: ragflow_list_sessions then ragflow_reset_session.
Cloudflare 403s: confirm CF_ACCESS_CLIENT_ID / CF_ACCESS_CLIENT_SECRET match an active service token on the Zero Trust app.

Known issues

Rerank is broken upstream

use_rerank=true errors out with UnsupportedProtocol: Request URL is missing an 'http://' or 'https://' protocol. This is a RAGFlow-side defect. Workaround: leave it off. I'm watching the RAGFlow repo for a fix.

Contributing

PRs only — main is protected. Commits must be SSH-signed.

Fork.
git checkout -b feature/your-thing.
Make the change, write a clear commit message.
Push to your fork.
Open a PR against main.

PRs run TruffleHog automatically — don't include keys, tokens, or secrets. See CONTRIBUTING.md for the longer version.

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
.github		.github
scripts/legacy		scripts/legacy
src/ragflow_claude_mcp		src/ragflow_claude_mcp
tests		tests
.coverage		.coverage
.coveragerc		.coveragerc
.gitignore		.gitignore
.python-version		.python-version
COMMIT_SIGNING.md		COMMIT_SIGNING.md
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md
config.json.sample		config.json.sample
deeper_search_ragflow.sh		deeper_search_ragflow.sh
deeper_search_reranked_ragflow.sh		deeper_search_reranked_ragflow.sh
pyproject.toml		pyproject.toml
search_ragflow.sh		search_ragflow.sh
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAGFlow Claude MCP Server

What it does

Installation

Cloudflare Zero Trust

Claude Desktop config

Tools

`ragflow_retrieval_by_name` (the one I use most)

`ragflow_retrieval`

Multi-KB search

`ragflow_list_datasets`

`ragflow_list_documents`

`ragflow_get_chunks`

`ragflow_list_sessions`

`ragflow_list_documents_by_name`

`ragflow_reset_session`

Tuning the retrieval

Examples

Bigger prompts

How DSPy deepening works

Reranking (currently broken)

How dataset lookup works

Document matching

Error handling

Environment variables

Development

Implementation notes

Troubleshooting

Known issues

Rerank is broken upstream

Contributing

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAGFlow Claude MCP Server

What it does

Installation

Cloudflare Zero Trust

Claude Desktop config

Tools

ragflow_retrieval_by_name (the one I use most)

ragflow_retrieval

Multi-KB search

ragflow_list_datasets

ragflow_list_documents

ragflow_get_chunks

ragflow_list_sessions

ragflow_list_documents_by_name

ragflow_reset_session

Tuning the retrieval

Examples

Bigger prompts

How DSPy deepening works

Reranking (currently broken)

How dataset lookup works

Document matching

Error handling

Environment variables

Development

Implementation notes

Troubleshooting

Known issues

Rerank is broken upstream

Contributing

About

Topics

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages

`ragflow_retrieval_by_name` (the one I use most)

`ragflow_retrieval`

`ragflow_list_datasets`

`ragflow_list_documents`

`ragflow_get_chunks`

`ragflow_list_sessions`

`ragflow_list_documents_by_name`

`ragflow_reset_session`