This example originally runs on a single node with four GPUs, each requiring at least 40GB of memory.
- Prepare the RAG dataset in the wiki_retriever_mcp folder. Wiki chunks (
nq_list.pkl) and Faiss index (nq_hnsw_faiss_n32e40.index) are required. (Full wiki dump files are huge, additional information will be provided later) - Prepare the training data in the
datafolder. Download from here.musique_train.parquetandmusique_dev_128.parquetare required. - Set up the environment for wiki retriever MCP:
bash wiki_retriever_install.sh. This will install the required packages and set up the environment for the wiki retriever MCP. - Start the wiki retriever MCP:
python wiki_retriever_mcp.py. This will start the wiki retriever MCP server. - Start Ray:
bash ../../scripts/restart_ray.sh. To use Wandb, you need to set the WANDB_API_KEY environment variable before starting Ray. - Run the agent:
python rag_agent.py. This automatically launches 12 agent workers by default. - In another terminal, launch the training server:
bash train.sh.
Results are coming soon.