An agentic RAG (Retrieval-Augmented Generation) system that lets you upload documents, ask questions about them, and get answers grounded in the document content.
Upload a PDF, Word doc, spreadsheet, image, or other supported file — the agent parses it, stores the embeddings in a vector database, and uses them to answer your questions in a multi-turn conversation.
- BeeAI Framework – agent orchestration and reasoning
- Docling – document parsing and text extraction
- Milvus – vector database for semantic search
- Arize Phoenix – observability and tracing
- Agent Stack – local platform runtime
PDF, DOCX, XLSX, PPTX, Markdown, AsciiDoc, HTML, CSV, PNG, JPEG, TIFF, BMP, WEBP
- User uploads a document — The file is sent to the agent through the Agent Stack UI.
- Docling extracts text — The document is parsed asynchronously via the Docling service and split into chunks.
- Embeddings stored in Milvus — Each chunk is embedded (using OpenAI
text-embedding-3-smallby default) and stored in a Milvus collection. - User asks a question — The BeeAI agent searches the vector store for relevant chunks and reasons over them to produce an answer.
- Conversation continues — The agent maintains conversation history, so you can ask follow-up questions against the same documents.
sh -c "$(curl -LsSf https://raw.githubusercontent.com/i-am-bee/agentstack/install/install.sh)"This installs the agentstack CLI, downloads the platform, and launches the UI. For other options, see the Quickstart guide.
agentstack platform stop
agentstack platform start --set phoenix.enabled=true --set docling.enabled=trueagentstack model setupFollow the prompts to connect an LLM provider (e.g., OpenAI). This is used for both chat completions and embeddings.
- Go to milvus.io and click Try Managed Milvus
- Create a project and cluster
- Copy the Public Endpoint and Token
- Register them with Agent Stack:
agentstack env add 'RAG Milvus' \
MILVUS_DB_URI="your-endpoint" \
MILVUS_DB_TOKEN="your-token"git clone https://github.com/jenna-winkler/rag.git
cd rag
uv syncStart the agent server:
uv run serverThen open the Agent Stack UI:
agentstack uiGo to http://localhost:8334, select the RAG Milvus agent, upload a document, and start asking questions.
Traces are collected automatically via Arize Phoenix. Open http://localhost:6006 to explore agent runs, tool calls, and latency data.