Agentic RAG Example

An agentic RAG (Retrieval-Augmented Generation) system that lets you upload documents, ask questions about them, and get answers grounded in the document content.

Upload a PDF, Word doc, spreadsheet, image, or other supported file. The agent parses it, embeds the resulting chunks, stores the embeddings in a vector database, and uses them to answer your questions in a multi-turn conversation.

Built with

BeeAI Framework – agent orchestration and reasoning
Docling – document parsing and text extraction
Milvus – vector database for semantic search
Arize Phoenix – observability and tracing
Agent Stack – local platform runtime

Supported file types

PDF, DOCX, XLSX, PPTX, Markdown, AsciiDoc, HTML, CSV, PNG, JPEG, TIFF, BMP, WEBP
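
As a rough illustration of how a client could pre-check a file before uploading it, the sketch below maps the formats listed above to file extensions. The extension set and the `is_supported` helper are assumptions made for this example; the agent's actual validation may differ.

```python
from pathlib import Path

# Assumed extension mapping for the formats listed above.
# Hypothetical: the agent's real validation logic is not shown in this README.
SUPPORTED_EXTENSIONS = {
    ".pdf", ".docx", ".xlsx", ".pptx", ".md", ".adoc", ".html", ".csv",
    ".png", ".jpg", ".jpeg", ".tiff", ".bmp", ".webp",
}


def is_supported(filename: str) -> bool:
    """Return True if the file's extension matches one of the listed formats."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

The comparison is case-insensitive, so `report.PDF` and `report.pdf` are treated the same way.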

How it works

1. User uploads a document: the file is sent to the agent through the Agent Stack UI.
2. Docling extracts text: the document is parsed asynchronously via the Docling service and split into chunks.
3. Embeddings stored in Milvus: each chunk is embedded (using OpenAI text-embedding-3-small by default) and stored in a Milvus collection.
4. User asks a question: the BeeAI agent searches the vector store for relevant chunks and reasons over them to produce an answer.
5. Conversation continues: the agent maintains conversation history, so you can ask follow-up questions against the same documents.
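
The chunk, embed, store, and search steps above can be sketched with the standard library alone. This is a toy model, not the agent's implementation: `chunk_text`, `embed`, and `InMemoryStore` are hypothetical names, the bag-of-words "embedding" stands in for a real embedding model, and the in-memory list stands in for a Milvus collection.

```python
import math
from collections import Counter


def chunk_text(text: str, max_words: int = 50) -> list[str]:
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text: str) -> Counter:
    """Toy bag-of-words vector; a real system would call an embedding model."""
    return Counter(w.strip(".,?!").lower() for w in text.split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class InMemoryStore:
    """Stand-in for a vector database collection (assumption for this sketch)."""

    def __init__(self) -> None:
        self.rows: list[tuple[str, Counter]] = []

    def add(self, chunks: list[str]) -> None:
        # Store each chunk next to its vector, as a vector database would.
        self.rows.extend((chunk, embed(chunk)) for chunk in chunks)

    def search(self, query: str, top_k: int = 2) -> list[str]:
        # Rank stored chunks by similarity to the query and keep the best ones.
        q = embed(query)
        ranked = sorted(self.rows, key=lambda row: cosine(q, row[1]), reverse=True)
        return [chunk for chunk, _ in ranked[:top_k]]
```

In use, a document is chunked and added once, then each question triggers a `search` whose results would be handed to the LLM as context:

```python
store = InMemoryStore()
store.add(chunk_text("Milvus stores vectors. Docling parses documents. Phoenix records traces.", max_words=3))
print(store.search("Which tool parses documents?", top_k=1))
```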

Prerequisites

Python >= 3.11
uv (Python package manager)
A Milvus cloud instance (free tier available)

Setup

1. Install Agent Stack

sh -c "$(curl -LsSf https://raw.githubusercontent.com/i-am-bee/agentstack/install/install.sh)"

This installs the agentstack CLI, downloads the platform, and launches the UI. For other options, see the Quickstart guide.

2. Start the platform with Docling and Phoenix enabled

agentstack platform stop
agentstack platform start --set phoenix.enabled=true --set docling.enabled=true

3. Configure an LLM provider

agentstack model setup

Follow the prompts to connect an LLM provider (e.g., OpenAI). The provider you configure is used for both chat completions and embeddings.

4. Create a Milvus cloud instance

Go to milvus.io and click Try Managed Milvus
Create a project and cluster
Copy the Public Endpoint and Token
Register them with Agent Stack:

agentstack env add 'RAG Milvus' \
  MILVUS_DB_URI="your-endpoint" \
  MILVUS_DB_TOKEN="your-token"
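
For orientation, here is one way agent code could read and validate these two values. It assumes the variables registered above reach the agent process as ordinary environment variables, which this README does not confirm; `MilvusSettings` and `load_milvus_settings` are hypothetical names for this sketch.

```python
import os
from typing import Mapping, NamedTuple, Optional


class MilvusSettings(NamedTuple):
    uri: str
    token: str


def load_milvus_settings(env: Optional[Mapping[str, str]] = None) -> MilvusSettings:
    """Read the Milvus endpoint and token, failing early if either is missing."""
    env = os.environ if env is None else env
    missing = [key for key in ("MILVUS_DB_URI", "MILVUS_DB_TOKEN") if not env.get(key)]
    if missing:
        raise RuntimeError(f"Missing environment variables: {', '.join(missing)}")
    return MilvusSettings(uri=env["MILVUS_DB_URI"], token=env["MILVUS_DB_TOKEN"])


# With pymilvus installed, the settings could then be passed to its client:
#   from pymilvus import MilvusClient
#   client = MilvusClient(uri=settings.uri, token=settings.token)
# Whether this agent connects in exactly this way is an assumption.
```

Failing at startup with a clear message is usually easier to debug than a connection error on the first upload.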

5. Clone and install

git clone https://github.com/jenna-winkler/rag.git
cd rag
uv sync

Running the agent

Start the agent server:

uv run server

Then open the Agent Stack UI:

agentstack ui

Go to http://localhost:8334, select the RAG Milvus agent, upload a document, and start asking questions.

Observability

Traces are collected automatically via Arize Phoenix. Open http://localhost:6006 to explore agent runs, tool calls, and latency data.
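
If either page fails to load, a quick port check can tell you whether the service is listening at all. The sketch below uses only the standard library; the ports come from the URLs in this README, while `port_open`, `check_services`, and the `SERVICES` mapping are hypothetical helpers for this example.

```python
import socket

# Ports taken from the URLs mentioned in this README.
SERVICES = {"Agent Stack UI": 8334, "Phoenix": 6006}


def port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_services(host: str = "127.0.0.1") -> dict[str, bool]:
    """Map each service name to whether its port accepts connections."""
    return {name: port_open(host, port) for name, port in SERVICES.items()}
```

Calling `check_services()` returns a dictionary keyed by service name. An open port only shows that something is listening, not that the service is healthy.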
