Analysis
6 min read

Amazon Bedrock Managed Knowledge Base Goes GA — Agentic Retriever Ends DIY RAG Pipelines

Amazon Bedrock Managed Knowledge Base and Agentic Retriever reached GA on June 17, 2026. Here's what it means for enterprise RAG architectures and AWS-native AI agents.

Source: AWS What's New

Amazon Bedrock Managed Knowledge Base Goes GA — Agentic Retriever Ends DIY RAG Pipelines

By Vatsal Shah · June 18, 2026 · Cloud · Source: AWS What's New


💡 Insight

AI SUMMARY

  • Amazon Bedrock Managed Knowledge Base reached General Availability on June 17, 2026 across US East, US West, EU, APAC, and GovCloud regions.
  • The Agentic Retriever — a new multiturn, multihop retrieval engine — is the central addition: it decomposes complex queries into sub-questions and issues sequential retrieval hops to build context-rich, synthesized answers.
  • Six native connectors (Amazon S3, SharePoint Online, Confluence, Salesforce, Jira, Web Crawler) handle automated ingestion, chunking, embedding, and sync — eliminating the DIY ETL layer that most enterprise RAG teams build and maintain manually.
  • Managed vector storage removes the need to provision, scale, or back up a separate vector database for knowledge base workloads.
  • Announced during AWS Summit New York 2026, this GA is AWS's clearest signal yet that enterprise RAG is a cloud primitive — not a framework problem.

What Happened

Amazon Web Services made Bedrock Managed Knowledge Base and its Agentic Retriever generally available on June 17, 2026, announced as a headline release during AWS Summit New York 2026.

The core offering: a fully managed Retrieval-Augmented Generation (RAG) system built directly into the Amazon Bedrock platform. Teams no longer need to stand up and operate their own vector databases, write custom document ingestion pipelines, or maintain chunking and embedding logic — Bedrock handles all of it.

The centerpiece feature is the Agentic Retriever: a multi-step, multihop retrieval engine that goes beyond classic single-pass vector search. When a user asks a complex question — "What were the key decisions in the Q1 2026 product review meeting, and how do they align with the roadmap from last year's strategy document?" — the Agentic Retriever:

  1. Decomposes the query into sub-questions
  2. Issues sequential retrieval hops against the knowledge base
  3. Uses each hop's results to inform the next query
  4. Synthesizes a final answer from the accumulated context

This is categorically different from a one-shot similarity_search() call against a Pinecone or pgvector index.

Bedrock Agentic Retriever multihop query flow — user query enters Agentic Retriever for decomposition and planning, flows to Managed Knowledge Base vector index, executes Hop 1 context retrieval, Hop 2 follow-up retrieval, then synthesizes a final answer — AWS Bedrock 2026
Amazon Bedrock's Agentic Retriever decomposes complex queries into sequential retrieval hops. Unlike single-pass vector search, it builds progressively richer context across multiple knowledge base queries before synthesizing a final answer — enabling the kind of multi-document reasoning that enterprise use cases require.

What's included at GA:

  • Six native connectors: Amazon S3, SharePoint Online (Microsoft 365), Confluence, Salesforce, Jira, and a managed Web Crawler. Each connector handles scheduling, incremental sync, metadata extraction, chunking, and embedding — without custom ETL code.
  • Managed vector storage: AWS handles provisioning, scaling, backup, and the embedding model lifecycle. No separate vector database to operate.
  • Bedrock AgentCore integration: The knowledge base connects natively to Bedrock AgentCore, the agent runtime layer announced at re:Invent 2025. Agents built on AgentCore can issue Agentic Retriever calls as first-class tool invocations.
  • Multiturn memory: The retrieval context is maintained across conversation turns, enabling follow-up questions without re-ingesting the full context window.
  • Regional availability: US East (N. Virginia), US West (Oregon), EU (Ireland, Frankfurt), APAC (Tokyo, Sydney), and AWS GovCloud (US-East).

Why It Matters

Bedrock Managed KB native connector map — Amazon S3, SharePoint Online, Confluence, Salesforce, Jira, and Web Crawler surround the central Managed Knowledge Base node with managed sync, auto-chunking, and metadata handling — AWS Bedrock 2026
Bedrock Managed Knowledge Base ships with six enterprise-grade native connectors. Each connector manages the full data pipeline: scheduling, incremental sync, chunking, embedding, and metadata extraction. Teams that previously built and maintained this infrastructure manually — often a 6–10 week engineering investment — get it as a managed service with GA SLAs.

The DIY RAG Tax Is Real

Every enterprise RAG team I've worked with in 2025–2026 has the same origin story: they spent 4–10 weeks building and hardening a custom pipeline — LangChain document loaders, a chunking strategy, an embedding service, a vector database cluster (Pinecone, Weaviate, or pgvector on RDS), and a retrieval layer with hybrid search. Then they spent another 4–6 weeks debugging it in production — stale embeddings after document updates, connector failures, chunk size tuning, and embedding model version drift.

That infrastructure is now a managed service. For teams already on AWS, this is the clearest path to eliminating the operational burden of enterprise RAG.

Multihop Is the Missing Piece for Enterprise Queries

The single biggest limitation of classic RAG in enterprise deployments isn't embedding quality — it's that real enterprise questions require multi-document synthesis that a single retrieval call can't satisfy. A question about a customer's contract terms, their account history, and the relevant product documentation requires at minimum three separate retrieval passes, each informed by the previous result.

The Agentic Retriever addresses this directly. Its multihop architecture mirrors what teams were previously building manually with LangChain's MultiQueryRetriever or custom agent loops — but as a managed, latency-optimized primitive.

The AgentCore Integration Signal

The tight coupling between Bedrock Managed Knowledge Base and AgentCore tells you where AWS is going with its agent platform: knowledge retrieval as a native tool in the agent execution loop, not a side-service to call via custom HTTP. This reduces the architecture complexity of Bedrock-native agents significantly — one API surface instead of three (LLM + vector DB + retrieval orchestration).

What Teams Lose by Self-Managing

Teams on non-AWS stacks (Azure OpenAI + AI Search, Google Vertex AI Search, or self-managed pgvector + LangChain) retain full control over chunking strategy, embedding model selection, and retrieval tuning. Bedrock Managed KB makes those trade-offs for you. For most enterprise use cases where the priority is operational simplicity, that's a good trade. For teams with highly specialized retrieval requirements (domain-specific embedding models, bespoke chunking, custom hybrid search weights), the managed approach may hit ceilings that require escalation to the underlying primitives anyway.

The connector coverage also has a notable gap at GA: Google Drive, ServiceNow, and Zendesk are not in the six native connectors. Teams whose primary knowledge source is one of these three will need a custom S3-based pipeline as an intermediate layer.


What to Watch Next

  • AgentCore action groups GA: AWS is expected to GA AgentCore action groups (the tool-calling layer for Bedrock agents) within Q3 2026. The combination of Managed KB for retrieval + action groups for tool execution forms a complete Bedrock-native agent stack. Watch the AWS re:Invent 2026 preview cycle.
  • Connector expansion: The six GA connectors will grow. Google Drive and ServiceNow are the two highest-demand additions based on the pre-GA preview feedback. Expect announcements in the July–September 2026 update cycle.
  • Embedding model flexibility: GA ships with Amazon Titan Embeddings V3 and Cohere Embed v4 as the managed embedding options. Fine-tuned or domain-specific embedding models (via Bedrock Custom Models) are on the roadmap for Q4 2026.
  • Pricing at scale: Managed KB pricing combines per-document ingestion, per-query retrieval, and managed vector storage fees. At high document volumes (>10M chunks), the cost structure vs self-managed pgvector or OpenSearch Serverless needs a detailed TCO analysis before committing.

Source

AWS What's New — Amazon Bedrock Managed Knowledge Base GA (Jun 17, 2026)

Additional coverage: AWS News Blog · AWS Summit NY 2026 Recap

Related on shahvatsal.com:


Want to work together on business transformation?

Visit my personal hub for advisory scope, or connect on LinkedIn. Every engagement is principal-led with measurable outcomes.

Visit Shah Vatsal Connect on LinkedIn Book intro call
Book intro