AWS Bedrock Knowledge Bases: Vector vs. Structured Type Comparison
Hi, I’m Dang, an AI engineer at Knowledgelabo, Inc. We provide a service called "Manageboard", which helps our clients aggregate, analyze, and manage scattered internal business data, and we plan to expand its AI capabilities going forward.
In this article, I’ll share some insights from our R&D work, focusing on how to use Bedrock Knowledge Bases effectively.
Introduction
AWS Bedrock is a managed service that allows seamless access to multiple large language models (LLMs), such as Claude and Nova.
Among its key features is the Knowledge Base, which enables LLMs to interact with internal documents and structured data.
There are three types of knowledge bases available in Bedrock:
- Vector store-based (for document-like, unstructured data)
- Structured data-based (SQL-based integration with Redshift)
- Kendra GenAI Index-based (uses Kendra’s high-performance search)
Note: This verification was conducted in the Tokyo region, where Kendra GenAI Index is currently unavailable. Therefore, only the Vector Store and Structured Data types were tested.
Vector Store-Based Knowledge Base
This type of knowledge base uses document embeddings to retrieve semantically similar paragraphs through vector similarity search.
How It Works
- Input data (PDF, TXT, etc.) is split into chunks (chunking)
- Each chunk is converted into an embedding vector
- User queries are also embedded and used to retrieve the most similar chunks
- Retrieved chunks are injected into the prompt and sent to the LLM (see the sketch below)
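To make this flow concrete, here is a minimal boto3 sketch of the retrieval step. The region, knowledge base ID, and query text are placeholders rather than values from our environment; the Retrieve API returns each chunk together with its relevance score.

```python
import boto3

# Runtime client for querying knowledge bases (Tokyo region as an example).
client = boto3.client("bedrock-agent-runtime", region_name="ap-northeast-1")

# "KBID12345" is a placeholder knowledge base ID.
response = client.retrieve(
    knowledgeBaseId="KBID12345",
    retrievalQuery={"text": "What is the expense reimbursement procedure?"},
    retrievalConfiguration={
        "vectorSearchConfiguration": {"numberOfResults": 5},
    },
)

# Each result carries the chunk text, its source location, and a relevance score.
for result in response["retrievalResults"]:
    print(round(result["score"], 3), result["content"]["text"][:80])
```

You can inject the retrieved text into your own prompt, or let the RetrieveAndGenerate API (shown later for the structured type) perform retrieval and answer generation in a single call.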
Setup Steps
- In the Bedrock console, go to “Knowledge Bases” → “Create” → Choose “Knowledge Base with vector store”
- Specify service role and data source (e.g., S3)
- Configure the data source location (S3)
- Choose the embedding model (e.g., Titan Text Embeddings)
- Create the knowledge base and sync the data source (a programmatic sync example follows below)
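Creation and syncing can also be scripted. The sketch below triggers a sync (an ingestion job) through the bedrock-agent API; the knowledge base and data source IDs are placeholders.

```python
import boto3

# Control-plane client for managing knowledge bases (Tokyo region as an example).
agent = boto3.client("bedrock-agent", region_name="ap-northeast-1")

# Placeholder IDs; use the knowledge base and data source created above.
job = agent.start_ingestion_job(
    knowledgeBaseId="KBID12345",
    dataSourceId="DSID12345",
)

# The job runs asynchronously; poll get_ingestion_job to follow its progress.
print(job["ingestionJob"]["status"])
```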
Advantages
- Ideal for FAQs, manuals, and internal documents
- Easy to use with prompt injection
- Chunking strategies can be adjusted to improve retrieval accuracy
Limitations
- Accuracy may drop with images, tables, or diagrams
- Low relevance scores can reduce answer quality
- Paragraph design and splitting rules are crucial for better performance (see the chunking configuration sketch below)
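To illustrate the chunking and splitting points above, here is a hedged sketch of how chunking is configured when registering an S3 data source. The names, ARNs, and chunk sizes are illustrative assumptions, not tuned recommendations.

```python
import boto3

agent = boto3.client("bedrock-agent", region_name="ap-northeast-1")

# Placeholder knowledge base ID, data source name, and bucket ARN.
agent.create_data_source(
    knowledgeBaseId="KBID12345",
    name="internal-docs",
    dataSourceConfiguration={
        "type": "S3",
        "s3Configuration": {"bucketArn": "arn:aws:s3:::example-docs-bucket"},
    },
    # Fixed-size chunking; tune maxTokens/overlapPercentage to your documents.
    vectorIngestionConfiguration={
        "chunkingConfiguration": {
            "chunkingStrategy": "FIXED_SIZE",
            "fixedSizeChunkingConfiguration": {
                "maxTokens": 300,
                "overlapPercentage": 20,
            },
        },
    },
)
```

Hierarchical and semantic chunking strategies are also available and are worth comparing against fixed-size chunking on your own documents.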
Structured Data-Based Knowledge Base
In this type, natural language queries are translated into SQL by the LLM, executed on Redshift, and the query results are used to form the answer.
How It Works
- LLM generates SQL from natural language
- SQL is executed on Redshift
- The result is passed into the prompt and sent to the LLM (see the sketch below)
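Querying the structured type uses the same runtime API as the vector type. In the hedged sketch below, the knowledge base ID and model ARN are placeholders; Bedrock generates the SQL, runs it on Redshift, and returns a natural-language answer built from the results.

```python
import boto3

client = boto3.client("bedrock-agent-runtime", region_name="ap-northeast-1")

# Placeholder knowledge base ID and foundation model ARN.
response = client.retrieve_and_generate(
    input={"text": "What was the total sales amount per month in 2024?"},
    retrieveAndGenerateConfiguration={
        "type": "KNOWLEDGE_BASE",
        "knowledgeBaseConfiguration": {
            "knowledgeBaseId": "KBID67890",
            "modelArn": "arn:aws:bedrock:ap-northeast-1::foundation-model/"
                        "anthropic.claude-3-5-sonnet-20240620-v1:0",
        },
    },
)

# The final answer synthesized from the SQL results.
print(response["output"]["text"])
```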
Setup Steps
- In the Bedrock console, go to “Knowledge Bases” → “Create” → Choose “Knowledge Base with structured data store”
- Configure the service role
- In the query engine settings, connect to your Redshift cluster and database (We verified with Redshift Serverless)
- Create the knowledge base
- On Redshift, grant access to the service role by creating a database user mapped to the IAM role (the "IAMR:" prefix) and granting it read permissions:
CREATE USER "IAMR:[role name]" WITH PASSWORD DISABLE;
GRANT USAGE ON SCHEMA public TO "IAMR:[role name]";
GRANT SELECT ON [table name] TO "IAMR:[role name]";
- Sync the knowledge base
Tip: Use CloudWatch to monitor SQL results and execution times.
Advantages
- Strong for aggregation, filtering, and numerical analysis
- Handles millions of rows without large token usage
- Can reflect the latest data from Redshift
Limitations
- Only Redshift is supported as a data source
- Accuracy decreases with complex SQL schemas
Practical Comparison: Vector vs. Structured
| Aspect | Vector Store-Based | Structured Data-Based |
|---|---|---|
| Data Type | Unstructured documents | Structured data in Redshift |
| Accuracy | Depends on similarity score | Depends on SQL generation accuracy |
| Latency | Short | Depends on SQL execution time |
| Setup | Requires paragraph design and splitting rules | Requires schema descriptions |
| Best Use Case | FAQs, manuals, internal docs | Aggregation, analytics |
Conclusion
AWS Bedrock’s knowledge base feature is powerful, but accuracy depends heavily on configuration and use case:
- Use Vector Store for document-based RAG (retrieval-augmented generation)
- Use Structured Data (Redshift) for numerical analysis and real-time aggregation
In the next article, I’ll dive into how to improve the accuracy of structured knowledge bases by providing schema descriptions.