Tag: rag

Generative AI

Apr 7, 2026 palaniappan p 3 min

Fine-Tuning vs RAG on AWS Bedrock: When to Use Each

Compare fine-tuning and RAG (retrieval-augmented generation) for customizing LLMs on Bedrock, with the cost, latency, and accuracy trade-offs of each.
bedrock
rag
fine-tuning
llm
generative-ai
Generative AI

Apr 3, 2026 palaniappan p 6 min

How to Build a RAG Pipeline with Amazon Bedrock Knowledge Bases

Amazon Bedrock Knowledge Bases automate the RAG (Retrieval-Augmented Generation) pipeline: chunking, embedding, semantic search, and context injection into Claude or other foundation models. This guide covers setup, data ingestion, cost optimization, and production patterns.
how-to-guide
bedrock
genai
rag
knowledge-bases
llm
aws
Generative AI

Mar 9, 2026 palaniappan p 12 min

S3 Vectors: 10,000 Results per Query (June 2026)

On June 16, 2026, S3 Vectors raised the QueryVectors limit to 10,000 results per query and cut data-processed charges by up to 80% on indexes over 10M vectors. Covers architecture, pagination, and a cost comparison vs OpenSearch and MemoryDB.
s3-vectors
vector-storage
rag
bedrock
aws-ai
Cloud Architecture

Jan 5, 2026 palaniappan p 9 min

Amazon MemoryDB with Vector Search: Durable Redis-Compatible Storage for AI Workloads

ElastiCache loses your AI chatbot's session memory at every node replacement. MemoryDB doesn't. A decision framework for when to pick MemoryDB over ElastiCache, OpenSearch Serverless, and S3 Vectors for AI workloads — with the latency math and the failure mode that forces the switch.
memorydb
vector-search
redis
rag
aws-database