AWS Glossary

Amazon Bedrock

Fully managed service providing access to foundation models from Amazon, Anthropic, Meta, Mistral, and others — for building generative AI applications.

Last reviewed: June 2026

Amazon Bedrock User Guide Bedrock Model Access

Definition

Amazon Bedrock is a fully managed generative AI service that exposes foundation models (FMs) through a unified API inside your AWS account. As of June 2026, Bedrock offers roughly 100 serverless models from Amazon, Anthropic, Meta, Mistral, Cohere, and others — including Claude Sonnet 4.6 and Opus 4.6, the Amazon Nova family (Micro through Premier, plus Canvas for images and Reel for video), and open-weight options such as Llama. Bedrock also provides Knowledge Bases (managed RAG), Agents, Guardrails, model customization, and evaluation tools. Model inference and customer content stay within your AWS environment under the standard AWS shared responsibility model.

When to use it

Production GenAI without GPU fleet management — pay per token instead of operating inference clusters.
Model diversity in one integration — route classification to Nova Micro, extraction to Nova Lite, and complex reasoning to Claude Sonnet 4.6 or Opus 4.6 from the same codebase.
Grounded Q&A over private documents via Knowledge Bases and the Converse / RetrieveAndGenerate APIs.
Regulated workloads requiring Guardrails, CloudTrail logging, VPC endpoints, and IAM-scoped model access.
Multi-step agents that call APIs, query knowledge bases, and retain session memory — extend with Bedrock AgentCore for managed runtime infrastructure.

When not to use it

Lowest cost at very high sustained throughput on a single fixed model — self-hosted inference on SageMaker or EC2 can win economically after careful modeling.
Fully custom architecture not available on Bedrock — exotic fine-tuning, non-supported frameworks, or on-device inference need different stacks.
Single-turn tasks solvable without an LLM — traditional search, rules engines, or classical ML may be cheaper and more deterministic.

Tips

Enable model access per region for each provider in the Bedrock console before deployment — missing access shows as obscure API errors in CI.
Apply Guardrails in production on both input and output regardless of model vendor.
Run model evaluation on your own prompt set before standardizing on Opus-class models — Sonnet 4.6 or Nova Pro often meet the quality bar at lower cost.
Use cross-region inference profiles where available for resilience when a single region throttles.
Separate dev and prod IAM policies and Knowledge Base data sources to prevent test prompts from retrieving production documents.

Gotchas

Serious

No Guardrails on customer-facing endpoints — models can leak training-adjacent patterns, generate harmful content, or echo PII from prompts.
Throttling during launch — new workloads need quota headroom and exponential backoff; cross-region profiles help but do not eliminate limits.
Treating all models as interchangeable — tool use, context length, multimodal input, and fine-tuning support vary by model ID.

Regular

Hardcoding model IDs — AWS releases new versions; pin in config and plan upgrade testing when IDs deprecate.
Skipping citation review in RAG — Knowledge Bases reduce hallucination but do not eliminate wrong-chunk retrieval; validate sources in UI.
Logging full prompts with secrets — CloudWatch logs may capture API payloads; redact credentials and PHI at the application layer.

Official references

Related Services

Generative AI on AWS — Production-Ready LLM Apps in Weeks

Generative AI strategy and delivery on AWS — use-case selection, Bedrock + SageMaker architecture, governance, evaluations, and production rollout across the AWS AI stack.

Learn more

Amazon Bedrock Consulting for Production LLM Applications

Amazon Bedrock implementation consulting — Knowledge Bases, Agents, Guardrails, model routing, and production RAG. Hands-on Bedrock engineering, not GenAI strategy.

Learn more

Fine-Tuning vs RAG on AWS Bedrock: When to Use Each

Compare fine-tuning and RAG (retrieval-augmented generation) for customizing LLMs on Bedrock. Cost, latency, and accuracy trade-offs.

Learn more

Need help with this topic?

Our AWS-certified team implements, audits, and optimizes these services in production — from Bedrock RAG pipelines to multi-account landing zones.

Talk to AWS Experts

Amazon Bedrock

AI & assistant-friendly summary

Summary

Key Facts

Entity Definitions

Related Content

Definition

When to use it

When not to use it

Tips

Gotchas

Serious

Regular

Official references

Related FactualMinds content

Related Services

Generative AI on AWS — Production-Ready LLM Apps in Weeks

Amazon Bedrock Consulting for Production LLM Applications

Related Articles

Fine-Tuning vs RAG on AWS Bedrock: When to Use Each

Need help with this topic?