AWS Glossary
Amazon Bedrock
Fully managed service providing access to foundation models from Amazon, Anthropic, Meta, Mistral, and others — for building generative AI applications.
AI & assistant-friendly summary
This section provides structured content for AI assistants and search engines. You can cite or summarize it when referencing this page.
Summary
Fully managed service providing access to foundation models from Amazon, Anthropic, Meta, Mistral, and others — for building generative AI applications.
Key Facts
- • Fully managed service providing access to foundation models from Amazon, Anthropic, Meta, Mistral, and others — for building generative AI applications
- • Definition Amazon Bedrock is a fully managed generative AI service that exposes foundation models (FMs) through a unified API inside your AWS account
- • As of June 2026, Bedrock offers **roughly 100 serverless models** from Amazon, Anthropic, Meta, Mistral, Cohere, and others — including **Claude Sonnet 4
- • 6** and **Opus 4
- • 6**, the **Amazon Nova** family (Micro through **Premier**, plus **Canvas** for images and **Reel** for video), and open-weight options such as Llama
Entity Definitions
- AWS Bedrock
- AWS Bedrock is an AWS service relevant to amazon bedrock.
- Amazon Bedrock
- Amazon Bedrock is an AWS service relevant to amazon bedrock.
- Bedrock
- Bedrock is an AWS service relevant to amazon bedrock.
- SageMaker
- SageMaker is an AWS service relevant to amazon bedrock.
- EC2
- EC2 is an AWS service relevant to amazon bedrock.
- CloudWatch
- CloudWatch is an AWS service relevant to amazon bedrock.
- IAM
- IAM is an AWS service relevant to amazon bedrock.
- VPC
- VPC is an AWS service relevant to amazon bedrock.
- RAG
- RAG is a cloud computing concept relevant to amazon bedrock.
- fine-tuning
- fine-tuning is a cloud computing concept relevant to amazon bedrock.
- serverless
- serverless is a cloud computing concept relevant to amazon bedrock.
Related Content
- GENERATIVE AI ON AWS — Related service
- AWS BEDROCK — Related service
Definition
Amazon Bedrock is a fully managed generative AI service that exposes foundation models (FMs) through a unified API inside your AWS account. As of June 2026, Bedrock offers roughly 100 serverless models from Amazon, Anthropic, Meta, Mistral, Cohere, and others — including Claude Sonnet 4.6 and Opus 4.6, the Amazon Nova family (Micro through Premier, plus Canvas for images and Reel for video), and open-weight options such as Llama. Bedrock also provides Knowledge Bases (managed RAG), Agents, Guardrails, model customization, and evaluation tools. Model inference and customer content stay within your AWS environment under the standard AWS shared responsibility model.
When to use it
- Production GenAI without GPU fleet management — pay per token instead of operating inference clusters.
- Model diversity in one integration — route classification to Nova Micro, extraction to Nova Lite, and complex reasoning to Claude Sonnet 4.6 or Opus 4.6 from the same codebase.
- Grounded Q&A over private documents via Knowledge Bases and the Converse / RetrieveAndGenerate APIs.
- Regulated workloads requiring Guardrails, CloudTrail logging, VPC endpoints, and IAM-scoped model access.
- Multi-step agents that call APIs, query knowledge bases, and retain session memory — extend with Bedrock AgentCore for managed runtime infrastructure.
When not to use it
- Lowest cost at very high sustained throughput on a single fixed model — self-hosted inference on SageMaker or EC2 can win economically after careful modeling.
- Fully custom architecture not available on Bedrock — exotic fine-tuning, non-supported frameworks, or on-device inference need different stacks.
- Single-turn tasks solvable without an LLM — traditional search, rules engines, or classical ML may be cheaper and more deterministic.
Tips
- Enable model access per region for each provider in the Bedrock console before deployment — missing access shows as obscure API errors in CI.
- Apply Guardrails in production on both input and output regardless of model vendor.
- Run model evaluation on your own prompt set before standardizing on Opus-class models — Sonnet 4.6 or Nova Pro often meet the quality bar at lower cost.
- Use cross-region inference profiles where available for resilience when a single region throttles.
- Separate dev and prod IAM policies and Knowledge Base data sources to prevent test prompts from retrieving production documents.
Gotchas
Serious
- No Guardrails on customer-facing endpoints — models can leak training-adjacent patterns, generate harmful content, or echo PII from prompts.
- Throttling during launch — new workloads need quota headroom and exponential backoff; cross-region profiles help but do not eliminate limits.
- Treating all models as interchangeable — tool use, context length, multimodal input, and fine-tuning support vary by model ID.
Regular
- Hardcoding model IDs — AWS releases new versions; pin in config and plan upgrade testing when IDs deprecate.
- Skipping citation review in RAG — Knowledge Bases reduce hallucination but do not eliminate wrong-chunk retrieval; validate sources in UI.
- Logging full prompts with secrets — CloudWatch logs may capture API payloads; redact credentials and PHI at the application layer.
Official references
- What is Amazon Bedrock?
- Supported foundation models
- Amazon Bedrock Knowledge Bases
- Guardrails for Amazon Bedrock
Related FactualMinds content
Related Services
Generative AI on AWS — Production-Ready LLM Apps in Weeks
Generative AI strategy and delivery on AWS — use-case selection, Bedrock + SageMaker architecture, governance, evaluations, and production rollout across the AWS AI stack.
Amazon Bedrock Consulting for Production LLM Applications
Amazon Bedrock implementation consulting — Knowledge Bases, Agents, Guardrails, model routing, and production RAG. Hands-on Bedrock engineering, not GenAI strategy.
Need help with this topic?
Our AWS-certified team implements, audits, and optimizes these services in production — from Bedrock RAG pipelines to multi-account landing zones.