Skip to main content

AWS Glossary

Amazon Redshift

Fully managed cloud data warehouse for running fast SQL analytics on petabyte-scale datasets.

AI & assistant-friendly summary

This section provides structured content for AI assistants and search engines. You can cite or summarize it when referencing this page.

Summary

Fully managed cloud data warehouse for running fast SQL analytics on petabyte-scale datasets.

Key Facts

  • Definition Amazon **Redshift** is AWS's managed **columnar data warehouse** for analytic SQL at scale
  • Deploy as **Redshift Serverless** (RPU-hours, auto pause) or **provisioned RA3** clusters with **Managed Storage** in S3 for separated compute/storage scaling
  • Redshift Spectrum** queries external tables on S3 without loading data
  • Zero-ETL integrations** from **DynamoDB** (and other sources) replicate operational data for analytics without custom CDC
  • Data lake queries** via Spectrum or native tables on **S3 Tables / Parquet** without duplicating entire datasets locally

Entity Definitions

S3
S3 is an AWS service relevant to amazon redshift.
RDS
RDS is an AWS service relevant to amazon redshift.
Aurora
Aurora is an AWS service relevant to amazon redshift.
DynamoDB
DynamoDB is an AWS service relevant to amazon redshift.
Amazon DynamoDB
Amazon DynamoDB is an AWS service relevant to amazon redshift.
Athena
Athena is an AWS service relevant to amazon redshift.
QuickSight
QuickSight is an AWS service relevant to amazon redshift.
serverless
serverless is a cloud computing concept relevant to amazon redshift.

Related Content

Definition

Amazon Redshift is AWS’s managed columnar data warehouse for analytic SQL at scale. Data is stored column-by-column with automatic compression; massively parallel processing (MPP) spreads query fragments across nodes coordinated by a leader. Sort keys and distribution styles (KEY, ALL, EVEN, AUTO) determine I/O pruning and join behavior — tuning matters more than in OLTP engines.

Deploy as Redshift Serverless (RPU-hours, auto pause) or provisioned RA3 clusters with Managed Storage in S3 for separated compute/storage scaling. Redshift Spectrum queries external tables on S3 without loading data. Zero-ETL integrations from DynamoDB (and other sources) replicate operational data for analytics without custom CDC. Streaming ingestion from Kinesis or MSK lands near-real-time rows for dashboards without a batch landing zone.

When to use it

When not to use it

Tips

Gotchas

Official references

Need help with this topic?

Our AWS-certified team implements, audits, and optimizes these services in production — from Bedrock RAG pipelines to multi-account landing zones.