Sources
Amazon Redshift
Configuration
To connect to your Amazon Redshift cluster and extract data, provide the following configuration parameters:
Key | Description | Sample Value |
---|---|---|
database | Name of the database you want to load data from. | demo |
host | Hostname of your Redshift cluster. | mage-prod.3.us-west-2.redshift.amazonaws.com |
password | Password for the database user. | abc123... |
port | Port where Redshift is running (typically 5439 ). | 5439 |
region | AWS region where your Redshift cluster is located. | us-west-2 |
schema | Schema containing the tables you want to extract. | public |
user | Username with permissions to read and write to the specified schema. | awsuser |
IAM Authentication (Optional)
Alternatively, use IAM database authentication by providing the following parameters:
Key | Description | Sample Value |
---|---|---|
access_key_id | AWS access key for the IAM role or IAM user. | abc123... |
cluster_identifier | Cluster identifier of the Amazon Redshift cluster. | mage-prod |
db_user | Database user for Redshift IAM authentication. | admin |
secret_access_key | AWS secret access key for the IAM role or IAM user. | xyz123 |
Optional Configuration
Key | Description | Sample Value |
---|---|---|
batch_fetch_limit | Number of rows to fetch per batch (defaults to 50,000). Tune higher for larger clusters. | 50000 |
What is Amazon Redshift?
Amazon Redshift is a fully managed, petabyte-scale cloud data warehouse solution designed for large-scale data storage and analytics. Redshift delivers fast query performance by using columnar storage, massively parallel processing (MPP), and advanced compression.
Integrating Amazon Redshift with Mage allows you to automate data extraction from Redshift clusters, streamline ETL pipelines, and centralize data for analytics and business intelligence workloads.
Why Integrate Amazon Redshift with Mage?
- High-performance data extraction: Load massive datasets at scale.
- Flexible authentication: Connect using either traditional credentials or secure IAM authentication.
- Automated ETL pipelines: Move data from Redshift to your data lake or warehouse with zero manual effort.
- Cost-efficient analytics: Optimize Redshift query performance and storage.
- Unified data workflows: Combine Redshift data with other sources to power real-time dashboards and reports.