Basic config
bucket: The S3 bucket you want to use to store the data.prefix: The S3 path prefix. The whole S3 path will bes3://{bucket}/{prefix}.file_type:parquet,csv, ordelta(for Delta Lake tables)buffer_size_mbandbuffer_timeout_seconds: Mage puts messages in a buffer before uploading to S3. You can configure the size and timeout of the buffer to control the file size and the delay.
region parameter in the default config template, it can be ignored. If you specify any valid AWS region names or null, you can still access to S3 buckets in other regions. However, if you do include the parameter value for region, it must be a valid region name for AWS or null so it won’t error out.
Configure time-based partition
date_partition_format
to the config. Example values: %Y%m%d, %Y%m%dT%H.
Delta Lake configuration
Note: Delta Lake format (file_type: delta) is a Mage Pro only feature.
Only in Mage Pro.Try our fully managed solution to access this advanced feature.
file_type: delta, you can write data to Delta Lake tables in Amazon S3.
table_uri: (Optional) Full URI for the Delta Lake table (e.g.,s3://bucket/path/to/table). If not provided, the URI will be automatically constructed frombucketandprefix.mode: Write mode for Delta Lake tables. Valid values:append: Adds new rows to the existing table (default)overwrite: Replaces all existing rows in the table
aws_access_key_id: (Optional) AWS access key ID. If not provided, will fall back to theAWS_ACCESS_KEY_IDenvironment variable or the IAM role attached to the instance. See Referencing Variables for configuring credentials via environment variables or secret variables.aws_secret_access_key: (Optional) AWS secret access key. If not provided, will fall back to theAWS_SECRET_ACCESS_KEYenvironment variable or the IAM role attached to the instance. See Referencing Variables for configuring credentials via environment variables or secret variables.aws_region: (Optional) AWS region. If not provided, will fall back to theAWS_REGIONenvironment variable or default tous-west-2.
date_partition_format) is not applied as Delta Lake manages its own internal file structure.
Authentication
Here are the options to authenticate with the AWS S3 bucket.- Add the following keys and values to your environment variables
AWS_ACCESS_KEY_IDAWS_SECRET_ACCESS_KEYAWS_REGION
- If you deploy Mage on AWS ECS cluster, you can use ECS execution task role to authenticate. You can grant the ECS task permissions to access other AWS services by attaching IAM policies to this ECS task execution role.