Setup for Redshift Cross Account IAM Role

In this article

1 Key Components
2 Configure Redshift Cross Account with IAM Role
3 Read and Write policies

Key Components

There are three key components involved with the Redshift Bulk Snaps.

EC2 Instance
Redshift Cluster
S3

These components can reside in the same AWS account or different accounts. After Pipeline execution, these components perform the following operations:

EC2: Archives input data and writes the data into the specified S3 bucket/folder.
ABC.csv.gz -> s3://swat-3032/datalake/raw
Redshift Cluster: Copies the data from S3 to a Redshift temporary table using the COPY command.
COPY "public"."swat3032_update_temp_table_XYZ" ("id", "name", "price")
FROM 's3://swat-3032/datalake/raw/Redshift_load_temp/ABC.csv.gz'
CREDENTIALS '...'
S3: Loads data from a temporary table to a target table. For the Upsert Snap, this will be an UPDATE followed by an INSERT operation.

You can inspect the queries in the Redshift console for more information about each component's operation.

Configure Redshift Cross Account with IAM Role

The following flow chart illustrates the cross-account roles that you should configure for each key component.

The values in the legend indicate the values that you can use in your account configuration.

If all your components are in the same AWS account, you must use a Redshift IAM Account, which means you do not need a Redshift Cross Account IAM Role Account.
If all your components are in different AWS accounts, you must use Redshift Cross Account IAM Role Account for your setup to be successful.

Here are the typical combinations of your Redshift cross IAM Role account configuration, when all the components are in different AWS accounts:

Configuration 1—S3, EC2, and Redshift are in different AWS accounts

When all the components are in three different AWS accounts, the configuration centers around accessing S3. In which case, you need Read (read from S3) and Write (write to S3) permissions for S3. EC2 should be able to write, and Redshift should be able to read.

Configuration 2—Redshift in one AWS account, and EC2 and S3 in another account

When Redshift is in one account and EC2 and S3 are in a different account, you need to define a role with an S3 write policy, define another role in the same account for EC2 to assume that role, and assign this role to the EC2 instance.

Configuration 3—Redshift and S3 in one AWS account and EC2 in another account

When Redshift and S3 are in the same account and EC2 is in another account, you can define a role (role1) with S3 read permission, and then define another role (role2) in the same account to assume that role (role1). Assign role2 to the Redshift Cluster, and role1 to the S3 read field. Alternatively, leave S3 Read blank and ensure the role assigned to the Redshift Cluster has S3 read permission.