The Cloud Guru

Manually Migrate Data Between Redshift Clusters

by Team TCG
March 3, 2023
in AWS, Data Analytics, Design, HOW To's, Serverless, Tutorials

Introduction

You have been asked to solve a few pain points around your company’s Redshift solution. The original Redshift cluster that was launched for the company’s analytics stack has become underpowered over time. Several groups want to create incremental backups of certain tables to S3 in a format that can be plugged into data lake solutions, and other groups want select pieces of the main Redshift schema split off into new department-specific clusters.

You’ve come up with a plan to utilize the UNLOAD and COPY commands to facilitate all of the above and need to test a proof of concept to ensure that all pain points above can be addressed in this manner.

We’ll use the Redshift UNLOAD and COPY commands to migrate data from an existing Redshift cluster to a new cluster, which we will launch in the course of completing the lab.

Solution

Log in to the AWS Management Console using the credentials provided for the lab. Make sure you’re in the us-east-1 region.

Investigate the Lab Environment

  1. Navigate to S3 using the Services menu or the unified search bar.
  2. Select the users-data-<ACCOUNT_NUMBER> bucket and note the bucket name. There are currently no objects in the bucket.
  3. Navigate to Amazon Redshift using the Services menu or the unified search bar.
  4. Select the users-cluster link to review the cluster’s configuration. You will recreate this cluster with the ID users-cluster-2.
  5. Select Editor from the sidebar menu to open the query editor.
  6. Click Connect to Database and fill in the connection details:
    • Connection: Create a new connection
    • Authentication: Temporary credentials
    • Cluster: users-cluster (Available)
    • Database name: users
    • Database user: users_admin
  7. After the database connection details are filled in, click Connect.
  8. In the Resources pane on the left, use the Select schema dropdown to select public. You should see 2 objects in the schema: the users_data table and its primary key, users_data_pkey.
  9. In the query editor, run the following query to review some sample data:
     select * from users_data limit 10;

Launch the Target Redshift Cluster

  1. Select Clusters from the sidebar menu, then click Create cluster.
  2. Configure the Cluster configuration settings:
    • Cluster identifier: users-cluster-2
    • Select the Free trial option to automatically set your cluster configuration.
  3. Configure the Database configurations settings:
    • Admin user name: users_admin
    • Admin user password: Enter a password (e.g., Strongpass1).
  4. Leave all other default settings and click Create cluster. The cluster takes some time to create.
  5. After the cluster is created, open it and select the Properties tab.
  6. Scroll down to Associated IAM roles and click Associate IAM role.
  7. Select the RedshiftS3 role and click Associate IAM roles.

Copy the Existing Redshift Table to S3

  1. Select Editor from the sidebar menu to open the query editor.
  2. Copy the UNLOAD statement and paste it into the query editor:
     UNLOAD ('select * from users_data')
     TO '<users-data-bucket>'
     IAM_ROLE '<RedshiftS3 ARN>'
     FORMAT AS PARQUET;
  3. Replace <users-data-bucket> with your S3 bucket name and preface the bucket name with s3://, so the bucket name follows the format s3://<users-data-bucket>.
  4. Replace <RedshiftS3 ARN> with the ARN provided in the lab resources.
  5. Click Run. If your session times out, reconnect to the users-cluster database.
  6. Open S3 in a new browser tab and select the users-data-<ACCOUNT_NUMBER> bucket name. You should see 2 .parquet objects.
  7. Go back to the query editor and click Change connection.
  8. Update the connection details:
    • Connection: Create a new connection
    • Authentication: Temporary credentials
    • Cluster: users-cluster-2 (Available)
    • Database name: dev
    • Database user: users_admin
  9. Click Connect to connect to the cluster.
  10. In the Resources pane on the left, use the Select schema dropdown to select public. Note that there are some default tables in the schema; however, you must create a table with the appropriate columns before you can copy data to the new cluster.
  11. Copy the create table statement and paste it into the query editor:
      create table users_data(
        id_value varchar(64),
        name_first varchar(64),
        name_last varchar(64),
        location_country varchar(32),
        dob_age int,
        picture_large varchar(64),
        primary key(id_value)
      )
      distkey(location_country)
      compound sortkey(id_value);
  12. Click Run. You should now see 2 new objects in the public schema: the users_data table and its primary key, users_data_pkey.
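
The placeholder substitution in steps 2 through 4 is easy to get wrong by hand, especially the s3:// prefix. As a minimal sketch, assuming you prefer to generate the statement rather than edit it in place, the Python helper below fills in the lab's UNLOAD template. The function names, the account number, and the ARN are hypothetical and used only for illustration; the SQL text itself is the statement from the lab.

```python
def s3_uri(bucket_name: str) -> str:
    """Return the bucket name with an s3:// prefix, adding it only if missing."""
    name = bucket_name.strip()
    return name if name.startswith("s3://") else f"s3://{name}"


def build_unload(bucket_name: str, role_arn: str, table: str = "users_data") -> str:
    """Fill in the lab's UNLOAD template for one table."""
    return (
        f"UNLOAD ('select * from {table}') "
        f"TO '{s3_uri(bucket_name)}' "
        f"IAM_ROLE '{role_arn}' "
        "FORMAT AS PARQUET;"
    )


if __name__ == "__main__":
    # Placeholder account number and role ARN, for illustration only.
    print(build_unload(
        "users-data-123456789012",
        "arn:aws:iam::123456789012:role/RedshiftS3",
    ))
```

Paste the printed statement into the query editor while connected to users-cluster. The helper only builds text; it does not connect to Redshift or S3.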

Copy Data from S3 to the Newly Launched Redshift Cluster

  1. Copy the COPY statement and paste it into the query editor:
     COPY users_data
     FROM '<users-data-bucket>'
     IAM_ROLE '<RedshiftS3 ARN>'
     FORMAT AS PARQUET;
  2. Replace <users-data-bucket> with your S3 bucket name and preface the bucket name with s3://, so the bucket name follows the format s3://<users-data-bucket>.
  3. Replace <RedshiftS3 ARN> with the ARN provided in the lab resources.
  4. Click Run to run the query.
  5. After the COPY query is complete, run the following query to view some sample data:
     select * from users_data limit 10;

This data should be identical to the data you originally saw in users-cluster.
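
A common slip at this stage is running the COPY statement with a placeholder still in it. The sketch below, under the assumption that the role lives in the standard "aws" partition (as it would for the lab's us-east-1 region), builds the COPY statement and refuses any value that does not look like an IAM role ARN. The function name, pattern, and sample values are hypothetical; only the SQL text comes from the lab.

```python
import re

# Assumption: standard "aws" partition and a 12-digit account ID.
ROLE_ARN_PATTERN = re.compile(r"^arn:aws:iam::\d{12}:role/.+$")


def build_copy(bucket_name: str, role_arn: str, table: str = "users_data") -> str:
    """Fill in the lab's COPY template, rejecting an unreplaced ARN placeholder."""
    if not ROLE_ARN_PATTERN.match(role_arn):
        raise ValueError(f"not an IAM role ARN: {role_arn!r}")
    # Add the s3:// prefix only when the caller has not already included it.
    source = bucket_name if bucket_name.startswith("s3://") else f"s3://{bucket_name}"
    return (
        f"COPY {table} FROM '{source}' "
        f"IAM_ROLE '{role_arn}' "
        "FORMAT AS PARQUET;"
    )


if __name__ == "__main__":
    # Placeholder account number and role ARN, for illustration only.
    print(build_copy(
        "users-data-123456789012",
        "arn:aws:iam::123456789012:role/RedshiftS3",
    ))
```

Run the printed statement while connected to users-cluster-2, after the users_data table has been created there.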

Check Your Data

  1. To verify the data is identical for both clusters, review the data in your users-cluster-2 cluster and note a couple of the entries.
  2. Click Change connection and fill in the connection details:
    • Connection: Use a recent connection
    • Recent connection: users-cluster (Available)
  3. Click Connect to connect to the cluster. Your select * from users_data limit 10; query should still be available in the query editor.
  4. Click Run to run the query again. Your data should be the same as the data you noted from users-cluster-2.
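
One caveat on the spot check: a query with limit 10 and no order by is not guaranteed to return the same ten rows on both clusters, so a visual mismatch does not necessarily mean the migration failed. Running select * from users_data order by id_value limit 10; on each cluster makes the samples comparable. If you export both result sets, a comparison along the following lines is one option; it is a minimal sketch, and the function name and sample rows are made up for illustration.

```python
from collections import Counter


def same_rows(source_rows, target_rows) -> bool:
    """True when both result sets hold the same rows, ignoring row order."""
    return Counter(map(tuple, source_rows)) == Counter(map(tuple, target_rows))


if __name__ == "__main__":
    # Made-up sample rows (id_value, name_first, location_country).
    from_users_cluster = [("a1", "Ana", "BR"), ("b2", "Ben", "US")]
    from_users_cluster_2 = [("b2", "Ben", "US"), ("a1", "Ana", "BR")]
    print(same_rows(from_users_cluster, from_users_cluster_2))  # prints True
```

Counter is used instead of set so that duplicated rows are counted rather than collapsed, which would hide a row that was loaded twice.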
© 2023 The Cloud Guru - Let's Simplify !!
