
Jump-start your Cloud Migration: Matillion Data Loader


As more organizations migrate to the cloud from legacy and traditional databases and warehouses, managers want to know the best route: how do we modernize our data quickly and effectively, in a manner that balances ease of use and cost while keeping data structures familiar for end users (analysts and developers)?

Lucky for us, Matillion recently launched Matillion Data Loader. Matillion is a cloud-based, drag-and-drop ELT application, and Data Loader is its solution for streamlined (easy and low-cost) data pipelines for cloud migration.

Matillion Data Loader will allow us to migrate an existing MySQL database and its structure to our Snowflake warehouse. This is called the “lift and shift” approach, and though we may only need the E (extract) and L (load) of ELT for now, we want to use an application (or set of applications) that provides flexibility in case we want to leverage the cloud for our transformations in the future.

To show just how easy it is, we’ll walk through an example of configuring a pipeline. In our example, we will migrate data from an existing MySQL database, ultimately landing our data in a Snowflake environment.

In order to build our pipeline, we’ll first need to sign up for an account. 

After signing up and logging into the system, you’ll see a mostly blank screen with directions to add a pipeline.

Matillion Data Loader - Pipeline

In order to start leveraging a pipeline, you’ll need to configure source and destination data sources. In our case, this will be MySQL to Snowflake. We’re not limited to MySQL; Matillion Data Loader comes with a wide range of source databases and APIs. Destinations are currently available for the Amazon Redshift, Google BigQuery, and Snowflake cloud data warehouses.

Matillion Data Loader Screenshots

During configuration of the destination data source, you’ll need the following information:

  1. Account: You can find this information via the URL in your browser window while logged into Snowflake. Make sure to ONLY include the information after https:// and before snowflakecomputing.com. For example, if my URL reads https://my-account.us-east-1.snowflakecomputing.com, then I would simply put my-account.us-east-1 in this section.
  2. Username: If possible, I would suggest creating a new user dedicated to Matillion, as this makes it easier to track usage and compute credits (see the sketch after this list).
  3. Password Type: Your organization may have security requirements to use private keys over a password; both are available options.
  4. Password: You’ll add this via the Manage button, as shown below.
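If you do create a dedicated user, the Snowflake setup can be as simple as the following sketch. MATILLION_ROLE, MATILLION_USER, and the placeholder password are hypothetical names used for illustration, not values required by Matillion:

    -- Hypothetical names; substitute your own role, user, and password.
    CREATE ROLE IF NOT EXISTS MATILLION_ROLE;
    CREATE USER IF NOT EXISTS MATILLION_USER
      PASSWORD = '<strong-password-here>'
      DEFAULT_ROLE = MATILLION_ROLE;
    GRANT ROLE MATILLION_ROLE TO USER MATILLION_USER;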

Matillion Data Loader - Destination
You will need to add a password via the Manage button. Doing so will make adding future pipelines more straightforward, eliminating the need to enter your password each time the pipeline runs.

Matillion Data Loader - Destination 2

Next, you’ll need to identify the appropriate role, warehouse, database, and schema. I recommend testing to ensure connectivity before moving on.
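Whichever role you choose will need the usual usage grants on those objects. A minimal sketch, reusing the hypothetical MATILLION_ROLE from above with placeholder warehouse, database, and schema names:

    -- Placeholder object names; substitute your own.
    GRANT USAGE ON WAREHOUSE LOAD_WH TO ROLE MATILLION_ROLE;
    GRANT USAGE ON DATABASE ANALYTICS_DB TO ROLE MATILLION_ROLE;
    GRANT USAGE, CREATE TABLE ON SCHEMA ANALYTICS_DB.PUBLIC TO ROLE MATILLION_ROLE;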

Congrats! You’ve now configured your destination. Next, we’ll move on to our source, which is a similar process.

When navigating through the source configuration, you’ll need the following information: 

  1. JDBC Connection String / URL and database (e.g., jdbc:mysql://mysqlhoststring/database)
  2. Username (ideally a dedicated, read-only user; see the sketch after this list)
  3. Password (add a new password via the Manage button)
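On the MySQL side, giving Matillion its own read-only credentials keeps the extraction from ever modifying source data. A minimal sketch, with a hypothetical user name and the database name from the JDBC example:

    -- Hypothetical read-only user; tighten the '%' host pattern to fit your network.
    CREATE USER 'matillion'@'%' IDENTIFIED BY '<strong-password-here>';
    GRANT SELECT ON database.* TO 'matillion'@'%';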

Click Next until you see the table-selection menu, which will allow you to decide which tables you’d like to load data from.

After selecting your tables, you will have the option of defining field types as well as choosing a column for each table to serve as an incremental field. Each table is limited to a single incremental field. Incremental fields allow for faster data loading, as only the newest rows are migrated instead of performing full table refreshes.
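Conceptually, an incremental field works as a high-water mark: each run pulls only the rows whose value exceeds the maximum seen on the previous run. As a rough illustration (not Matillion’s actual generated SQL), assuming a hypothetical orders table with an updated_at column:

    -- Illustrative only: fetch rows past the high-water mark from the previous run.
    SELECT *
    FROM orders
    WHERE updated_at > '2020-01-01 12:34:56';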

Click Next, and define the database, schema and warehouse (for compute purposes) you want your pipeline to use.  You’ll need to do this twice—once for staging (where data will live temporarily as Matillion loads the data incrementally) and then again as your target (where data will ultimately land for analysis). 
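If you’d like to keep those two areas clearly separated, one simple arrangement is a dedicated staging schema alongside the target schema. The names below are hypothetical, for illustration only:

    -- Hypothetical layout: one schema for Matillion's temporary staging tables,
    -- one for the final tables analysts will query.
    CREATE SCHEMA IF NOT EXISTS ANALYTICS_DB.MATILLION_STAGING;
    CREATE SCHEMA IF NOT EXISTS ANALYTICS_DB.REPORTING;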

Finally, you’ll be presented with scheduling options—configure these as needed but note that each run will leverage compute (and will consume Snowflake usage credits).  Additionally, you can choose to receive failure notices should a pipeline fail. 
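Since every scheduled run spins up the warehouse you selected, it’s worth keeping an eye on consumption. One way to check is via Snowflake’s ACCOUNT_USAGE share; the warehouse name below is a placeholder:

    -- Credits consumed by the load warehouse over the past week
    -- (ACCOUNT_USAGE views can lag real time by a few hours).
    SELECT START_TIME, WAREHOUSE_NAME, CREDITS_USED
    FROM SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY
    WHERE WAREHOUSE_NAME = 'LOAD_WH'
      AND START_TIME >= DATEADD('day', -7, CURRENT_TIMESTAMP())
    ORDER BY START_TIME DESC;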

Congratulations! You’ve now configured your first pipeline! The tool presents you with a view that displays historical run information, the run frequency, and the number of rows migrated. Should you ever need to turn off the pipeline or change its frequency, you can do so using the menu in the top left.

Now that we’ve built our pipeline, we can leverage the power of our Snowflake warehouse to drive our company’s dashboards, analysis, and advanced analytics projects; the possibilities are virtually endless. That said, we’ll need a well-thought-out organizational data strategy as the foundation before doing so.

Reach out to UDig to explore the multitude of cloud tools that allow organizations to get started moving data quickly with minimal cost or support. 
