As more organizations migrate from legacy databases and warehouses to the cloud, managers want to know the best route. How do we modernize our data quickly and effectively, balancing ease of use and cost while keeping data structures familiar to end users, analysts and developers?
Luckily for us, Matillion recently launched Matillion Data Loader. Matillion is a cloud-based, drag-and-drop ELT application, and Data Loader is its solution for streamlined (easy and low-cost) data pipelines for cloud migration.
Matillion Data Loader will allow us to migrate an existing MySQL database and its structure to our Snowflake warehouse. This is called the "lift and shift" approach, and though we may only need the E (extract) and L (load) of ELT for now, we want an application (or set of applications) that provides the flexibility to leverage the cloud for our transformations in the future.
To show just how easy it is, we’ll walk through an example of configuring a pipeline. In our example, we will migrate data from an existing MySQL database ultimately landing our data in a Snowflake environment.
In order to build our pipeline, we’ll first need to sign up for an account.
After signing up and logging into the system, you'll see a mostly blank screen with directions to add a pipeline.
To start building a pipeline, you'll need to configure a source and a destination. In our case, this will be MySQL to Snowflake, but we're not limited to MySQL: Matillion Data Loader supports a wide range of source databases and APIs. Destinations are currently available for the Amazon Redshift, Google BigQuery and Snowflake cloud data warehouses.
When configuring the destination, you'll need the following information:
- Account: You can find this information via the url in your browser window while logged into Snowflake. Make sure to ONLY include information after https:// and before snowflakecomputing.com. For example, if my URL reads https://my-account.us-east-1.snowflakecomputing.com then I would simply put my-account.us-east-1 in this section.
- Username: If possible, I would suggest creating a new user dedicated to Matillion, as this makes it easier to track usage and compute credits (a sketch of setting one up follows this list).
- Password Type: Your organization may have security requirements to use private keys over a password—both are available options.
- Password: You will need to add a password via the Manage button; this makes adding future pipelines more straightforward, eliminating the need to enter your password each time the pipeline runs.
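If you'd like to follow the dedicated-user suggestion above, the role and user can be created with a couple of Snowflake statements. Below is a minimal sketch using the Snowflake Python connector; the account, MATILLION_USER, MATILLION_ROLE and admin credentials are placeholders for illustration, not anything Matillion requires.

```python
import snowflake.connector

# Connect as an administrator allowed to create users and roles (e.g. SECURITYADMIN).
# All names and passwords below are placeholders.
conn = snowflake.connector.connect(
    account="my-account.us-east-1",
    user="ADMIN_USER",
    password="********",
    role="SECURITYADMIN",
)
cur = conn.cursor()
cur.execute("CREATE ROLE IF NOT EXISTS MATILLION_ROLE")
cur.execute(
    "CREATE USER IF NOT EXISTS MATILLION_USER "
    "PASSWORD = 'choose-a-strong-password' DEFAULT_ROLE = MATILLION_ROLE"
)
cur.execute("GRANT ROLE MATILLION_ROLE TO USER MATILLION_USER")
conn.close()
```

You'll also want to grant that role usage on whichever warehouse, database and schema the pipeline will load into.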
Next, you’ll need to identify the appropriate role, warehouse, database and schema. I recommend testing to ensure connectivity.
Congrats! You've now configured your destination. Now we'll need to move on to our source, which is a similar process.
When navigating through the source configuration, you’ll need the following information:
- JDBC connection string / URL and database, e.g. jdbc:mysql://mysqlhoststring/database (a quick connectivity check with these details is sketched after this list)
- Username
- Password (add a new password via the Manage button)
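Before handing those details to Data Loader, it can be worth confirming them independently. Here is a small sketch using the mysql-connector-python package, with the host and database taken from the example JDBC string; your actual host, database and credentials will differ.

```python
import mysql.connector

# Host and database lifted from the example jdbc:mysql://mysqlhoststring/database;
# substitute your own values and credentials.
conn = mysql.connector.connect(
    host="mysqlhoststring",
    database="database",
    user="mysql_user",
    password="********",
)
cur = conn.cursor()
cur.execute("SHOW TABLES")  # list the tables Data Loader will be able to see
for (table_name,) in cur:
    print(table_name)
conn.close()
```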
Click Next until you see the following menu, which allows you to choose the tables you'd like to load data from.
After selecting your tables, you will have the option of defining field types as well as choosing a column to use as an incremental field. Each table is limited to a single incremental field. Incremental fields allow for faster data loading, as only the newest rows are migrated instead of performing full table refreshes.
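To make the incremental idea concrete: each run only pulls rows whose incremental column is newer than the last value seen. Matillion tracks that high-water mark for you; the sketch below is purely illustrative and assumes a hypothetical orders table with an updated_at column.

```python
import mysql.connector

# Hypothetical watermark from the previous run; Matillion stores this between runs,
# it is hard-coded here only to illustrate the idea.
last_loaded = "2024-01-01 00:00:00"

conn = mysql.connector.connect(
    host="mysqlhoststring", database="database",
    user="mysql_user", password="********",
)
cur = conn.cursor()
# Pull only rows newer than the watermark instead of re-reading the whole table.
cur.execute(
    "SELECT * FROM orders WHERE updated_at > %s ORDER BY updated_at",
    (last_loaded,),
)
new_rows = cur.fetchall()
print(f"{len(new_rows)} new or changed rows to load")
conn.close()
```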
Click Next, and define the database, schema and warehouse (for compute purposes) you want your pipeline to use. You’ll need to do this twice—once for staging (where data will live temporarily as Matillion loads the data incrementally) and then again as your target (where data will ultimately land for analysis).
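If the staging schema, target schema or warehouse don't exist yet, they can be created ahead of time in Snowflake. Below is a hedged sketch using the Python connector; ANALYTICS, STAGING, TARGET and MATILLION_WH are placeholder names, and the warehouse settings are simply one reasonable cost-conscious configuration.

```python
import snowflake.connector

# Connect with a role that can create warehouses, databases and schemas (e.g. SYSADMIN).
conn = snowflake.connector.connect(
    account="my-account.us-east-1",
    user="ADMIN_USER",
    password="********",
    role="SYSADMIN",
)
cur = conn.cursor()
# A small warehouse that auto-suspends keeps pipeline compute costs predictable.
cur.execute(
    "CREATE WAREHOUSE IF NOT EXISTS MATILLION_WH "
    "WAREHOUSE_SIZE = 'XSMALL' AUTO_SUSPEND = 60 AUTO_RESUME = TRUE"
)
cur.execute("CREATE DATABASE IF NOT EXISTS ANALYTICS")
cur.execute("CREATE SCHEMA IF NOT EXISTS ANALYTICS.STAGING")  # temporary landing area
cur.execute("CREATE SCHEMA IF NOT EXISTS ANALYTICS.TARGET")   # final tables for analysis
conn.close()
```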
Finally, you’ll be presented with scheduling options—configure these as needed but note that each run will leverage compute (and will consume Snowflake usage credits). Additionally, you can choose to receive failure notices should a pipeline fail.
Congratulations! You've now configured your first pipeline! The tool presents you with a view that displays historical run information as well as the run frequency and the number of rows migrated. Should you ever need to turn off the pipeline or change its frequency, you can do so using the menu in the top left.
Now that we’ve built our pipeline, we can leverage the power of our Snowflake warehouse to power our company’s dashboards, analysis, and advanced analytics projects—the possibilities are virtually endless. That said, at the foundation, we’ll need a well thought out organizational data strategy before doing so.
Reach out to UDig to explore the multitude of cloud tools that allow organizations to get started moving data quickly with minimal cost or support.