Ask the Experts: Data Integration in Azure Synapse Analytics (at Data Toboggan 2021)

On Saturday, June 12th, 2021, I will be moderating an Ask the Experts session at Data Toboggan! This is a free event focusing on Azure Synapse Analytics. There are over twenty sessions and lightning talks scheduled, covering topics such as architecture, performance, tools, data integration, machine learning and much more.

If you have any questions about Data Integration in Azure Synapse Analytics (or Azure Data Factory), join us! You don’t want to miss this session 🤓

Data Toboggan logo showing a toboggan (sled) going down a hill.

Speaking at Global Azure Norway 2021

On Friday, April 16th, 2021, I will be speaking at Global Azure Norway! Global Azure is a virtual 3-day event where communities from around the world organize live streams that are open for everyone to join. All the live streams add up to one global event with more than 500 speakers and sessions, so you should be able to find something interesting 🤓

I will be presenting my session called Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse Analytics. Do you want to learn about something else? You can find all the worldwide sessions on the Global Azure website, or the local sessions on the Global Azure Norway website.

Speaker card showing Cathrine Wilhelmsen presenting at Global Azure Norway.

Speaking at Data Toboggan 2021

On Saturday, January 30th, 2021, I will be speaking at the first Data Toboggan event! This is a free event focusing on Azure Synapse Analytics. There are 14 sessions scheduled in 12 hours, covering topics such as data integration, machine learning, data warehousing, data governance, and more. Join us 🤓

I will be presenting a session called Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse Analytics.

Data Toboggan logo showing a toboggan (sled) going down a hill.

Overview of Azure Data Factory User Interface

Woman standing next to a projector showing the Azure Data Factory logo.

In the previous post, we started by creating an Azure Data Factory, then we navigated to it. In this post, we will navigate inside the Azure Data Factory. Let’s look at the Azure Data Factory user interface and the four Azure Data Factory pages.

Azure Data Factory Pages

On the left side of the screen, you will see the main navigation menu. Click on the arrows to expand and collapse the menu:

Animation of expanding and collapsing the pages menu in the Azure Data Factory user interface.

Renaming the default branch in Azure Data Factory Git repositories from "master" to "main"

Branch icon.

In Azure Data Factory, you can connect to a Git repository using either GitHub or Azure DevOps. When connecting, you have to specify which collaboration branch to use. In most cases, the default branch is used. Historically, the default branch name in Git repositories has been “master”. This is problematic because it is not inclusive and is very offensive to many people.

The Git project, GitHub, and Azure DevOps are making changes to allow users to specify a different default branch name. GitHub and Azure DevOps will be changing their default branch names to “main” in 2020. I fully support this change and will be doing the same in my projects.

In this post, we will go through how to rename the default branch from “master” to “main” in Azure Data Factory Git repositories hosted in GitHub and Azure DevOps. Then we will reconnect Azure Data Factory and configure it to use the new “main” branch as the collaboration branch.

For these examples, I’m using my personal demo projects. I’m not taking into consideration any branch policies, other users, third-party tools, or external dependencies. As always, keep in mind that this is most likely a larger change, both technically and organizationally, in production and enterprise projects. 😊

The Short Version

  1. Create a new “main” branch in your Git repository
  2. Set the new “main” branch as the default branch in your Git repository
  3. Delete the old “master” branch in your Git repository
  4. Disconnect from your Git repository in Azure Data Factory
  5. Reconnect to your Git repository in Azure Data Factory using the new “main” branch as the collaboration branch
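
If you prefer the command line for the Git side of this, steps 1 and 3 can be sketched roughly as below. This is a minimal illustration, not the exact procedure from this post: the helper name `rename_default_branch_commands` and the remote name `origin` are assumptions, and the script only prints the commands instead of running them. Step 2 (changing the default branch) is done in the GitHub or Azure DevOps repository settings, and steps 4 and 5 are done in Azure Data Factory, so they are not covered here.

```python
import shlex


def rename_default_branch_commands(old="master", new="main", remote="origin"):
    """Build the git commands for steps 1 and 3 (hypothetical helper).

    Returns two lists of argument lists: the commands to run before the
    default branch is switched, and the commands to run after.
    """
    create_new = [
        # Make sure the local copy of the old remote branch is up to date
        ["git", "fetch", remote],
        # Step 1: create the new branch from the tip of the old remote branch
        ["git", "branch", "--no-track", new, f"{remote}/{old}"],
        # Publish the new branch and track it
        ["git", "push", "--set-upstream", remote, new],
    ]
    delete_old = [
        # Step 3: delete the old branch on the remote.
        # Only do this AFTER step 2, once the new branch is the default.
        ["git", "push", remote, "--delete", old],
    ]
    return create_new, delete_old


if __name__ == "__main__":
    create_new, delete_old = rename_default_branch_commands()

    print("# Step 1: create and publish the new branch")
    for command in create_new:
        print(shlex.join(command))

    print("# Step 2: set the new branch as default in GitHub / Azure DevOps")

    print("# Step 3: delete the old branch")
    for command in delete_old:
        print(shlex.join(command))
```

Printing the commands first makes it easy to review them before running anything against a real repository. Hosting services will typically refuse to delete the old branch while it is still the default branch, which is why step 2 has to happen in between.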