To access your data in Microsoft Azure Databricks, create a connection asset for it.
Databricks is a big data analytics tool that is based on Apache Spark.
Supported Databricks Runtime versions
The Microsoft Azure Databricks connection runs on the Azure Cloud runtimes.
Create a connection to Microsoft Azure Databricks
To create the connection asset, you need to enter the connection details and to select an authentication method.
Connection details
- Hostname or IP address of the database
- Port number of the database
- HTTP path: Path of the endpoint for which the server is configured in HTTP transport mode.
Credentials
Choose an authentication method:
- Entra ID token
Microsoft Entra ID is a cloud-based identity and access management service. To obtain connection values for the Entra ID authentication method, sign in to the Microsoft Azure portal. For information about Microsoft Entra ID, see What is Microsoft Entra ID? and Get Microsoft Entra ID tokens for service principals.
- Service principal credentials
Client ID and client secret of the service principal.
A service client principal is a credential created for Microsoft Azure Databricks that is used for automated tools, jobs and applications. For more inforation, see Manage service principals. To create a service client principal, see Use a service principal to authenticate with Azure Databricks.
- Username and password
Username and password for accessing the database.
Choose the method for creating a connection based on where you are in the platform
- In a project
- Click Assets > New asset > Connect to a data source. See Adding a connection to a project.
- In a deployment space
- Click Import assets > Data access > Connection. See Adding data assets to a deployment space.
- In the Platform assets catalog
- Click New connection. See Adding platform connections.
Next step: Add data assets from the connection
Where you can use this connection
You can use the Microsoft Azure Databricks connection in the following workspaces and tools:
Projects
- DataStage (DataStage service). See Connecting to a data source in DataStage.
- Data Virtualization service
- You can connect to this data source from Data Virtualization.
Catalogs
- Platform assets catalog
Microsoft Azure Databricks setup
Get started: Account and workspace setup
Running SQL statements
To ensure that your SQL statements run correctly, refer to the Azure Databricks SQL language reference for the correct syntax.
Learn more
Parent topic: Supported connections