This article describes how to connect Tableau to a Databricks database and set up the data source.
Before you begin, gather this connection information:
Name of the server that hosts the database you want to connect to
HTTP path to the data source
- Azure Active Directory via OAuth
Note: To use this option, your system administrator must enable “user consent” in your organization's Azure Active Directory tenant. For more information, see Microsoft documentation on configuring user consent.
- Personal Access Token
- Username / Password
- Azure Active Directory via OAuth
Sign in credentials that related to your chosen authentication method:
- Azure AD endpoint
- Personal Access Token password
- Username and password
In Databricks, find your cluster server hostname and HTTP path using the instructions in Construct the JDBC URL on the Databricks website.
(Optional) Initial SQL statement to run every time Tableau connects
This connector requires a driver to talk to the database. You might already have the required driver installed on your computer. If the driver is not installed on your computer, Tableau displays a message in the connection dialog box with a link to the Driver Download(Link opens in a new window) page where you can find driver links and installation instructions.
Note: If you are connecting with Azure Active Directory via OAuth, you will need version 2.6.15 or later for this driver.
Start Tableau and under Connect, select Databricks. For a complete list of data connections, select More under To a Server. Then do the following:
Enter the Server Hostname.
Enter the HTTP Path to the data source.
- Select the Authentication method: Azure Active Directory via OAuth, Personal Access Token, or Username / Password.
Note: You can use the Username / Password option only if single sign-on (SSO) is disabled on the server. Otherwise, use your personal access token.
Enter the sign in credentials for the authentication method you chose:
- For Azure AD, type the URL for the Azure AD Endpoint.
- For personal access token, type the corresponding Password. (See Personal Access Tokens on the Databricks website for information on access tokens.)
- For username and password, type those in the fields provided.
(Optional) Select Initial SQL to specify a SQL command to run at the beginning of every connection, such as when you open the workbook, refresh an extract, sign in to Tableau Server, or publish to Tableau Server. For more information, see Run Initial SQL.
Select Sign In.
If Tableau can't make the connection, verify that your credentials are correct. If you still can't connect, your computer is having trouble locating the server. Contact your network administrator or database administrator.
On the data source page, do the following:
(Optional) Select the default data source name at the top of the page, and then enter a unique data source name for use in Tableau. For example, use a data source naming convention that helps other users of the data source figure out which data source to connect to.
Under Schema, select a schema or use the text box to search for a schema by name.
- Under Table, select a table or use the text box to search for a table by name.
Drag the table to the canvas, and then select the sheet tab to start your analysis.
Use custom SQL to connect to a specific query rather than the entire data source. For more information, see Connect to a Custom SQL Query.
If you use Tableau Desktop on a Mac, when you enter the server name to connect, use a fully qualified domain name, such as mydb.test.ourdomain.lan, instead of a relative domain name, such as mydb or mydb.test.
Alternatively, you can add the domain to the list of Search Domains for the Mac computer so that when you connect, you need to provide only the server name. To update the list of Search Domains, go to System Preferences > Network > Advanced, and then open the DNS tab.
Databricks best practices and troubleshooting
For information about best practices and troubleshooting when using Tableau with Databricks clusters, see the Tableau(Link opens in a new window) topic on the Databricks website.