Overview

Use this account type to connect Databricks Snaps with data sources that use a Databricks account with Azure Data Lake Storage (ADLS) Gen2 as the source.

Prerequisites

A valid Databricks account.
Certified JDBC JAR File: databricks-jdbc-2.6.25-1.jar

Limitations and Known Issues

None.

Account Settings

Asterisk ( * ): Indicates a mandatory field.
Suggestion icon: Indicates a list that is dynamically populated based on the configuration.
Expression icon: Indicates whether the value is an expression (if enabled) or a static value (if disabled). Learn more about Using Expressions in SnapLogic.
Add icon: Indicates that you can add fields in the fieldset.
Remove icon: Indicates that you can remove fields from the fieldset.

Field Name	Field Type	Field Dependency	Description
Label* Default Value: N/A Example: STD DB Acc DeltaLake AWS ALD	String	None.	Specify a unique label for the account.
Download JDBC Driver Automatically Default Value: Not Selected Example: Selected	Checkbox	None.	Select this checkbox to allow the Snap account to download the certified JDBC Driver for DLP. When this checkbox is selected, the following fields are disabled: JDBC JAR(s) and/or ZIP(s): JDBC Driver, and JDBC driver class. To use a JDBC Driver of your choice, clear this checkbox, upload the required JAR files to SLDB, and select them in the JDBC JAR(s) and/or ZIP(s): JDBC Driver field. Use of custom JDBC JAR version: You can use a JAR file version other than the recommended versions listed. Spark JDBC and Databricks JDBC: If you do not select this checkbox and use a JDBC JAR file older than version 2.6.25, ensure that you use the old-format JDBC URL (`jdbc:spark://`) instead of the new one (`jdbc:databricks://`), and the older JDBC driver class `com.simba.spark.jdbc.Driver` instead of the new `com.databricks.client.jdbc.Driver`. For JDBC driver versions prior to 2.6.25, the JDBC URL starts with `jdbc:spark://`; for version 2.6.25 or later, it starts with `jdbc:databricks://`.
JDBC URL* Default Value: N/A Example: jdbc:spark://adb-2409532680880038.18.azuredatabricks.net:443/default;transportMode=http;ssl=1;httpPath=sql/protocolv1/o/2409532680880038/0326-212833-drier754;AuthMech=3;	String	None.	Enter the JDBC driver connection string for connecting to your DLP instance, using the syntax shown in this description. See Microsoft's JDBC and ODBC drivers and configuration parameters for more information. Syntax: `jdbc:spark://dbc-ede87531-a2ce.cloud.databricks.com:443/default;transportMode=http;ssl=1;httpPath=sql/protocolv1/o/6968995337014351/0521-394181-guess934;AuthMech=3;UID=token;PWD=<personal-access-token>` Avoid passing the password inside the JDBC URL: If you specify the password inside the JDBC URL, it is saved as-is and is not encrypted. We recommend passing your password using the Password field provided instead, so that it is encrypted.
Use Token Based Authentication Default value: Selected Example: Not selected	Checkbox	None.	Select this checkbox to use token-based authentication for connecting to the target database (DLP) instance. Activates the Token field.
Token* Default value: N/A Example: <Encrypted>	String	Use Token Based Authentication checkbox is selected.	Enter the token value for accessing the target database/folder path.
Database name* Default value: N/A Example: Default	String	None.	Enter the name of the database to use by default. This database is used if you do not specify one in the Databricks Select or Databricks Insert Snaps.
Source/Target Location* Default value: N/A Example: Default	Dropdown list	None.	Select the source or target data warehouse into which the queries must be loaded, that is, ADLS Gen2. This activates the following fields: Azure storage account name, Azure Container, Azure folder, Azure Auth Type, SAS Token, and Azure storage account key.
Azure storage account name* Default value: N/A Example: tonyblob	String	Source is ADLS Gen2.	Enter the name with which Azure Storage was created. The Bulk Load Snap automatically appends the '.blob.core.windows.net' domain to the value of this property.
Azure Container* Default value: N/A Example: sl-bigdata-qa	String	Source is ADLS Gen2.	Enter the name of an existing Azure container.
Azure folder* Default value: N/A Example: test-data	String	Source is ADLS Gen2.	Enter the name of an existing Azure folder to be used within the container for hosting files.
Azure Auth Type Default value: Shared Access Signature Example: Shared Access Signature	Dropdown list	Source is ADLS Gen2.	Select the authorization type that you want to use while setting up the account. The available options are Storage account key and Shared Access Signature. Select Shared Access Signature when you want to enter the SAS Token associated with the Azure storage account.
SAS Token* Default value: N/A Example: ?sv=2020-08-05&st=2020-08-29T22%3A18%3A26Z&se=2020-08-30T02%3A23%3A26Z&sr=b&sp=rw&sip=198.1.2.60-198.1.2.70&spr=https&sig=A%1DEFGH1Ijk2Lm3noI3OlWTjEg2tYkboXr1P9ZUXDtkk%3D	String	Azure Auth Type is Shared Access Signature.	Enter the SAS token, which is the part of the SAS URI associated with your Azure storage account. See Getting Started with SAS for details.
Azure storage account key* Default value: N/A Example: ABCDEFGHIJKL1MNOPQRS	String	Azure Auth Type is Storage account key.	Enter the access key ID associated with your Azure storage account.
Advanced Properties	Other parameters that you want to specify to configure the account. This field set consists of the following fields: URL Properties, Batch Size, Fetch Size, Min Pool Size, Max Pool Size, and Max Life Time.
URL properties	Use this field set to define the account parameter's name and its corresponding value. Click + to add the parameters and the corresponding values. Add each URL property-value pair in a separate row. It consists of the following fields: URL property name and URL property value.
URL property name Default Value: N/A Example: queryTimeout	N/A	None	Specify the name of the parameter for the URL property.
URL property value Default Value: N/A Example: 0	N/A	None	Specify the value for the URL property parameter.
Batch size* Default Value: N/A Example: 3	Integer	None	Specify the number of queries that you want to execute at a time. If the Batch Size is one, the query is executed as-is; that is, the Snap skips batching (non-batch execution). If the Batch Size is greater than one, the Snap performs regular batch execution.
Fetch size* Default Value: 100 Example: 12	Integer	None	Specify the number of rows a query must fetch for each execution. Large values could cause the server to run out of memory.
Min pool size* Default Value: 3 Example: 0	Integer	None	Specify the minimum number of idle connections that you want the pool to maintain at a time.
Max pool size* Default Value: 15 Example: 0	Integer	None	Specify the maximum number of connections that you want the pool to maintain at a time.
Max life time* Default Value: 60 Example: 50	Integer	None	Specify the maximum lifetime of a connection in the pool, in seconds. Ensure that the value you enter is a few seconds shorter than any database or infrastructure-imposed connection time limit. 0 (zero) indicates an infinite lifetime, subject to the Idle Timeout value. An in-use connection is never retired. Connections are removed only after they are closed. Minimum value: 0 Maximum value: No limit
Idle Timeout* Default Value: 5 Example: 4	Integer	None	Specify the maximum amount of time in seconds that a connection is allowed to sit idle in the pool. 0 (zero) indicates that idle connections are never removed from the pool. Minimum value: 0 Maximum value: No limit
Checkout timeout* Default Value: 10000 Example: 9000	Integer	None	Specify the maximum time in milliseconds you want the system to wait for a connection to become available when the pool is exhausted. Minimum value: 0 Maximum value: No limit
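
The driver-version rule from the Download JDBC Driver Automatically row and the URL shape from the JDBC URL row can be sketched in Python. This is a minimal illustration, not SnapLogic's implementation: the names `driver_settings` and `build_jdbc_url` are hypothetical, and appending URL properties as `name=value;` pairs is an assumption based on the example URL in the table.

```python
from typing import Dict, NamedTuple, Optional, Tuple


class DriverSettings(NamedTuple):
    url_prefix: str
    driver_class: str


# Driver version at which the URL prefix and driver class change (per the table).
CUTOVER: Tuple[int, ...] = (2, 6, 25)


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn '2.6.25' or '2.6.25-1' into a tuple of ints; the build suffix is ignored."""
    core = version.split("-", 1)[0]
    return tuple(int(part) for part in core.split("."))


def driver_settings(version: str) -> DriverSettings:
    """Pick the URL prefix and driver class for a given JDBC driver version."""
    if parse_version(version) >= CUTOVER:
        return DriverSettings("jdbc:databricks://", "com.databricks.client.jdbc.Driver")
    return DriverSettings("jdbc:spark://", "com.simba.spark.jdbc.Driver")


def build_jdbc_url(version: str, host: str, http_path: str, database: str = "default",
                   url_properties: Optional[Dict[str, str]] = None) -> str:
    """Assemble a URL shaped like the table's example, without embedding a password."""
    props = {"transportMode": "http", "ssl": "1", "httpPath": http_path, "AuthMech": "3"}
    # Assumption: extra URL properties are appended as name=value; pairs.
    props.update(url_properties or {})
    if any(name.upper() == "PWD" for name in props):
        # The table advises against passing the password in the URL (stored unencrypted).
        raise ValueError("Do not pass PWD in the JDBC URL; use the account's secure field.")
    prefix = driver_settings(version).url_prefix
    tail = "".join(f"{name}={value};" for name, value in props.items())
    return f"{prefix}{host}:443/{database};{tail}"
```

For example, `driver_settings("2.6.25-1")` selects the `jdbc:databricks://` prefix, while `driver_settings("2.6.22")` selects `jdbc:spark://`.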

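The defaults and units of the pool-related fields in the Account Settings table can be collected into a small Python sketch. The `PoolSettings` class is hypothetical and is not part of any SnapLogic API. The checks that the batch size is at least 1 and that the minimum pool size does not exceed the maximum are assumptions for illustration; the table does not state them.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PoolSettings:
    batch_size: int                   # no default listed in the table
    fetch_size: int = 100             # rows fetched per execution
    min_pool_size: int = 3            # idle connections kept in the pool
    max_pool_size: int = 15           # upper bound on pooled connections
    max_life_time_s: int = 60         # seconds; 0 means infinite lifetime
    idle_timeout_s: int = 5           # seconds; 0 means idle connections are never removed
    checkout_timeout_ms: int = 10000  # milliseconds, unlike the two fields above

    def uses_batch_execution(self) -> bool:
        """Per the table, a batch size of one means non-batch execution."""
        return self.batch_size > 1

    def problems(self) -> List[str]:
        """Return human-readable issues; an empty list means none were found."""
        issues = [f"{name} must not be negative"
                  for name, value in vars(self).items() if value < 0]
        if self.batch_size < 1:
            issues.append("batch_size should be at least 1")  # assumption
        if self.min_pool_size > self.max_pool_size:
            issues.append("min_pool_size exceeds max_pool_size")  # assumption
        return issues
```

With the table's example batch size, `PoolSettings(batch_size=3).problems()` returns an empty list. Note the unit difference: the lifetime and idle timeout are in seconds, while the checkout timeout is in milliseconds.
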
Troubleshooting

Error	Reason	Resolution
Account validation failed.	The Pipeline ended before the batch could complete execution due to a connection error.	Verify that the Refresh token field is configured to handle the inputs properly. If you are not sure when the input data is available, configure this field as zero to keep the connection always open.

Snap Pack History

Release	Snap Pack Version	Date	Type	Updates
February 2025	main29887	12 Feb 2025	Stable	Enhanced the following Databricks Snaps to support the Unity catalog, which enables you to access catalog data from the database: Databricks - Bulk Load, Databricks - Delete, Databricks - Insert, Databricks - Merge Into, Databricks - Select, and Databricks - Unload.
November 2024	main29029	13 Nov 2024	Stable	Updated and certified against the current SnapLogic Platform release.
August 2024	main27765	21 Aug 2024	Stable	Upgraded the `org.json.json` library from v20090211 to v20240303, which is fully backward compatible.
May 2024	437patches27246	08 Aug 2024	Latest	Added Databricks - Run Job. This Snap executes a job, checks its status in Databricks, and, based on the job's status, completes or fails the pipeline.
May 2024	437patches26400	15 May 2024	Latest	Fixed an invalid session handle issue with the Databricks Snap Pack that intermittently triggered an error message when the Snaps failed to connect with Databricks to execute the SQL statement.
May 2024	main26341	08 May 2024	Stable	Updated the Delete Condition (Truncates a Table if empty) field in the Databricks - Delete Snap to Delete condition (deletes all records from a table if left blank) to indicate that all entries will be deleted from the table when this field is blank, but no truncate operation is performed.
February 2024	main25112	14 Feb 2024	Stable	Updated and certified against the current SnapLogic Platform release.
November 2023	main23721	08 Nov 2023	Stable	Updated and certified against the current SnapLogic Platform release.
August 2023	main22460	16 Aug 2023	Stable	Updated and certified against the current SnapLogic Platform release.
May 2023	433patches21630	28 Jun 2023	Latest	Enhanced the performance of the Databricks - Insert Snap to improve the amount of time it takes for validation.
May 2023	main21015	10 May 2023	Stable	Upgraded with the latest SnapLogic Platform release.
February 2023	main19844	09 Feb 2023	Stable	Upgraded with the latest SnapLogic Platform release.
November 2022	main18944	10 Nov 2022	Stable	The Databricks - Insert Snap now creates the target table only from the table metadata of the second input view when all of the following conditions are met: the Create table if not present checkbox is selected, the target table does not exist, and the table metadata is provided in the second input view.
September 2022	430patches18305	29 Sep 2022	Latest	The name of the Databricks - Multi Execute Snap is simplified to Databricks - Execute Snap. The Use Result Query checkbox in the Databricks - Execute Snap enables you to include in the Snap's output the result of running (during validation) each SQL statement specified in the Snap. The Retry mechanism for the Databricks Snap Pack enables the following Databricks Snaps to repeat the selected operation for the specified number of times when the Snap account connection fails or times out: Databricks - Delete, Databricks - Insert, Databricks - Select, Databricks - Execute, Databricks - Bulk Load (when the Source Type is Input View), and Databricks - Merge Into (when the Source Type is Input View). The following fields are added to each Databricks Snap as part of this enhancement: Number of Retries: The number of attempts the Snap should make to perform the selected operation when the Snap account connection fails or times out; Retry Interval (seconds): The time interval in seconds between two consecutive retry attempts.
September 2022	430patches17796	28 Sep 2022	Latest	The Manage Queued Queries property in the Databricks Snap Pack enables you to decide whether a given Snap should continue or cancel executing the queued Databricks SQL queries.
August 2022	main17386	11 Aug 2022	Stable	Upgraded with the latest SnapLogic Platform release.
4.29.2.0	42920rc17045	15 Jul 2022	Latest	A new Snap Pack for Databricks Lakehouse Platform (Databricks or DLP) introduces the following Snaps: Databricks - Select: Retrieves information from the target Databricks table. Databricks - Insert: Inserts new rows of data in the target Databricks table. Databricks - Delete: Deletes data from a target Databricks table. Databricks - Bulk Load: Loads millions of rows of data in the target table through a single load operation. Databricks - Unload: Unloads data from a target Databricks table through a single unload operation. Databricks - Merge Into: Updates millions of existing rows and inserts new rows in a target Databricks table through a single operation. Databricks - Multi Execute: Runs multiple SQL statements on the target Databricks instance.

Databricks Account (Source: ADLS Gen2)