Hive Database Account
In this article
Overview
You can use this account type to connect Hive Snaps with data sources that use the Hive Database Account account.
Prerequisites
A Hive instance.
Limitations
The Hive Snap Pack does not validate with Apache Hive JDBC v1.2.1 jars or earlier because of a defect in Hive. HDP 2.6.3 and HDP 2.6.1 run on Apache Hive JDBC v1.2.1 jars.
To validate Snaps that must work with HDP 2.6.3 and HDP 2.6.1, use JDBC v2.0.0 jars.
Known Issues
"Method not supported" error while validating Apache Hive JDBC v1.2.1 or earlier
Account Settings
Asterisk ( * ): Indicates a mandatory field.
Suggestion icon ( ): Indicates a list that is dynamically populated based on the configuration.
Expression icon ( ): Indicates the value is an expression (if enabled) or a static value (if disabled). Learn more about Using Expressions in SnapLogic.
Add icon ( ): Indicates that you can add fields in the fieldset.
Remove icon ( ): Indicates that you can remove fields from the fieldset.
Field Name | Field Type | Description | |
---|---|---|---|
Label*
Default Value: N/A | String | Specify a unique label for the account. | |
Account properties* | |||
Hostname*
Default Value: N/A
| String/Expression | Specify the server address to connect to. | |
Port Number*
Default Value: 10000 | String/Expression | Specify the database server's port to connect to. | |
Database name*
Default Value: N/A | String/Expression | Specify the name of the database which the account is to be connected to. | |
Username
Default Value: N/A | String/Expression | Specify the username that is allowed to connect to the database. Username will be used as the default username when retrieving connections. The username must be valid in order to set up the data source. | |
Password
Default Value: N/A | String/Expression | Specify the password used to connect to the data source. Password will be used as the default when retrieving connections. The password must be valid in order to set up the data source. | |
JDBC URL
Default Value: N/A | String/Expression | Specify the URL of the JDBC database. | |
JDBC JARs | Use this fieldset to specify the JDBC JAR files to be loaded. A different driver binary for a driver must have a different name, the same name can not be reused for a different driver. If this property is left blank, a default JDBC driver will be loaded. Enter the following JDBC jars to configure the Hive Database account for the respective cluster. For HDP
For CDH
| ||
JDBC Driver Class*
Default Value: org.apache.hive.jdbc.HiveDriver | String | Specify the JDBC Driver class name. For HDP Clusters Enter the following value: org.apache.hive.jdbc.HiveDriver For CDH Clusters Enter the following value: com.cloudera.hive.jdbc4.HS2Driver | |
Advanced properties | |||
Auto commit
Default Value: Selected | Checkbox/Expressions | Select this checkbox to commit a batch immediately after it is executed. If the Snap fails, only the batch being executed at that moment is rolled back. When deselected, the Snap execution output is committed only after all the batches are executed. If the Snap fails, the entire transaction is rolled back, unless the Snap finds invalid input data before it sends the insert request to the server, and routes the error documents to the Error view. For a DB Execute Snap, assume that a stream of documents enter the input view of the Snap and the SQL statement property has JSON paths in the WHERE clause. If the number of documents are large, the Snap executes in more than one batches rather than executing one per each document. Each batch would contain a certain number of WHERE clause values. If Auto commit is turned on, a failure would only roll back the records in the current batch. If Auto commit is turned off, the entire operation would be rolled back. For a single execute statement (with no input view), the setting has no practical effect. | |
Batch size*
Default Value: 50 | Integer/Expressions | Specify the number of statements to execute at a time. Using a large batch size could use up the JDBC placeholder limit of 2100.
| |
Fetch size*
Default Value: 100 | Integer/Expressions | Specify the number of rows to fetch at a time when executing a query. Large values could cause the server to run out of memory.
| |
Max pool size*
Default Value: 50 | Integer/Expressions | Specify the maximum number of idle connections a pool will maintain at a time.
| |
Max lifetime (minutes)* Default Value: 30 | Integer/Expressions | Specify the maximum lifetime of a connection in the pool. Ensure that the value you enter is a few seconds shorter than any database or infrastructure-imposed connection time limit. A value of 0 indicates an infinite lifetime, subject to the Idle Timeout value. An in-use connection is never retired. Connections are removed only after they are closed. | |
Idle Timeout (minutes)*
Default Value: 5 | Integer/Expressions | Specify the maximum amount of time a connection is allowed to sit idle in the pool. A value of 0 indicates that idle connections are never removed from the pool. | |
Checkout timeout (milliseconds)*
Default Value: 10000 | Integer/Expressions | Specify the number of milliseconds to wait for a connection to be available in the pool. Zero waits forever. After set time, then an exception will be thrown and the pipeline will fail.
| |
Url properties
| Use this fieldset to specify properties to use in JDBC Url. These properties will need to be configured when setting up SSL connection. See Advanced Configurations: Configuring Hive with SSL section below for details. | ||
Url property name
Default Value: N/A | String/Expression | Specify a name for the URL property to be used by the account. | |
Url property value
Default Value: N/A | String/Expression | Specify a value for the URL property name.
| |
Hadoop properties | |||
Authentication method*
Default Value: None | Dropdown list | Select the Authentication method to use when connecting to the Hadoop service.
| |
Use Zookeeper
Default Value: Deselected | Checkbox | Select if Zookeeper be used to locate the Hadoop service instead of a specific hostname. If the checkbox is selected, use Zookeeper to resolve the location of the database instead of using the hostname field in the standard block. | |
Zookeeper URL
Default Value: N/A | String | Specify the URL of the Zookeeper service. Zookeeper URL formats are different for CDH and HDP. If you intend to use Zookeeper, then you must provide the following details:
| |
Hive properties | |||
JDBC Subprotocol*
Default Value: Hive | Dropdown list | Specify the JDBC Subprotocol to be used. This is required when the Authentication method is Kerberos. Available options are:
| |
Kerberos properties | Use this fieldset to configure information required for the Kerberos authentication. These properties must be configured if you select Kerberos in the Authentication method property. | ||
Client Principal
Default Value: N/A | String | Specify the principal used to authenticate to Kerberos KDC (Kerberos Key Distribution Center - Network service used by the clients and servers for authentication). | |
Keytab File
Default Value: N/A | String | Specify the Keytab file (file used to store encryption keys) used to authenticate to Kerberos KDC. | |
Service Principal
Default Value: N/A | String | Specify the principal used by an instance of a service. Examples:
|
Snap Pack History
Related Content
Have feedback? Email documentation@snaplogic.com | Ask a question in the SnapLogic Community
© 2017-2024 SnapLogic, Inc.