Table of Contents |
---|
Overview
The Hive Execute Snap in Pack supports KNOX configuration using the Hive Snap Pack require a Cloudera instance to access the Hive database. For the Generic Hive Datbase account to authorize successfully, ensure to create and configure an application corresponding to the accountGeneric Hive Database Account to connect to Hive through the knox reverse proxy by using a given workload user's credentials. The Clouderra Data Platform (CDP) requires authentication through Knox reverse proxy and it uses the Workload user's credentials or a service account. The Knox reverse proxy hides the complex details for managing kerberos based authentication and authorization from the client.
Prerequisites
Valid Cloudera Workload credentials.
Clouder instance.
Info |
---|
This document provides details assuming you have an existing Data Hub Cluster in Cloudera Data Platform. |
Locate Details in Cloudera
To validate your Generic Hive Database account in SnapLogic against the Clouder Knox reverse proxy, you must have the following details:
JDBC URL
JDBC Driver Class
Workload User name and password.
Log in to Cloudera Data Platform portal (https://sso.cloudera.com/) or register yourself if you do not have an account.
The Cloudera home page is displayed.Click your Profile icon and select Cloudera Data Platform. For example, Cloudera Data Platform US West. You will be redirected to the Data Services page.
To demonstrate the procedure of accessing the details from CDP, we are using Cloudera Data Platform US West.Under Data Management, click Management Console.
Click Data Hub Clusters. The list of hub clusters is displayed.
Click the Data Hub Cluster that is active and running.
Under Cloudera Manager Info, click the Endpoints tab.
All services that are running on CDP is displayed.Copy or make note of the JDBC URL of the Hive Server to use it in the Generic Hive Database Account.
Navigate to User Management.
Click Snap Team the <user> and select Profile . Make and make note of the Workload credentials.
Configuring Generic Hive Database Account with Cloudera
Drag the Hive Execute Snap.
Add the Generic Hive Database Account to the Snap.
Enter the Username and Password of your Cloudera workload. (See Step 8 above.)
Enter the JDBC URL of the Hive Server.
Note: By default, the raw URL is got from Cloudera; you have to enter the port number right after the domain and update the SSL value from true to 1; wherein 0 is false and 1 is true. (Refer Step 6 above)
If the cluster is SSL enabled, the default value of SSL is true in JDBC URL; replace it with 1. For example(JDBC URL : jdbc:hive2://data-hub-cluster-1-gateway.snap-tea.j7vi-6hdh.a3.cloudera.site:443/;ssl=1;transportMode=http;httpPath=data-hub-cluster-1/cdp-proxy-api/hive
)Enter the Site number as 443.
If the SSL is enabled on the Hive server, leave SSL as true. If the SSL is not enabled on the server, enter SSL as 0 or 1. Add example url format below after change:
Upload the JDBC Driver. You can download the latest JDBC Hive driver from here.
Specify the JDBC Driver Class as
com.cloudera.hive.jdbc.HS2Driver
.
...
Click Validate. Your account should be validated successfully.