Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Table of Contents
minLevel1
maxLevel7

Overview

The The Hive Snap Pack supports KNOX configuration using the  supports Apache Knox authentication to connect to the Hive service on the Cloudera Data Platform (CDP)'s Data Hub clusters. Using the Workload user's credentials, you can use a Generic Hive Database Account to connect to the Hive database through the knox Knox reverse proxy by using a given workload user's credentials. The Clouderra Data Platform (CDP) requires authentication through . The Cloudera Workload user credentials are used to authenticate with the Knox reverse proxy and it uses the Workload user's credentials or a service accountsubsequently connect to the Hive database. The Knox reverse proxy hides the complex details for managing kerberos based the authentication and authorization from based on the clientKerberos protocol.

Prerequisites

  • Valid Cloudera Workload user’s credentials.

  • Clouder Cloudera instance.

Info

This document article provides details assuming you have , for an existing Data Hub Cluster in the Cloudera Data Platform.

Locate Details in the Cloudera Data Platform

To validate your Generic Hive Database account in the SnapLogic against platform with the Clouder Cloudera Knox reverse proxy, you must have the following details:

  • JDBC URL

  • JDBC Driver Class

  • Workload User name and password.

  1. Log in to Cloudera Data Platform the CDP portal (https://sso.cloudera.com) or register yourself if you do not have an account.
    The Cloudera MyCloudera home page is displayedopens.

    Image Added

  2. Click your Profile icon and select the Cloudera Data Platform. For example, Cloudera Data Platform US West. You will be are redirected to the Cloudera Data Services Platform page.
    To Note: We are using Cloudera Data Platform US West to demonstrate the procedure of accessing the details from CDP, we are using Cloudera Data Platform US Westthe CDP.


  3. Under Data Management, click the Management Console card.

  4. Click Data Hub Clusters. The list of hub clusters is displayed.Click and select the Data Hub Cluster that is active and running. The list of hub clusters is displayed.

    Image Removedactive-data-hub-cluster.pngImage Addedendpoints.pngImage Added

  5. Under Cloudera Manager Info, click the Endpoints tab.
    All services that are running on the CDP is displayeddisplay.

  6. Copy or make note of the JDBC URL of the Hive Server to use it in the Generic Hive Database Account.

    Image Removedhive-server-url.pngImage Added


  7. Navigate to User Management.Click in the left navigation pane, click the <user> and , select Profile, and make note of the Workload credentials.

    Image RemovedImage Added


Configuring a Generic Hive Database Account with

...

Settings from the CDP

  1. Drag the Hive Execute Snap.

  2. Add the Generic Hive Database Account account to the Snap.

  3. Enter the Username and Password of your Cloudera workloadWorkload instance. (See Refer to Step 8 7 above.)

  4. Enter the JDBC URL of the Hive Server.
    Note: By default, the raw URL is got from Cloudera; you have to (refer to the Step 6 in the section above for the raw URL format) is obtained from the CDP; you must enter the port number right after the domain and update the SSL value from true to 1; , wherein 0 is false and 1 is true. (Refer Step 6 above)

    1. Enter the Site number as 443.

    2. If the

    cluster
    1. SSL is

    SSL
    1. enabled on the Hive server,

    the default value of SSL is true in JDBC URL; replace it with 1. For example (JDBC URL : jdbc
    1. leave SSL as true. If the SSL is not enabled on the server, enter SSL as 0 or 1.
      For example, jdbc:hive2://

    data
    1. <cdp-

    hub
    1. datahub-cluster-

    1-gateway.snap-tea.j7vi-6hdh.a3.cloudera.site
    1. host>:443/;ssl=1;transportMode=http;httpPath=data-hub-cluster-1/cdp-proxy-api/hive

    )
    1. Enter the Site number as 443.

    2. If the SSL is enabled on the Hive server, leave SSL as true. If the SSL is not enabled on the server, enter SSL as 0 or 1. Add example url format below after change:

    Upload the JDBC Driver
  5. Upload the Hive driver in the JDBC Driver field. You can download the latest JDBC Hive driver from here.

  6. Specify the JDBC Driver Class as com.cloudera.hive.jdbc.HS2Driver.

    Image Removedgeneric-hive-database-account.pngImage Added
  7. Click Validate. Your account should be validated successfully.