On this page

SnapLogic Hadoop Snaps support encryption zones transparently. If the HDFS cluster is configured with "Transparent at rest Encryption", the Snaps read/write into encryption zones. 

HDFS Encryption

HDFS implements transparentend-to-end encryption. Once configured, data read from and written to special HDFS directories is encrypted and decrypted without any changes to the user application code. This encryption is end-to-end, means only the client can encrypt and decrypt the data. HDFS never stores or has access to unencrypted data or unencrypted data encryption keys. This satisfies two typical requirements for encryption: at-rest encryption (meaning data on persistent media, such as a disk) as well as in-transit encryption (such as when data is traveling over the network).

A new cluster service is required to manage encryption keys: the Hadoop Key Management Server (KMS). In the context of HDFS encryption, the KMS performs three basic responsibilities:

  1. Providing access to the stored encryption zone keys.

  2. Generating new encrypted data encryption keys for storage on the NameNode.

  3. Decrypting the encrypted data encryption keys for use by HDFS clients.

Once a KMS has been set up and the NameNode and HDFS clients have been correctly configured, use the hadoop key and hdfs crypto command-line tools to create encryption keys and set up new encryption zones.

Setting Up HDFS Encryption

Follow the documents below to setup HDFS encryption:

Once HDFS encryption is configured, following are the instructions to create/delete encryption zones.

Once encrypted zones are created, follow the instructions below to enable it to work with SnapLogic Pipelines.

https://hadoop.apache.org/docs/r2.8.0/hadoop-kms/index.html#Decrypt_Encrypted_Key
https://www.cloudera.com/documentation/enterprise/5-8-x/topics/cdh_sg_kms_security.html#concept_fgf_42v_mp
https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.1/bk_security/content/ranger-kms-admin-guide.html