Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

You can use this Snap to convert documents to the Parquet format and write the data to HDFS, ADL (Azure Data Lake), ABFS (Azure Data Lake Blob File Storage Gen 2), WASB (Azure storage), or an S3 bucket. This Snap supports a nested schema such as LIST and MAP. You can also use this Snap to write schema information to the Catalog Insert Snap.

This Snap supports HDFS, ADL (Azure Data Lake), ABFS (Azure Data Lake Blob File Storage Gen 2), WASB (Azure storage), and S3 protocols.

...

  1. Download hadoop.dlland winutils.exe ​https://github.com/cdarlint/winutils/tree/master/hadoop-3.2.2/bin (SnapLogic’s Hadoop version is 3.2.2)

  2. Create a temporary directory.

  3. Place the hadoop.dlland winutils.exe files in this path: c:\winutils\bin

  4. Set the environment variable HADOOP_HOME to point to c:\winutils

  5. Add c:\winutils\bin to the environment variable PATH as shown below:

  6. Add the JVM options in the Windows Snaplex:jcc.jvm_options= -Djava.library.path=c:\winutils\bin

    If you already have an existing jvm_options, then add: "-Djava.library.path=C:\winutils\testbin" after the space.
    For example:jcc.jvm_options = -agentlib:jdwp=transport=dt_socket,server=y,suspend=n,address=8000 -Djava.library.path=c:\winutils\bin

  7. Restart the JCC for configurations to take effect.

...