Snap type: | Format | |||||||
---|---|---|---|---|---|---|---|---|
Description: | This Snap formats incoming documents from upstream Snaps into the Hadoop sequence file format, the native binary format used to persist intermediate data between the stages of MapReduce jobs.
| |||||||
Prerequisites: | [None] | |||||||
Support and limitations: | Works in Ultra Pipelines. | |||||||
Account: | Accounts are not used with this Snap. | |||||||
Views: |
| |||||||
Settings | ||||||||
Label | Required. The name for the Snap. You can modify this to be more specific, especially if you have more than one of the same Snap in your pipeline. | |||||||
Key | Required. JSON path to the field in the incoming document that supplies the key. Example: $input_column_name Default value: [None] | |||||||
Value | Required. JSON path to the field in the incoming document that supplies the value. Example: $input_column_name Default value: [None] | |||||||
Compression type | The sequence file compression type. In Spark mode, this value must be set when compression is required. In standard mode, it is not required. The options available include:
Default value: [None] | |||||||
Compression codec | Required in Spark mode whenever a Compression type is set. The fully qualified class name of the compression codec. Example: org.apache.hadoop.io.compress.GzipCodec | |||||||
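
The settings above can be illustrated with a minimal Python sketch. It is not the Snap's implementation: the function names `resolve_path`, `to_record`, and `check_compression` are hypothetical, the path handling assumes only simple top-level paths such as `$input_column_name`, and the string `"<type>"` stands in for whichever Compression type option is selected. The sketch shows how the Key and Value paths might pick one key/value pair out of each incoming document, and how the Spark-mode rule for Compression codec could be checked.

```python
from typing import Any, Dict, Optional, Tuple


def resolve_path(document: Dict[str, Any], path: str) -> Any:
    """Look up a simple top-level path such as '$input_column_name'.

    Assumption: only '$field' paths are handled; nested paths are out of scope.
    """
    if not path.startswith("$") or len(path) < 2:
        raise ValueError(f"expected a path like '$field', got {path!r}")
    return document[path[1:]]  # raises KeyError if the field is missing


def to_record(document: Dict[str, Any], key_path: str, value_path: str) -> Tuple[Any, Any]:
    """Build the (key, value) pair that would become one sequence file record."""
    return resolve_path(document, key_path), resolve_path(document, value_path)


def check_compression(spark_mode: bool,
                      compression_type: Optional[str],
                      compression_codec: Optional[str]) -> None:
    """Hypothetical check of the rule in the table: in Spark mode, a codec
    class name is required whenever a compression type is set."""
    if spark_mode and compression_type and not compression_codec:
        raise ValueError("Compression codec is required in Spark mode "
                         "when a Compression type is set")


if __name__ == "__main__":
    doc = {"id": 7, "name": "alice"}
    print(to_record(doc, "$id", "$name"))  # (7, 'alice')
    # "<type>" is a placeholder for the selected Compression type option.
    check_compression(True, "<type>", "org.apache.hadoop.io.compress.GzipCodec")
```

In standard mode the check passes with or without a codec, which matches the table's statement that the compression settings are not required there.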