Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

In this article

...

You can use this Snap to list the attributes of S3 objects in a given bucket matching the prefix. The attributes provided in the output document are bucket, objectKey, path, size, lastModified and owner. It internally lists all S3 objects with a given bucket name and prefix, and then applies the filter if any. Note that AWS S3 does not define any folder object and the '/' characters are part of the S3 object key names.

<overview image>

Snap Type

<Snap name> Snap is a READ/WRITE/PARSE/TRANSFORM/FLOW/FORMAT-type Snap that reads/fetches/writes/parses/executes/transforms/calls/creates…

...

Type

Format

Number of Views

Examples of Upstream and Downstream Snaps

Description

Input 

Document

  • Min: 0

  • Max: 1

  • Mapper

  • Copy

  • ..

Requires the EDI data and internal ID as a JSON document.

Output

Document

  • Min: 1

  • Max: 12

  • ..

  • ..

The EDI transaction ID and order confirmation
  • .

Error

Error handling is a generic way to handle errors without losing data or failing the Snap execution. You can handle the errors that the Snap might encounter while running the Pipeline by choosing one of the following options from the When errors occur list under the Views tab. The available options are:

  • Stop Pipeline Execution: Stops the current pipeline execution when the Snap encounters an error.

  • Discard Error Data and Continue: Ignores the error, discards that record, and continues with the rest of the records.

  • Route Error Data to Error View: Routes the error data to an error view without stopping the Snap execution.

Learn more about Error handling in Pipelines.

...

Field Name

Field Type

Field Dependency

Description

Label*

Default ValueS3 Browser
ExampleS3 Browser

String

N/A

The Specify a name for the Snap. You can modify this to be more specific, especially if you have more than one of the same Snap in your Pipeline.

Bucket

Default Value: N/A
ExampleExamples:

  • mybucket

  • mybucket@eu-west-1

  • mybucket@s3.eu-west-1.amazonaws.com (an example of <S3_bucket_name>@<s3_endpoint>)

  • testBucket@bucket.vpce-028b7814794578709-vu0vvauy.s3.us-west-2.vpce.amazonaws.com (an example of <S3_bucket>@<VPC_S3_endpoint>)

String/Expression/Suggestion

N/A

Specify or select the S3 bucket name, from where the S3 objects are to be listed.

A bucket name can be selected from the suggested list. Bucket names are unique globally and can be accessed without its region name in most cases. If you cannot access a bucket name you use cannot be accessed without its region name, the region information can be specified as in the following syntax:

<S3_bucket_name>@<region_name>

<S3_bucket_name>@<endpoint>

<endpoint> also applies to VPC (Virtual Private Cloud) S3 endpoint.

Note that, when : When you enter a wrong region name is given, and if the bucket name is valid, AWS S3 service may successfully access the bucket without any error.

 

Prefix

Default Value: [None]
Examples

  • test

  • test/

  • abc/xyz/

IntegerString/Expression/Suggestion

N/A

Specify the prefix of S3 object key names. If Bucket and Prefix are given and all other properties are at default values, the Snap lists all S3 objects matching the prefix in the bucket. An object key can be selected from the suggested list and edited for the prefix. The maximum length of the suggested list is 1,000. If Prefix is expression-enabled, it can be evaluated from an input header document as well as a pipeline parameter.

Filter

Default Value: [None]**
Examples

  • *.csv Matches an object name that represents a file name ending in .csv

  • * .* Matches object names containing a dot

  • {.csv, .json} Matches object names ending with .csv or .json

  • foo.? Matches object names starting with foo. and a single character extension

  • home/*/*Matches home/gus/data

  • home/**. Matches home/gus and home/gus/data

Integer/Expression

N/A

Specify the filter the Snap should use internally to list all S3 objects first matching given bucket and prefix. Filter is applied to the entire string of each S3 object key. Filter is a glob pattern which is defined as following:

  • The * character matches zero or more characters of an S3 object name without crossing '/' boundaries.

  • The ** characters matches zero or more characters crossing '/' boundaries.

  • The ? character matches exactly one character of a name component.

  • The backslash character (\) is used to escape characters that would otherwise be interpreted as special characters. The expression \\ matches a single backslash and "\{" matches a left brace for example.

  • The [ ] characters are a bracket expression that match a single character of a name component out of a set of characters. For example, [abc] matches "a", "b", or "c". The hyphen (-) may be used to specify a range so [a-z] specifies a range that matches from "a" to "z" (inclusive). These forms can be mixed so [abce-g] matches "a", "b", "c", "e", "f" or "g". If the character after the [ is a ! then it is used for negation so [!a-c] matches any character except "a", "b", or "c".

    Within a bracket expression the *, ? and \ characters match themselves. The (-) character matches itself if it is the first character within the brackets, or the first character after the ! if negating.

  • The { } characters are a group of subpatterns, where the group matches if any subpattern in the group matches. The "," character is used to separate the subpatterns. Groups cannot be nested.

  • Leading period and dot characters in names are treated as regular characters in match operations. For example, the "*" glob pattern matches file name ".login".

Show Advanced Properties

CheckboxN/A

Displays Advanced Properties when you select this checkbox.

Select this checkbox to display the advanced properties. Deselect the checkbox to hide the properties.

Maximum Object Keys

Default Value: All
Example: 2500

Integer/Expression

Appears on selecting Show Advanced Properties checkbox.

Maximum number of object keys at the output view. ‘All’ or an empty field produces all matching object key names.

Examples: 2500Ignore Empty Result

Default Value: All Selected

Ignore Empty Result

Checkbox

Appears on selecting Show Advanced Properties checkbox.

If selected and there is no matching S3 object key, the Snap produces no output document, and if not selected, an empty document.

Default Value: Selected

Maximum Retries*

Default Value: 3
ExamplesExample: 5

Integer/Expression

Appears on selecting Show Advanced Properties checkbox.

Specify the maximum number of retry attempts to to perform in case of a temporary network loss.

Pass Through

Default Value: Not Selected Deselected

Checkbox

Appears on selecting Show Advanced Properties checkbox.

Select this checkbox to pass the input document to the output view as a value to the 'original' key .

Delimiter


Default Value: [None]
ExamplesExample: /

Integer/Expression

Appears on selecting Show Advanced Properties checkbox.

Optional delimiter that causes to produce object keys that contain the same string between the prefix and the first occurrence of the delimiter. For example, a delimiter '/' causes the Snap to produce a non-recursive object-key listing.

Snap Execution

Default Value
Example: Validate & Execute

Dropdown listN/A

Select one of the three modes in which the Snap executes. Available options are:

  • Validate & Execute: Performs limited execution of the Snap, and generates a data preview during Pipeline validation. Subsequently, performs full execution of the Snap (unlimited records) during Pipeline runtime.

  • Execute only: Performs full execution of the Snap during Pipeline execution without generating preview data.

  • Disabled: Disables the Snap and all Snaps that are downstream from it.

Troubleshooting

Error

Reason

Resolution

Account validation failed.

The Pipeline ended before the batch could complete execution due to a connection error.

Verify that the Refresh token field is configured to handle the inputs properly. If you are not sure when the input data is available, configure this field as zero to keep the connection always open.

Examples

Excluding Fields from the Input Data Stream

...