SnapLogic Glossary
Top
This article explains the SnapLogic concepts and terminology as they pertain to the SnapLogic Intelligent Integration Platform (IIP).
A
Access Token
An authentication token that SnapLogic uses to access an API.
API Consumer
The API consumer user is a user that may or may not have a SnapLogic account, but is primarily interested in consuming APIs that are managed through the SnapLogic API Gateway. These users interface with the Developer Portal.
API Meter
A tool in SnapLogic Manager that provides information on the API limits for your Org and API usage reports.
Asset
Any resource that can be created by a SnapLogic user and is tracked in SnapLogic Manager. Accounts, Files, Tasks, Pipelines, Snap Packs, Snaplexes, and Tables qualify as Assets. In Manager, you can search for your Assets, and depending on the Asset type, view the details, statistics, or activity logs.
AutoLink
A feature that simplifies data mapping by suggesting Input Schema mapping to the Target Schema in the Mapper Snap.
AutoLink History Cleanup
Cleans up the AutoLink history.
B
Binary Data
A file that contains data in binary format and a header that contains the metadata for the binary content. For example, a file with JSON data might have a header with a content-type field set to application/json.
C
Canvas
The area in SnapLogic Designer where you can design, render, validate, and execute integration Pipelines.
Categorical Dataset
Data points that represent characteristics, groups, or categories. The data may be numeric. For example, Gender and Marital Status.
CC User
CC user is a SnapLogic system user, such as cc+<org_name>@snaplogic.com, which is automatically created for use by Snaplex instances for registering with the SnapLogic control plane.
Cloudplex
A Snaplex that runs inside the SnapLogic Cloud. A Cloudplex is managed by SnapLogic.
Clustering
Applicable to SnapLogic Data Science, clustering involves grouping a set of objects in a way that objects in a group are more similar to each other than objects in other groups.
Control Plane
The back-end management mechanism for the SnapLogic IIP. The Control Plane runs on the cloud as a multi-tenant service, stores Pipeline definitions and metadata information, while also managing triggered and scheduled Pipeline executions.
D
Data Catalog
A collection of relational tables that stores database metadata and allows you to query and update attributes in the relational tables in the SnapLogic platform.
Data Lineage
A new service within the SnapLogic Dashboard as a separate subtab that enables you to visualize the relationship between your Pipelines and their data sources. When you fetch the data lineage for a target data type, the Data Lineage displays the data sources to which a Pipeline connects as a diagram.
Data Plane
The data processing engine of the SnapLogic IIP. The Data Plane that runs in the SnapLogic Cloud or in a customer data center (if on-premises), executes the Pipelines and monitors the JCC nodes that handle the Pipeline executions.
Data Preview
A sample display of the data passing through the Pipeline per Snap in two views: tabular and visual. You can access the Data Preview by clicking on the connector for a Snap that shows a text-like icon in the circular connector.
Data Visualization
A feature of Data Preview in Pipelines that enables you to view your data in a variety of charts. You can generate pie, line, bar, and scatter charts from the given data.
Disabled Snap
A Snap that neither executes or validates. You can use disabled Snaps when building and testing Pipelines. In a Pipeline, all Snaps downstream from a Disabled Snap do not execute or validate.
Downstream Snap
The succeeding Snap of any given Snap in a Pipeline. The input of the downstream Snap is the output of the given Snap.
E
Execute Pipeline
A Pipeline execution operation that executes the Pipeline completely. This mode of execution impacts databases and/or endpoints.
F
Feature
Applicable to SnapLogic Data Science, Feature is a measurement of the data point. For example, height, width, mean, mode.
FeedMaster
A type of node, which is a Java Virtual Machine (JVM) installed on a Linux or a Windows server. The FeedMaster is an interface between the node executing Ultra Pipelines and the clients used for Ultra Pipelines.
Field Snap Pack
A collection of private Snap Packs built by SnapLogic to meet specific use cases of customers. Field Snap Packs may not be visible in the Snap catalog, they may or may not be documented, usually are not tested by QA, and are not supported/maintained by our Development team.
G
Groundplex
A Snaplex that runs behind a firewall, provides on-premises connectivity at a customer site, and is administered by the customer.
H
Health Dashboard
A tab in SnapLogic Dashboard that lets you monitor the health of your Snaplex instances and Pipelines.
I
IIP
The SnapLogic Intelligent Integration Platform. A multi-tenant administration console that empowers you to control and monitor the performance of integration Pipelines as well as user permissions, projects, and security settings.
Infobox
The dialog of any Snap where you configure the Snap settings. Infoboxes typically have tabs for Settings, Views, Info, and Accounts associated with the concerned SnapInfo-boxes typically have tabs for , , , and associated with the concerned Snap.
Insights Wall
A tab in SnapLogic Dashboard that displays view time series data and logs. The Insights tab also retrieves logs from the runtime database to generate user activity graphs.
Integration Assistant
A SnapLogic suggestion engine that recommends the most likely preceding and succeeding Snaps in a Pipeline. The recommendation is based on the usage of successor and predecessor Snaps and historical frequency.
J
JCC (Java Component Container)
The JCC is a node on the Data Plane that handles Pipeline executions. A Snaplex can contain multiple JCC nodes.
K
k-fold Cross Validation
Applicable to SnapLogic Data Science Snaps, k-fold Cross Validation is a statistical method used to estimate the performance of machine learning algorithms. The dataset is separated into k equal-sized chunks and the model is trained on k-1 chunks and evaluated on the last chunk. This process is repeated k-times so every data point has a chance to be in the test chunk once. For example, you could break the dataset into 10 chunks and train the model on nine of the ten chunks and evaluate the updated model on the tenth chunk. You could repeat this process ten times using each chunk until each data chunk has been used to evaluate–and train–the updated model.
Key Passphrase
A word or phrase that protects private key files from unauthorized users encrypting/decrypting the private keys.
L
Latest Snaps
A patch release of a Snap or Snap Pack, done on top of the prior GA release. For example, you might configure a Developer Org to always have the Latest Snaps to test new features. But, for a Production Org, you might choose Stable Snaps to ensure everything works as expected.
M
Max-in-flight
Applicable to Ultra Pipelines, the maximum number of documents that can be processed by an instance at a given time.
Model (Statistical)
Applicable to SnapLogic Data Science Snaps, a model is a mathematical formula that has been generated by a machine learning algorithm based on the dataset. The model can be stored and used to give a prediction for the data in the future.
N
Node
A Snaplex node is one of the virtual machines or a set of computing resources that process data. In SnapLogic documentation, we qualify a node based on context. Snaplex node is the most generic reference, and often introduces contexts that involve the following distinctions:
- Groundplex/Cloudplex nodes: Groundplex nodes are configured and maintained by a customer, whereas Cloudplex nodes are hosted by SnapLogic. When documentation content is specific to Groundplex users, references to Groundplex nodes indicate that the content is intended for Groundplex users. Likewise, references to Cloudplex nodes are intended for Cloudplex users.
- JCC/FeedMaster node: The distinction here is of type. A JCC node executes Pipelines, and a FeedMaster node is the queue broker for running Ultra Pipeline Tasks. When the distinction is relevant, both terms typically appear in the documentation.
Numerical Dataset
A dataset whose data points represent mathematical values. For example, height, weight, age.
O
On-premises URL
A URL that can be used to trigger a task to run on a Groundplex without making a request outside the local network.
Organization (Org)
An entity created by SnapLogic used by a customer site administrator for provisioning user accounts and groups. An Org admin manages users, groups, permissions, and other access settings within the UI.
Org Admin
An administrative user defined for a company within the SnapLogic application. This user can create users and assign permissions within the organization, create Project spaces and may create projects. Organization Admin is also referred to as an Org admin.
Override URL
Similar to the On-premises URL, the Override URL can trigger a task to execute on a Groundplex. The Override URL ignores the Task's configured Snaplex and instead executes on the Snaplex that receives the request.
P
Pattern
A reusable template for creating Pipelines, typically created to address common use cases. Patterns enable you to easily configure the properties of Snaps in the pattern.
Patterns Catalog
Enables you to access all the SnapLogic Pattern Pipelines that are available to you. Pattern Pipelines are grouped by Project Spaces and Projects.
PCA
Applicable to SnapLogic Data Science, PCA is a dimension-reduction technique that can be used to reduce a large set of variables to a small set that still contains most of the information in the original set.
Pipeline
A chain of one or more Snaps linked together to orchestrate a data flow between endpoints.
Pipelines Catalog
Enables you to access all the Pipelines available to you. Pipelines are grouped by Project Spaces and Projects.
Pipeline Recommendation
Enables Iris to suggest probable Pipelines based on the existing Pipelines from your Org to which you have access, Pattern Pipelines from the SnapLogic Cloud Pattern Catalog, and synthesized Pipelines.
Pipeline Wall
A tab in SnapLogic Dashboard that displays Pipeline execution statistics and details.
Platform
A set of HTML file-based and cloud-based applications that are delivered as a multi-tenant cloud service. These applications allow you to design, manage, and monitor integrations.
Project
Stores all SnapLogic Assets: Accounts, Files, Tasks, Pipelines, Snap Packs, Tables, Proxies, and Snaplexes.
Project Space
An additional level above projects to help you better organize your Pipelines. Project Space consists of one or more projects and contains a shared project to maintain assets shared between the projects of that Project Space.
R
Record Linkage
The task of finding records that refer to the same entity across different data sources. Record linkage is necessary when joining data from different sources, based on entities that may not share a common identifier (such as database key or URI).
Refresh Token
A credential that SnapLogic uses to acquire new access tokens after the expiry of an existing access token. The lifetime of a refresh token is longer compared to the lifetime of an access token.
Resumable Pipeline
A Pipeline that has the capability to be suspended when an endpoint fails during the processing of its documents. You can troubleshoot the suspended Pipeline on the SnapLogic Dashboard and resume the Pipeline when the endpoint is up again.
S
Scheduled Task
A method to execute your Pipelines in a fixed schedule.
Score
An AutoLink feature that indicates the confidence level of a match. For example, a score of 5 stars is a match in which the model has high confidence.
Score Threshold
An AutoLink feature that filters suggestions based on the score.
Seed Value
A numeric value that is used by the randomizer in identifying the set of a sampled or shuffled dataset in machine learning algorithms. Using the same seed value helps you generate the same sample or shuffling order.
SLDB (SnapLogic DataBase Service)
An internal component of the SnapLogic Control Plane.
SLFS (SnapLogic File System)
A sub-component of SLDB that allows you to upload and download files to and from SnapLogic IIP, such as when you use File Reader and File Writer Snaps in your Pipelines.
SLPROPZ file
A file that contains the configuration details of a Snaplex node.
SNC (Secure Network Communications)
A proprietary security mechanism that SAP has developed to integrate SAP NetWeaver Single Sign-On or an external security product with SAP systems. With SNC, you receive application-level, end-to-end security. All communication that takes place between two SNC-protected components is secured (for example, between the SAP GUI for Windows and the application server). See Secure Network Communications (SNC) in the SAP Help Portal for more information.
Snap
An object in the SnapLogic application that introspects the source and target and performs a single complete function, such as read, write, or act on data.
Snaplex
The data processing engine used for Pipeline execution. A Snaplex contains multiple nodes and can be a Cloudplex or a Groundplex.
Snaplex Slot
An abstraction used to quantify the resource requirements/usage for a Pipeline. Each Snap in a Pipeline consumes a slot during execution and each slot is a running thread on a Snaplex node.
Snaplex Dashboard
A tab in SnapLogic Dashboard that lets you view Snaplex statistics.
Snap Pack
A logical grouping of Snaps based on functionality or an application endpoint.
SnapLogic Cloud Pattern Catalog
An extensive library of Pipeline patterns built and maintained by SnapLogic.
Snaps Catalog
The tab in SnapLogic Designer that contains the list of available Snaps and the version of the Snap Pack. The Snaps Catalog also enables you to search for Snaps.
SnapLogic Expressions
Used in Snaps to manipulate data. In Standard mode, the expressions are based on a subset of JavaScript and act accordingly.
SnapLogic Data Science
A self-service SnapLogic platform for end-to-end machine learning that offers a low-code approach to data acquisition, data exploration and preparation, and model training and validation.
SnapLogic Dashboard
A SnapLogic reporting interface that provides information on Pipeline runs and Snaplex instances.
SnapLogic Designer
SnapLogic user interface where you can select Snaps from the available Snap Packs, build and manage Pipelines, and choose from Pipeline Patterns.
SnapLogic Manager
The administration console for SnapLogic from where you can manage users, projects, assets, security settings, and view account and Snap statistics.
Stable Snaps
The most stable set of Snaps as recommended by SnapLogic. Latest Snaps may or may not work properly immediately after a release. For example, for a Production Org, you might choose Stable Snaps to ensure everything works as expected. But you might configure a Developer Org to always have the Latest Snaps to test new features.
Standard Mode Pipeline
The default Pipeline type. All regular Snaps can be used in this Pipeline.
Stratified Sampling
A machine learning sampling technique that results in the same number of data points from each group. For example, if we have 100 people from 10 countries, and we implement stratified sampling at 10%, the result will contain one person from each country.
Subscription Feature
SnapLogic Organization-level features that you can enable by contacting your SnapLogic representative. Some current subscription features are Native API Manager, Resumable Pipelines, and Ultra Pipeline Tasks.
Supervised learning
Applicable to SnapLogic Data Science, it is the task of inferring a function from labeled training data. The training data consists of a set of training examples.
Suspended Pipeline
A Resumable Pipeline whose endpoint has failed or is offline.
Synthesized Pipeline
A Pipeline that is generated using Pipeline metadata and existing Snaps on the Canvas. Iris provides synthesized Pipelines as a part of the list of suggested Pipelines provided by Pipeline Recommendation.
SQL Expressions
A combination of one or more values, operators and SQL functions that evaluate to a value.
T
Table
A type of Asset that you can create in Manager or through a Pipeline with the Catalog Insert Snap. You use this Asset to collect metadata and schema information about the source data being run through your Pipelines. You can modify this data by editing your Tables in Manager.
Triggered Task
A method to execute your Pipelines through an HTTP request. You can also feed input data into a Pipeline and receive the output using an HTTP request.
U
Unsupervised learning
Applicable to SnapLogic Data Science, it is a technique in which you can draw inferences from datasets consisting of data without labeled responses.
Upstream Snap
The preceding Snap of any given Snap in a Pipeline. The output of the upstream Snap is the input of the given Snap.
Ultra Pipeline
Ultra mode enables a Pipeline to continuously consume documents from external sources. Typically, the Ultra Pipelines are used for tasks that are not compatible with triggered tasks or require low-latency processing of their inputs. By default, only one document is supported as input and output.
V
Validate Pipeline
A Pipeline execution operation that executes the Pipeline partially. In Read-type Snaps such as MySQL Select, a preview dataset is also generated. The maximum number of records in a preview dataset is 50. This mode of execution does not impact databases and/or endpoints but is useful in ensuring that a Pipeline is configured correctly.
Virtual Columns
Columns in a table whose values are generated based on a formula are virtual columns.
Variance
Applicable to SnapLogic Data Science, variance indicates how spread out the data (numbers) is in a dataset. In simple terms, variance represents the distribution of data in a dataset.
W
Weighted Stratified Sampling
Applicable to SnapLogic Data Science, it is a sampling technique similar to Stratified Sampling, except that the ratio of data points per group is maintained.
Workbench
The portion of SnapLogic Designer that consists of the canvas, menus, and toolbars.
Workspace
The user session that you log into.