Skip to end of banner
Go to start of banner

SnapLogic Glossary

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 73 Current »

Top

This article explains the SnapLogic concepts and terminology as they pertain to the SnapLogic Intelligent Integration Platform (IIP).

A

Access Token

An authentication token that SnapLogic uses to access an API.

API Consumer

The API consumer user is a user that may or may not have a SnapLogic account, but is primarily interested in consuming APIs that are managed through the SnapLogic API Gateway. These users interface with the Developer Portal.

API Meter

A tool in SnapLogic Manager that provides information on the API limits for your Org and API usage reports

Asset

Any resource that can be created by a SnapLogic user and is tracked in SnapLogic Manager. Accounts, Files, Tasks, Pipelines, Snap Packs, Snaplexes, and Tables qualify as Assets. In Manager, you can search for your Assets, and depending on the Asset type, view the details, statistics, or activity logs.

A feature that simplifies data mapping by suggesting Input Schema mapping to the Target Schema in the Mapper Snap.

AutoLink History Cleanup

Cleans up the AutoLink history.

Back to top

B

Binary Data

A file that contains data in binary format and a header that contains the metadata for the binary content. For example, a file with JSON data might have a header with a content-type field set to application/json.

Back to top

C

Canvas

The area in SnapLogic Designer where you can design, render, validate, and execute integration Pipelines.

Categorical Dataset

Data points that represent characteristics, groups, or categories. The data may be numeric. For example, Gender and Marital Status.

CC User

CC user is a SnapLogic system user, such as cc+<org_name>@snaplogic.com, which is automatically created for use by Snaplexes for registering with the control plane.

Cloudplex

A Snaplex that runs inside the SnapLogic Cloud. A Cloudplex is managed by SnapLogic.

Clustering

Applicable to SnapLogic Data Science, clustering involves grouping a set of objects in a way that objects in a group are more similar to each other than objects in other groups.

Control Plane

The back-end management mechanism for the SnapLogic IIP. The Control Plane runs on the cloud as a multi-tenant service, stores Pipeline definitions and metadata information, while also managing triggered and scheduled Pipeline executions.

Back to top

D

Data Catalog

A collection of relational tables that stores database metadata and allows you to query and update attributes in the relational tables in the SnapLogic platform.

Data Lineage

A new service within the SnapLogic Dashboard as a separate subtab that enables you to visualize the relationship between your Pipelines and their data sources. When you fetch the data lineage for a target data type, the Data Lineage displays the data sources to which a Pipeline connects as a diagram.

Data Plane

The data processing engine of the SnapLogic IIP. The Data Plane that runs in the SnapLogic Cloud or in a customer data center (if on-premises), executes the Pipelines and monitors the JCC nodes that handle the Pipeline executions.

Data Preview

A sample display of the data passing through the Pipeline per Snap in two views: tabular and visual. You can access the Data Preview by clicking on the connector for a Snap that shows a text-like icon in the circular connector.

Data Visualization

A feature of Data Preview in Pipelines that enables you to view your data in a variety of charts. You can generate pie, line, bar, and scatter charts from the given data.

Disabled Snap

A Snap that neither executes or validates. You can use disabled Snaps when building and testing Pipelines. In a Pipeline, all Snaps downstream from a Disabled Snap do not execute or validate.

Downstream Snap

The succeeding Snap of any given Snap in a Pipeline. The input of the downstream Snap is the output of the given Snap.

Back to top

E

eXtreme

Extends the SnapLogic Intelligent Integration Platform (IIP) to Big Data as a Service (BDaaS). eXtreme provides capabilities for building powerful Spark-based Pipelines and managing clusters like Amazon EMR. You can use Spark SQL 2.x, eXtreme Script, and eXtreme SnowFlake Snap Packs to build eXtreme Pipelines.

eXtremeplex

A Snaplex that you use in a Big Data environment to process data at scale in cloud infrastructure, such as EMR systems. eXtremeplex is supports only a cloud version.

eXtreme Mode Pipeline

A Pipeline type that runs on an eXtremeplex and uses Spark SQL 2.x Snaps. You can view the supported Snaps in the Snap Catalog after selecting eXtreme as the Pipeline type.

Execute Pipeline

A Pipeline execution operation that executes the Pipeline completely. This mode of execution impacts databases and/or endpoints.

Back to top

F

Feature

Applicable to SnapLogic Data Science, Feature is a measurement of the data point. For example, height, width, mean, mode.

FeedMaster

A type of node, which is a Java Virtual Machine (JVM) installed on a Linux or a Windows server. The FeedMaster is an interface between the node executing Ultra Pipelines and the clients used for Ultra Pipelines.

Field Snap Pack

A collection of private Snap Packs built by SnapLogic to meet specific use cases of customers. Field Snap Packs may not be visible in the Snap catalog, they may or may not be documented, usually are not tested by QA, and are not supported/maintained by our Development team.

Back to top

G

Groundplex

A Snaplex that runs behind a firewall, provides on-premises connectivity at a customer site, and is administered by the customer. 

Back to top

H

Health Wall

A tab in SnapLogic Dashboard that lets you monitor the health of Snaplexes and Pipelines.

Hybrid Scaling

An eXtremeplex option for cluster management that selects both EMR On Demand and Spot Instance types to optimize cost, performance, and reliability when executing a Pipeline.

Back to top

I

IIP

The SnapLogic Intelligent Integration Platform. A multi-tenant administration console that empowers you to control and monitor the performance of integration Pipelines as well as user permissions, projects, and security settings.

Infobox

The dialog of any Snap where you configure the Snap settings. Infoboxes typically have tabs for Settings, Views, Info, and Accounts associated with the concerned Snap.

Insights Wall

A tab in SnapLogic Dashboard that displays view time series data and logs. The Insights tab also retrieves logs from the runtime database to generate user activity graphs.

Instance type

The Amazon Elastic Compute Cloud (EC2) Instance Type that you specify while creating an eXtremeplex. Instance Type lets you choose the appropriate mix of resources like CPU, memory, storage, and networking capacity for your applications.

Integration Assistant

A SnapLogic suggestion engine that recommends the most likely preceding and succeeding Snaps in a PipelineThe recommendation is based on the usage of successor and predecessor Snaps and historical frequency.

Back to top

J

JCC (Java Component Container)

The JCC is a node on the Data Plane that handles Pipeline executions. A Snaplex can contain multiple JCC nodes.

Back to top

K

k-fold Cross Validation

Applicable to SnapLogic Data Science Snaps, k-fold Cross Validation is a statistical method used to estimate the performance of machine learning algorithms. The dataset is separated into k equal-sized chunks and the model is trained on k-1 chunks and evaluated on the last chunk. This process is repeated k-times so every data point has a chance to be in the test chunk once. For example, you could break the dataset into 10 chunks and train the model on nine of the ten chunks and evaluate the updated model on the tenth chunk. You could repeat this process ten times using each chunk until each data chunk has been used to evaluate–and train–the updated model.

Key Passphrase

A word or phrase that protects private key files from unauthorized users encrypting/decrypting the private keys.

Back to top

L

Latest Snaps

A patch release of a Snap or Snap Pack, done on top of the prior GA release. For example, you might configure a Developer Org to always have the Latest Snaps to test new features. But, for a Production Org, you might choose Stable Snaps to ensure everything works as expected.

Back to top

M

Max-in-flight

Applicable to Ultra Pipelines, the maximum number of documents that can be processed by an instance at a given time.

Model (Statistical)

Applicable to SnapLogic Data Science Snaps, a model is a mathematical formula that has been generated by a machine learning algorithm based on the dataset. The model can be stored and used to give a prediction for the data in the future.

Back to top

N

Node

A Snaplex node is one of the virtual machines or set of computing resources that process data. In SnapLogic documentation, we qualify a node based on context. Snaplex node is the most generic reference, and often introduces contexts that involve the following distinctions:

  • Groundplex/Cloudplex nodes: Grounplex nodes are customer configured and maintained, whereas Cloudplex nodes are hosted by SnapLogic. When documentation content is specific to Groundplex users, references to Groundplex nodes indicate that the content is intended for Groundplex users. Likewise, references to Cloudplex nodes are intended for Cloudplex users.
  • JCC/FeedMaster Node: The distinction here is of type. A JCC node executes Pipelines, and a FeedMaster node is the queue broker for running Ultra Pipeline Tasks. When the distinction is relevant, both terms typically appear in the documentation. 

Numerical Dataset

A dataset whose datapoints represent mathematical values. For example, height, weight, age.

Back to top

O

On-premises URL

A URL that can be used to trigger a task to run on a Groundplex without making a request outside the local network.

Organization (Org)

An entity created by SnapLogic used by a customer site administrator for provisioning user accounts and groups. An Org admin manages users, groups, permissions, and other access settings within the UI.

Org Admin

An administrative user defined for a company within the SnapLogic application. This user can create users and assign permissions within the organization, create Project spaces and may create projects. Organization Admin is also referred to as an Org admin.

Override URL

Similar to the On-premises URL, the Override URL can trigger a task to execute on a Groundplex. The Override URL ignores the Task's configured Snaplex and instead executes on the Snaplex that receives the request. 

Back to top

P

Pattern

A reusable template for creating Pipelines, typically created to address common use cases. Patterns enable you to easily configure the properties of Snaps in the pattern.

Patterns Catalog

Enables you to access all the SnapLogic Pattern Pipelines that are available to you. Pattern Pipelines are grouped by Project Spaces and Projects.

PCA

Applicable to SnapLogic Data Science, PCA is a dimension-reduction technique that can be used to reduce a large set of variables to a small set that still contains most of the information in the original set. 

Pipeline

A chain of one or more Snaps linked together to orchestrate a data flow between endpoints.

Pipelines Catalog

Enables you to access all the Pipelines available to you. Pipelines are grouped by Project Spaces and Projects.

Pipeline Recommendation

Enables Iris to suggest probable Pipelines based on the existing Pipelines from your Org to which you have access, Pattern Pipelines from the SnapLogic Cloud Pattern Catalog, and synthesized Pipelines.

Pipeline Wall

A tab in SnapLogic Dashboard that displays Pipeline execution statistics and details.

Platform

A set of HTML file-based and cloud-based applications that are delivered as a multi-tenant cloud service. These applications allow you to design, manage, and monitor integrations.

Project

Stores all SnapLogic Assets: Accounts, Files, Tasks, Pipelines, Snap Packs, Tables, Proxies, and Snaplexes.

Project Space

An additional level above projects to help you better organize your Pipelines. Project Space consists of one or more projects and contains a shared project to maintain assets shared between the projects of that Project Space.

Back to top

R

Record Linkage

The task of finding records that refer to the same entity across different data sources. Record linkage is necessary when joining data from different sources, based on entities that may not share a common identifier (such as database key or URI).

Refresh Token

A credential that SnapLogic uses to acquire new access tokens after the expiry of an existing access token. The lifetime of a refresh token is longer compared to the lifetime of an access token.

Resumable Pipeline

A Pipeline that has the capability to be suspended when an endpoint fails during the processing of its documents. You can troubleshoot the suspended Pipeline on the SnapLogic Dashboard and resume the Pipeline when the endpoint is up again.

Back to top

S

Scheduled Task

A method to execute your Pipelines in a fixed schedule.

Score

An AutoLink feature that indicates the confidence level of a match. For example, a score of 5 stars is a match in which the model has high confidence.

Score Threshold

An AutoLink feature that filters suggestions based on the score.

Seed Value

A numeric value that is used by the randomizer in identifying the set of a sampled or shuffled dataset in machine learning algorithms. Using the same seed value helps you generate the same sample or shuffling order.

SLDB (SnapLogic DataBase Service)

An internal component of the SnapLogic Control Plane.

SLFS (SnapLogic File System)

A sub-component of SLDB that allows you to upload and download files to and from SnapLogic IIP, such as when you use File Reader and File Writer Snaps in your Pipelines.

SLPROPZ file

A file that contains the configuration details of a Snaplex node.

SNC (Secure Network Communications) 

A proprietary security mechanism that SAP has developed to integrate SAP NetWeaver Single Sign-On or an external security product with SAP systems. With SNC, you receive application-level, end-to-end security. All communication that takes place between two SNC-protected components is secured (for example, between the SAP GUI for Windows and the application server). See Secure Network Communications (SNC) in the SAP Help Portal for more information.

Snap

An object in the SnapLogic application that introspects the source and target and performs a single complete function, such as read, write, or act on data.

Snaplex

The data processing engine used for Pipeline execution. A Snaplex contains multiple nodes and can be a Cloudplex or a Groundplex.

Snaplex Slot

An abstraction used to quantify the resource requirements/usage for a Pipeline. Each Snap in a Pipeline consumes a slot during execution and each slot is a running thread on a Snaplex node.

Snaplex Wall

A tab in SnapLogic Dashboard that lets you view Snaplex statistics.

Snap Pack

A logical grouping of Snaps based on functionality or an application endpoint.

SnapLogic Cloud Pattern Catalog

An extensive library of Pipeline patterns built and maintained by SnapLogic.

Snaps Catalog

The tab in SnapLogic Designer that contains the list of available Snaps and the version of the Snap Pack. The Snaps Catalog also enables you to search for Snaps.

SnapLogic Expressions

Used in Snaps to manipulate data. In Standard mode, the expressions are based on a subset of JavaScript and act accordingly unless otherwise noted; in eXtreme-mode, the expressions are based on a subset of Spark SQL 2.x.

SnapLogic Data Science

A self-service SnapLogic platform for end-to-end machine learning that offers a low-code approach to data acquisition, data exploration and preparation, and model training and validation.

SnapLogic Dashboard

A SnapLogic reporting interface that provides information on Pipeline runs and Snaplexes.

SnapLogic Designer

SnapLogic user interface where you can select Snaps from the available Snap Packs, build and manage Pipelines, and choose from Pipeline Patterns. 

SnapLogic Manager

The administration console for SnapLogic from where you can manage users, projects, assets, security settings, and view account and Snap statistics.

Stable Snaps

The most stable set of Snaps as recommended by SnapLogic. Latest Snaps may or may not work properly immediately after a release. For example, for a Production Org, you might choose Stable Snaps to ensure everything works as expected. But you might configure a Developer Org to always have the Latest Snaps to test new features.

Standard Mode Pipeline

The default Pipeline type. All regular Snaps can be used in this Pipeline.

Stratified Sampling

A machine learning sampling technique that results in the same number of data points from each group. For example, if we have 100 people from 10 countries, and we implement stratified sampling at 10%, the result will contain one person from each country.

Subscription Feature

SnapLogic Organization-level features that you can enable by contacting your SnapLogic representative. Some current subscription features are Native API Manager, Resumable Pipelines, and Ultra Pipeline Tasks.

Supervised learning

Applicable to SnapLogic Data Science, it is the task of inferring a function from labeled training data. The training data consists of a set of training examples.

Suspended Pipeline

A Resumable Pipeline whose endpoint has failed or is offline.

Synthesized Pipeline

A Pipeline that is generated using Pipeline metadata and existing Snaps on the Canvas. Iris provides synthesized Pipelines as a part of the list of suggested Pipelines provided by Pipeline Recommendation.

SQL Expressions

A combination of one or more values, operators and SQL functions that evaluate to a value.

Back to top

T

Table

A type of Asset that you can create in Manager or through a Pipeline with the Catalog Insert Snap. You use this Asset to collect metadata and schema information about the source data being run through your Pipelines. You can modify this data by editing your Tables in Manager.

Triggered Task

A method to execute your Pipelines through an HTTP request. You can also feed input data into a Pipeline and receive the output using an HTTP request.

Back to top

U

Unsupervised learning

Applicable to SnapLogic Data Science, it is a technique in which you can draw inferences from datasets consisting of data without labeled responses.

Upstream Snap

The preceding Snap of any given Snap in a Pipeline. The output of the upstream Snap is the input of the given Snap.

Ultra Pipeline

Ultra mode enables a Pipeline to continuously consume documents from external sources. Typically, the Ultra Pipelines are used for tasks that are not compatible with triggered tasks or require low-latency processing of their inputs. By default, only one document is supported as input and output.  

Back to top

V

Validate Pipeline

A Pipeline execution operation that executes the Pipeline partially. In Read-type Snaps such as MySQL Select, a preview dataset is also generated. The maximum number of records in a preview dataset is 50. This mode of execution does not impact databases and/or endpoints but is useful in ensuring that a Pipeline is configured correctly.

Virtual Columns

Columns in a table whose values are generated based on a formula are virtual columns.

Variance

Applicable to SnapLogic Data Science, variance indicates how spread out the data (numbers) is in a dataset. In simple terms, variance represents the distribution of data in a dataset. 

Back to top

W

Weighted Stratified Sampling

Applicable to SnapLogic Data Science, it is a sampling technique similar to Stratified Sampling, except that the ratio of data points per group is maintained. 

Workbench

The portion of SnapLogic Designer that consists of the canvas, menus, and toolbars.

Workspace

The user session that you log into.

Back to top

  • No labels