Xplenty terminology

The main terms you will encounter in the Xplenty documentation include:

Connection

A connection defines the endpoint and credentials to use to connect to a data repository.

Package

An Xplenty package is a dataflow or a workflow definition. Dataflows describe the data to process (location, schema, fields), the data manipulations to perform, and the output destinations (location, schema).

Workflows define dependencies between tasks. For example: after dataflow A finishes successfully, run dataflow B.

Once the package is defined, it is run as a job on a cluster.
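
The workflow dependency described above ("after dataflow A finishes successfully, run dataflow B") can be pictured with a small sketch. This is only an illustration of the idea, not Xplenty's package format or API: the Task class, its fields, and the run_workflow function are hypothetical names invented for this example.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Task:
    """Hypothetical stand-in for one task in a workflow (not an Xplenty type)."""
    name: str
    run: Callable[[], bool]           # returns True if the task succeeded
    depends_on: Optional[str] = None  # name of the task that must succeed first


def run_workflow(tasks: List[Task]) -> List[str]:
    """Run tasks in list order, skipping any task whose dependency did not succeed.

    Returns the names of the tasks that were actually executed.
    """
    succeeded = set()
    executed = []
    for task in tasks:
        if task.depends_on is not None and task.depends_on not in succeeded:
            continue  # dependency failed or never ran, so skip this task
        executed.append(task.name)
        if task.run():
            succeeded.add(task.name)
    return executed


# Dataflow B runs only because dataflow A reports success.
workflow = [
    Task("dataflow A", run=lambda: True),
    Task("dataflow B", run=lambda: True, depends_on="dataflow A"),
]
executed_names = run_workflow(workflow)
```

If dataflow A's run function returned False instead, dataflow B would be skipped and only dataflow A would appear in the result.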

Cluster

An Xplenty cluster is a group of machines (nodes) that is allocated exclusively to your account's users. You can create one or more clusters, and you can run one or more jobs on each cluster. A cluster that you've created remains allocated to your account until you request to terminate the cluster.

Job

An Xplenty job is a process that is responsible for running a specific package on a cluster. The job is a batch process that processes a finite amount of data and then terminates. Several jobs can run the same package simultaneously. When you run a new job, you select the name of the package to execute, and the cluster on which to execute it.
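
As a rough mental model of how these terms relate, the sketch below shows one cluster running several jobs, two of which execute the same package. It is an assumption-laden illustration only: the Package, Job, and Cluster classes and the run_job method are hypothetical and do not correspond to Xplenty's actual API.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Package:
    """Hypothetical model of a package: a named dataflow or workflow definition."""
    name: str


@dataclass
class Job:
    """Hypothetical model of a job: one run of a package on a cluster."""
    package: Package
    cluster_name: str


@dataclass
class Cluster:
    """Hypothetical model of a cluster that can host one or more jobs."""
    name: str
    jobs: List[Job] = field(default_factory=list)

    def run_job(self, package: Package) -> Job:
        # Each call creates a new, separate job, even for the same package.
        job = Job(package=package, cluster_name=self.name)
        self.jobs.append(job)
        return job


cluster = Cluster("example-cluster")
package = Package("example-package")

# Two distinct jobs running the same package on the same cluster.
first_job = cluster.run_job(package)
second_job = cluster.run_job(package)
```

The point of the sketch is that a package is a definition, while a job is a single execution of that definition, so one package can back many jobs.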

Account and User

An Xplenty account represents a related group (usually a company) of Xplenty users connected to a specific provider/region. An account is created when a user signs up for the Xplenty service. An account is linked to a region, which is where clusters will be created and jobs will execute. The region should therefore be where your data is.
