Xplenty terminology

The main terms you will encounter in the Xplenty documentation include:


A connection defines the endpoint and credentials to use to connect to a data repository.


An Xplenty package is a data flow definition. It describes the data to process (location, schema, fields), the data manipulations to perform, and the output destinations (location, schema). Once a package is defined, it is run as a job on a cluster.


An Xplenty cluster is a group of machines (nodes) that is allocated exclusively to your account's users. You can create one or more clusters, and you can run one or more jobs on each cluster. A cluster that you've created remains allocated to your account until you request its termination.


An Xplenty job is a process that is responsible for running a specific package on a cluster. A job is a batch process: it processes a finite amount of data and then terminates. Several jobs can run the same package simultaneously. When you run a new job, you select the package whose workflow the job should perform and the cluster on which to run it.
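
The relationships among packages, clusters, and jobs described above can be sketched as a small data model. This is only an illustration of the terminology, not the Xplenty API: the class names, fields, and methods (Package, Cluster, Job, run_job, terminate) and the example names are all hypothetical.

```python
from dataclasses import dataclass, field
from itertools import count

# Hypothetical model of the terms above; none of this is the Xplenty API.
_job_ids = count(1)


@dataclass(frozen=True)
class Package:
    """A data flow definition (sources, transformations, destinations)."""
    name: str


@dataclass
class Job:
    """A batch process that runs one package on one cluster."""
    id: int
    package: Package
    cluster_name: str
    status: str = "running"


@dataclass
class Cluster:
    """A group of nodes allocated to one account until terminated."""
    name: str
    nodes: int
    terminated: bool = False
    jobs: list = field(default_factory=list)

    def run_job(self, package: Package) -> Job:
        # A job needs both a package and a live cluster to run on.
        if self.terminated:
            raise RuntimeError(f"cluster {self.name!r} has been terminated")
        job = Job(id=next(_job_ids), package=package, cluster_name=self.name)
        self.jobs.append(job)
        return job

    def terminate(self) -> None:
        # The cluster stays allocated until termination is requested.
        self.terminated = True


cluster = Cluster(name="example-cluster", nodes=2)
package = Package(name="example-package")

# Several jobs can run the same package on the same cluster.
first = cluster.run_job(package)
second = cluster.run_job(package)
```

In this sketch the package is immutable and shared, while each call to run_job creates a distinct job, which mirrors the statement that several jobs can run the same package simultaneously.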

Account and User

An Xplenty account represents a group of related Xplenty users (usually a company) tied to a specific provider/region. An account is created when a user signs up for the Xplenty service. Each account is linked to a region, which is where its clusters are created and its jobs execute. The region should therefore be the one where your data is located.
