Glossary #

Terms A-E #
Amazon AWS marketplace #

A provider for all aspects of the required infrastructure. This includes AWS CloudFormation for provisioning, Amazon Simple Storage Service (S3) for storage, Amazon Machine Images (AMI) and Amazon Elastic Compute Cloud (EC2) for compute, AWS Glue as the metadata catalog, and others. For more information, see Amazon AWS Marketplace.

Bare-metal server #

A physical computer server dedicated to a single tenant. See bare-metal server.

Catalog #

A catalog defines and names the configuration used to connect to and query a data source. For more information, see Catalogs.
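As a rough illustration of what such a configuration can look like, the sketch below parses the text of a catalog properties file and reads out which connector it names. The catalog name `sales`, the host name, and the user are hypothetical placeholders, and the helper `parse_properties` is written for this example only; it is not part of SEP.

```python
# Hypothetical contents of a catalog file such as etc/catalog/sales.properties.
# The catalog name ("sales") and all connection values are illustrative assumptions.
CATALOG_NAME = "sales"
CATALOG_PROPERTIES = """\
# connector.name selects the connector used by this catalog
connector.name=postgresql
connection-url=jdbc:postgresql://db.example.com:5432/sales
connection-user=report_reader
"""


def parse_properties(text: str) -> dict[str, str]:
    """Parse simple key=value lines, skipping blank lines and # comments."""
    props: dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


catalog = parse_properties(CATALOG_PROPERTIES)
print(f"catalog {CATALOG_NAME!r} uses connector {catalog['connector.name']!r}")
```

The file name supplies the catalog name, and the `connector.name` property ties the catalog to a connector. The remaining properties depend on the connector in use.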

Cluster #

A cluster provides the resources to run queries against numerous data sources. Clusters define the number of workers, the configuration for the JVM runtime, configured data sources, and other aspects. For more information, see cluster basics.

Connector #

Transforms the underlying data into the SEP concepts of schemas, tables, columns, rows, and data types. Connectors are specific to the data source they support, and are named as properties in catalogs.
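Because a connector presents every data source as schemas and tables, a table can be addressed by a three-part name of the form catalog, schema, table. The sketch below splits such a name and builds a query string from it. The name `sales.public.orders` is a made-up example, and the helper deliberately ignores quoted identifiers that contain dots.

```python
from typing import NamedTuple


class TableName(NamedTuple):
    """The three levels a table name is resolved through."""
    catalog: str
    schema: str
    table: str


def parse_table_name(qualified: str) -> TableName:
    """Split 'catalog.schema.table' into its parts.

    Simplification: quoted identifiers containing dots are not handled.
    """
    parts = qualified.split(".")
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected catalog.schema.table, got {qualified!r}")
    return TableName(*parts)


# "sales.public.orders" is a hypothetical table used only for illustration.
name = parse_table_name("sales.public.orders")
query = f"SELECT * FROM {name.catalog}.{name.schema}.{name.table} LIMIT 10"
print(query)
```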

Container #

A lightweight virtual package of software that contains libraries, binaries, code, configuration files, and other dependencies needed to deploy an application. A running container does not include an operating system. It uses the operating system of the host machine, typically Linux. Learn more at Container concept in the Kubernetes documentation.


COTS #

Common off-the-shelf. Refers to commodity hardware components.

Data consumer persona #

Owns data products such as reports, dashboards, models, and the quality of analysis. For more information, see Starburst personas.

Data engineer persona #

Owns schemas and is responsible for the source data quality and ETL SLA. For more information, see Starburst personas.

Data source #

A data source is a system from which data is retrieved. In SEP, you connect to a data source by using a catalog, which then lets you query that source. See Configure and define catalogs.

External ID (AWS) #

An external ID is an identifier in AWS that is required for using Starburst Galaxy. It is used to ensure that only trusted AWS accounts are given permission to operate the Starburst Galaxy clusters based on their assigned role and trust policy. For more information on AWS Identity and Access Management, see How to use an external ID when granting access to your AWS resources to a third party.
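The general AWS mechanism can be sketched as an IAM role trust policy that allows `sts:AssumeRole` only when the caller supplies the expected external ID. The example below builds such a policy document in Python. The account ID and external ID are placeholder values, and the function name `build_trust_policy` is hypothetical; the actual values and setup steps for Starburst Galaxy come from its own documentation.

```python
import json


def build_trust_policy(trusted_account_id: str, external_id: str) -> dict:
    """Build a role trust policy that requires a matching external ID."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                # The third-party AWS account allowed to assume the role.
                "Principal": {"AWS": f"arn:aws:iam::{trusted_account_id}:root"},
                "Action": "sts:AssumeRole",
                # The role can only be assumed when this external ID is passed.
                "Condition": {"StringEquals": {"sts:ExternalId": external_id}},
            }
        ],
    }


# Placeholder values for illustration only.
policy = build_trust_policy("111122223333", "example-external-id")
print(json.dumps(policy, indent=2))
```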

Terms F-J #
Google Cloud Marketplace #

Deploy in the Google Cloud Marketplace or using the Starburst Kubernetes solution on the Google Kubernetes Engine (GKE). GKE is a secure, production-ready, managed Kubernetes service in Google Cloud for managing containerized applications. For more information, see Google Cloud Marketplace.

Terms K-O #
Marketplace #

Purchase a preconfigured set of machine images, containers, and other needed resources to run SEP on the marketplace provider's cloud hosts, under your control. See Marketplace deployments.

Microsoft Azure marketplace #

Deploy in the Azure Marketplace or using the Starburst Kubernetes solution on the Azure Kubernetes Service (AKS). AKS is a secure, production-ready, managed Kubernetes service on Azure for managing containerized applications. For more information, see Microsoft Azure Marketplace.

Terms P-T #
Platform administrator persona #

Owns platforms and services (ITIL-style). Has service SLA responsibility for the infrastructure supporting the cluster. For more information, see Starburst personas.

Presto and PrestoSQL #

Old names for Trino.

Red Hat OpenShift marketplace #

A container platform using Kubernetes operators that automates the provisioning, management, and scaling of applications on any cloud platform or on-premises. Starburst Enterprise is available on Red Hat Marketplace as of OpenShift version 4. For more information, see Red Hat Marketplace.

Starburst Enterprise platform #

Helps companies harness the value of open source Trino, the fastest distributed query engine available today. Starburst adds connectors, security, and support that meet the need for fast data access at scale. For more information, see Starburst Enterprise.


SEP #

Abbreviation of Starburst Enterprise platform. For more information, see Starburst Enterprise.


SQL #

Structured Query Language. The standard language used with relational databases. For more information, see SQL.

Trino #

Fast distributed SQL query engine for big data analytics, formerly PrestoSQL.

Virtual machine (VM) #

An emulation of the hardware of a computer system on a physical host machine, so any operating system suitable for that hardware can run in the emulator. A typical example is a Linux virtual machine running on a Windows-based host machine. See virtual machine.