Clusters #

A cluster in Starburst Galaxy uses Starburst Enterprise and provides the resources to run queries against numerous data sources. Clusters define the number of workers, the configuration for the JVM runtime, configured data sources, and other aspects.

The Starburst Galaxy platform (SGP) allows you to create, edit, and delete clusters from the interface. Access your clusters at any time by clicking Clusters on the left hand menu.

Add a new cluster #

Before you can create a cluster in the SGP, you need to add one or more data sources.

  1. On the Clusters page, select + New. If this is your first cluster, you can select + New cluster from the Dashboard or Clusters pages.
  2. Enter a unique Cluster name. Names can use lowercase letters, numbers, and hyphens.
  3. Select your cluster profile (review cluster profile details below):
    • General purpose
    • Compute optimized
    • Memory optimized
  4. From the Add data source(s) drop down menu, select the data sources for your cluster.
  5. Confirm your deployment settings, Starburst provides default settings that you can edit at any time:
    • Region
    • Availability zone
    • Minimum workers (nodes)
    • Maximum workers (nodes)
  6. Add tags to your cluster to organize and apply metadata. Select tags from the
    Add an existing tag drop down menu or add new ones using the Create a new tag option (select Key > add Value > click Save).
  7. Select Create cluster to finish.

You can start, stop, and edit your cluster at any time from the Clusters page.

Cluster profile overview #

Starburst Galaxy provides three cluster profiles to allow you to create a cluster that is right for your purposes. Review the cluster profile differences:

  • General purpose: Balance your compute, memory, and networking resources. Suitable for a variety of diverse workloads.
    • Instance type: m5a.8xlarge
    • vCPUs: 32
    • Memory (GiB): 128
    • Instance Storage (GiB): EBS Only
    • Network Bandwidth (Gbps): Up to 10
    • EBS Bandwidth (Mbps): 4,750
  • Compute optimized: Ideal for compute bound applications that benefit from high performance processors.
    • Instance type: c5.18xlarge
    • vCPUs: 72
    • Memory (GiB): 144
    • Instance Storage (GiB): EBS Only
    • Network Bandwidth (Gbps): 25
    • EBS Bandwidth (Mbps): 19,000
  • Memory optimized: Deliver fast performance for workloads that process large data sets in memory.
    • Instance type: r5.8xlarge
    • vCPUs: 32
    • Memory (GiB): 256
    • Instance Storage (GiB): EBS Only
    • Network Bandwidth (Gbps): 10
    • EBS Bandwidth (Mbps): 6,800