Tableau Cloud #

Tableau Cloud was formerly known as Tableau Online.

Before you begin #

Determine the connection information for your cluster, including its network name, listening port (or default port), and your login credentials.

For Starburst Galaxy clusters, use the Connection info pane to obtain the connection information.

Tableau data connector #

Tableau Cloud comes with the Starburst Enterprise by Starburst data connector included for every customer. This option can be used to connect to both Starburst Galaxy and SEP clusters, with a username and password as the authentication method. For SEP, 354-e or later is required.

Note that a Tableau data connector is not the same as a data source connector in SEP.

Connect to Starburst #

To establish a connection to Starburst Galaxy or SEP, do the following:

  1. Log in to Tableau Cloud. Click Create, then select Workbook. In the Connect to Data pane, click the Connectors tab.

  2. In the list of server types, select Starburst Enterprise by Starburst. This opens a Connector Details dialog that describes the data connector.

  3. Click Install. This opens a connection dialog:

      Tableau Connector login dialog

  4. Fill in the connection dialog with the following parameters:

    Field Value
    Server Hostname or IP address of your cluster.
    Port Port used by your cluster.
    Authentication Use the drop-down list to select among three options: Username for a cluster with no authentication, Username and Password for a cluster with authentication enabled.
    Username Your username for Starburst Galaxy or SEP.
    Password (If selected) the password for the specified Username.
    Require SSL Select this checkbox if your cluster has any authentication enabled. When selected, the following field appears.
    SSL Verification Select an entry in the drop-down list to specify how rigorously the server’s certificate is to be validated.

    The values for SSL Verification have the following meanings:

    FULL
    Confirm that the certificate's validity is chained all the way back to a root Certificate Authority (CA).
    CA
    Confirm that the certificate is valid as far back as the included intermediate CA.
    NONE
    Confirm that the server's certificate matches the DNS name and private key of the server.
  5. When the connection is made, the Connections panel shows a list of the catalogs configured in your cluster.

      Tableau Connector list of catalogs

  6. To see the available tables, select a catalog, then a schema from that catalog.

      Tableau list of tables

  7. Select a table, or select multiple tables and specify relationships or joins.

Starburst advantage #

Remember that Starburst Enterprise and Starburst Galaxy are not databases. They are SQL query engines that can connect to multiple data sources at the same time. Each cluster can query multiple catalogs in a wide range of different data sources.

Although Tableau is typically configured to access one specific catalog and schema, it is possible to query more than one data source with a single Tableau connection.

To query multiple catalogs, select New Custom SQL and then reference the fully-qualified name of any table in the cluster using the full catalog.schema.table syntax.

The following example query accesses four catalogs: postgresql, hive, mysql, and sqlserver.

SELECT c.custkey
  , c.state
  , c.estimated_income
  , cp.customer_segment
  , a.cc_number
  , pp.cc_type
  , a.mortgage_id
  , a.auto_loan_id
FROM postgresql.burst_bank.customer c
JOIN hive.burst_bank.account a on c.custkey = a.custkey
JOIN mysql.burst_bank.product_profile pp on a.custkey = pp.custkey
JOIN sqlserver.burst_bank.customer_profile cp on c.custkey = cp.custkey
WHERE c.country = 'US'
AND c.state NOT IN ('AA', 'AE', 'AP')

This approach is faster because all table access is managed by Starburst on the Starburst Galaxy or SEP cluster, and takes advantage of Starburst query optimization.

Tableau is also capable of joining data from multiple data sources, but this approach creates an unnecessary workload on Tableau, and can negatively impact Tableau performance.