Data product feature configuration#

The data products feature is disabled by default. Before configuring data products, make sure your cluster is configured to fulfill the requirements.

Configuration properties for data products are set in the coordinator’s config.properties file. In Kubernetes, set this as additional properties.

starburst.data-product.enabled=true
data-product.starburst-jdbc-url=jdbc:trino://coordinator.example.com?SSL=true
data-product.starburst-user=alice
data-product.starburst-password=my123password
Mapping configuration properties#

Property name

Description

Default

starburst.data-product.enabled

Enable or disable data products.

false

data-product.starburst-jdbc-url

Required JDBC URL connection string of the cluster that data products is running on. Use JDBC driver parameters to configure details of the connection. For example, append SSL=true for a cluster secured with TLS, or other parameters as needed for authentication, client identification, or client tags.

data-product.starburst-user

SEP user configured as an application user to execute the data product’s interactions with SEP, such as the creation of schemas, views, and so on. The interactions with SEP impersonate the logged-in user, so make sure this user has the permission to impersonate other users.

If data products is used in conjunction with built-in access control, make sure the user or group membership for the username specified with this property is granted impersonate privileges over all SEP users of the data products tool.

We recommend keeping this user’s credentials secured using secrets, and rotate its password frequently.

data-product.starburst-password

Password for the user specified by data-product.starburst-user.

data-product.publishing-threads-count

Number of threads in the coordinator pool allocated to publishing data products. We strongly recommend using the default value unless you observe slowness in concurrent publishing jobs.

2

data-product.statistics-enabled

Enable the background task that runs once a day to calculate stats for published data products.

true

data-product.time-zone-id

The time zone id used to calculate the start of each day. The background task that calculates data product statistics runs at the start of each day in this timezone. Example value: America/New_York.

The coordinator’s system timezone