Starburst Galaxy

  •  Get started

  •  Working with data

  •  Data engineering

  •  Developer tools

  •  Cluster administration

  •  Security and compliance

  •  Troubleshooting

  • Galaxy status

  •  Reference

  • Query result caching #

    Query result caching is an industry standard mechanism used to accelerate query execution. When a query is executed, the result is cached for a specified period of time. By caching the results of queries, Starburst Galaxy can rely on pre-computed results and can avoid unnecessarily processing queries if the results can be read from the cache. Retrieving a result directly from the cache substantially reduces query time.

    Query result caching is enabled by default when you create a new AWS cluster, and can be disabled in the Advanced section of the Create cluster dialog.

    In Starburst Galaxy, specify query result caching by selecting Cache query results for improved query performance under Advanced settings in the Create cluster or Edit cluster dialogs. You can make this designation either when creating a cluster or when editing an existing cluster.

    Additionally, set a duration for Cache reuse period by entering an integer and selecting Minutes or Hours from the drop-down menu. The cache reuse period must be between 5 minutes and 12 hours, inclusive. Run a query to cache its results. There is a 1MB size limit for cached results. For all subsequent runs of the same query on the same cluster within the cache reuse period, Galaxy fetches the cached results of the original query.

    Result set caching toggle

    The query history pane includes a Cached results column that shows a circled checkmark for queries run on a cluster that has result set caching enabled.