Tuning your cluster performance #
Starburst Enterprise platform (SEP) is a more feature-rich version of Trino, formerly PrestoSQL, providing enhanced query performance, security, connectivity, and ease of use.
Learn how to size your cluster and the machines in it to ensure the best performance possible for your workload in this training video presented by one of our founders, Dain Sundstrom. For your convenience, we’ve divided the video training course up into topic sections, and provided links to the relevant parts of our documentation below.
General tuning strategy & baseline advice #
- Starting big
- Stabilizing, then tuning
- Options to disable
Running time: ~9 min.
Cluster sizing, and how SEP uses CPU and memory resources #
- How memory affects JOIN, GROUP BY, ORDER BY and window functions
- Availability
- Concurrency
Running time: ~19 min.
Machine sizing and its impact #
- Memory and memory allocation
- Shared join hash
- Distributed join
- Skew
- Machine sizes and types
- Spilling
- Small clusters
Running time: ~38 min.
Additional resources on resources management and spilling in SEP:
Tuning the workload #
- Query plan
- Precomputing
- Connectors
Running time: ~16 min.
Hive data organization #
- Organize your data for the Hive connector
- Hive partitioning and bucketing
- ORC and Parquet
- File size
- Bad parquet files
- Rewrite table with the ORC writer
Running time: ~16 min.
Making queries faster #
- What to look for in a query
- Using more hardware
- Underutilization
- Hive caching
Running time: ~13 min.
For more in-depth information on this topic, watch our query optimization training video.
Sharing resources, and resource groups #
- Concurrency
- User experience, expectations and satisfaction
- Social engineering
Running time: ~3 min.
Is the information on this page helpful?
Yes
No
Is the information on this page helpful?
Yes
No