Cloudera Data Platform support#

Use the Starburst Hive connector to query Cloudera Data Platform (CDP) version 7.1 or higher.
Note
The Cloudera Data Platform support requires a valid Starburst Enterprise license.
Requirements#
The Starburst Hive connector can query the Cloudera Data Platform (CDP), available as version 7.x. It also supports the predecessor Cloudera Distributed Hadoop (CDH) platform, available in version 6.x. Support and compatibility vary based on the version you use, and is detailed in the following table:
Cloudera version |
345-e and higher |
---|---|
CDP 7.x |
Yes |
CDH 6.x |
Yes, see details in following sections |
CDH 5.x and lower |
No |
The following details apply for CDH 6.x users:
reading tables and data files created by CDH 6.x is supported
transactional table usage is not supported
CDH 6.x Hive cannot read ORC files created by SEP, due to the behavior of the included Hive version
Configuration#
Edit your catalog properties file using the Hive connector
Set the metastore to use
thrift-cdp7
when using CDP 7, andthrift
for older versions.Configure the URI to point to your Hive metastore Thrift service
connector.name=hive
hive.metastore=thrift-cdp7
hive.metastore.uri=thrift://cdp-master:9083
SQL support#
When using the Starburst Hive connector with Cloudera Data Platform, the following aspects apply in addition to the available SQL support:
Reading data#
CDP support includes read operations on the following tables:
compacted tables
bucketed tables
partitioned tables
unpartitioned tables
The following file formats can be read:
Avro
CSV
ORC ACID
Parquet
RCFile
Writing data#
Write operations, such as INSERT, DELETE and UPDATE, on ORC ACID tables are not supported.