Cloudera Enterprise 6.0.x | Other versions

Configuring Apache Hive in CDH

Hive offers a number of configuration settings related to performance, file layout and handling, and options to control SQL semantics. Depending on your cluster size and workloads, configure HiveServer2 memory, table locking behavior, and authentication for connections. See Configuring HiveServer2 for CDH for details about required configuration changes that you must perform.

The Hive metastore service, which stores the metadata for Hive tables and partitions, must also be configured. See Configuring the Hive Metastore for CDH for details about deployment modes, information about supported metastore databases, and specific configurations for MySQL, PostgreSQL, and Oracle.

To configure Hive to use the Amazon S3 filesystem for transient ETL jobs, see Configuring Transient Apache Hive ETL Jobs to Use the Amazon S3 Filesystem in CDH

Continue reading:

Configuring the Hive Metastore for CDH
Configuring HiveServer2 for CDH
Starting the Hive Metastore in CDH
Apache Hive File System Permissions in CDH
Starting, Stopping, and Using HiveServer2 in CDH
Using Apache Hive with HBase in CDH
Using the Hive Schema Tool in CDH
Installing the Hive JDBC Driver on Clients in CDH
Setting HADOOP_MAPRED_HOME for Apache Hive in CDH
Configuring the Hive Metastore to Use HDFS High Availability in CDH

Page generated July 25, 2018.