Configuring an External Database for Sqoop 2
Note: This page contains references to CDH 5
components or features that have been removed from CDH 6. These references are only applicable if you are managing a CDH 5 cluster with Cloudera Manager 6. For more information, see Deprecated Items.
Sqoop 2 has a built-in Derby database, but Cloudera recommends that you use a PostgreSQL database instead, for the following reasons:
Note:
Cloudera currently has no recommended way to migrate data from an existing Derby database into the new PostgreSQL database.
Use the procedure that follows to configure Sqoop 2 to use PostgreSQL instead of Apache Derby.
Create the Sqoop 2 User and Sqoop 2 Database
$ psql -U postgres Password for user postgres: ***** postgres=# CREATE ROLE sqoop LOGIN ENCRYPTED PASSWORD 'sqoop' NOSUPERUSER INHERIT CREATEDB NOCREATEROLE; CREATE ROLE postgres=# CREATE DATABASE "sqoop" WITH OWNER = sqoop ENCODING = 'UTF8' TABLESPACE = pg_default LC_COLLATE = 'en_US.UTF8' LC_CTYPE = 'en_US.UTF8' CONNECTION LIMIT = -1; CREATE DATABASE postgres=# \q
Configure Sqoop 2 to use PostgreSQL
Minimum Required Role: Configurator (also provided by Cluster Administrator, Full Administrator)
- Go to the Sqoop 2 service.
- Click the Configuration tab.
- Select .
- Select .
- Set the following properties:
- Sqoop Repository Database Type - postgresql
- Sqoop Repository Database Host - the hostname on which you installed the PostgreSQL server. If the port is non-default for your database type, use host:port notation.
- Sqoop Repository Database Name, User, Password - the properties you specified in Create the Sqoop 2 User and Sqoop 2 Database.
- Enter a Reason for change, and then click Save Changes to commit the changes.
- Restart the service.
Page generated July 25, 2018.
<< Install and Configure Oracle Database for Cloudera Software | ©2016 Cloudera, Inc. All rights reserved | Step 5: Set up the Cloudera Manager Database >> |
Terms and Conditions Privacy Policy |