Configuring the connection to Hive
Configuring tHiveConnection
-
Double-click tHiveConnection to open its
Component view.
-
From the Property type list, select
Built-in. If you have already created the
connection in the Repository, select
Repository instead, click the
button to open the [Repository
content] dialog box, and select that connection. This way, the
Studio reuses that set of connection information for this Job.
For further information about how to create a Hadoop connection in
Repository, see the chapter describing the Hadoop
cluster node of the
Talend Open Studio for Big Data Getting Started
Guide.
-
In the Version area, select the Hadoop
distribution to be used and its version. If your distribution is not in the list,
select Custom to connect to a Hadoop distribution not
officially supported in the Studio.
For a step-by-step example of how to use the Custom option, see Connecting to a custom Hadoop distribution.
-
In the Connection area, enter the
connection parameters of the Hive database to be used.
-
In the Name node field,
enter the location of the master node, the NameNode, of the distribution to be
used. For example, hdfs://talend-hdp-all:8020. If you are using WebHDFS, the location should be
webhdfs://masternode:portnumber; if this WebHDFS is secured
with SSL, the scheme should be swebhdfs and you need to use
a tLibraryLoad in the Job to load the library required by
the secured WebHDFS.
-
In the Job tracker field, enter the
location of the JobTracker of your distribution. For example, talend-hdp-all:50300.
Note that the word Job in the term JobTracker refers to the MR, or
MapReduce, jobs described in Apache’s documentation on http://hadoop.apache.org/.
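The location formats described for the Name node and Job tracker fields can be sketched in a few lines of Python. This is only an illustration, not part of Talend Studio: the helper names (name_node_location, job_tracker_location, check_name_node) are hypothetical. It assumes the usual Hadoop convention in which the NameNode is addressed by a URI with an hdfs, webhdfs, or swebhdfs scheme and the JobTracker by a plain host:port pair.

```python
from urllib.parse import urlsplit

# Hypothetical helpers for illustration only; they are not a Talend API.
NAME_NODE_SCHEMES = {"hdfs", "webhdfs", "swebhdfs"}


def name_node_location(host, port, webhdfs=False, ssl=False):
    """Build a value for the Name node field.

    Plain HDFS uses hdfs://, WebHDFS uses webhdfs://, and WebHDFS
    secured with SSL uses swebhdfs://.
    """
    if ssl and not webhdfs:
        # Assumption of this sketch: SSL is only modelled for WebHDFS.
        raise ValueError("ssl=True requires webhdfs=True in this sketch")
    if ssl:
        scheme = "swebhdfs"
    elif webhdfs:
        scheme = "webhdfs"
    else:
        scheme = "hdfs"
    return f"{scheme}://{host}:{port}"


def job_tracker_location(host, port):
    """Build a value for the Job tracker field (host:port, no scheme)."""
    return f"{host}:{port}"


def check_name_node(value):
    """Return True if value looks like scheme://host:port with a known scheme."""
    parts = urlsplit(value)
    if parts.scheme not in NAME_NODE_SCHEMES:
        return False
    try:
        # .port raises ValueError when the port is not numeric.
        return parts.hostname is not None and parts.port is not None
    except ValueError:
        return False
```

For example, name_node_location("talend-hdp-all", 8020) gives hdfs://talend-hdp-all:8020, and job_tracker_location("talend-hdp-all", 50300) gives talend-hdp-all:50300. A check such as check_name_node can catch a value entered in the wrong field before the Job runs.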
Source: Talend documentation, https://help.talend.com