tGoogleDataprocManage
Platform.
tGoogleDataprocManage Standard properties
These properties are used to configure tGoogleDataprocManage running in the Standard Job framework.
The Standard
tGoogleDataprocManage component belongs to the Cloud family.
The component in this framework is available in all Talend products with Big Data
and in Talend Data Fabric.
Basic settings
Project identifier |
Enter the ID of your Google Cloud Platform project. If you are not certain about your project ID, check it in the Manage |
Cluster identifier |
Enter the ID of your Dataproc cluster to be used. |
Provide Google Credentials in file |
Leave this check box clear, when you When you launch your Job from a remote For further information about this Google |
Action |
Select the action you want tGoogleDataprocManage to
perform on the your cluster:
|
Version |
Select the version of the image to be used to create a Dataproc cluster. |
Region |
From this drop-down list, select the Google Cloud region to |
Zone |
Select the geographic zone in which the computing resources A zone in terms of Google Cloud is an isolated location |
Instance configuration |
Enter the parameters to determine how many masters and workers to be used by |
Advanced settings
Wait for cluster ready |
Select this check box to keep this component running until the cluster is When you clear this check box, this component stops running immediately after |
Master disk size |
Enter a number without quotation marks to determine the size of the disk of |
Master local SSD |
Enter a number without quotation marks to determine the number of local According to Google, these local SSDs are suitable only |
Worker disk size |
Enter a number without quotation marks to determine the size of the disk of |
Worker local SSD |
Enter a number without quotation marks to determine the number of local According to Google, these local SSDs are suitable only |
Network or Subnetwork |
Select either check box to use a Google Compute Engine network or subnetwork As Google does not allow network and subnetwork to be used concurrently, For further information about Google Dataproc cluster network configuration, |
Initialization action |
In this table, select the initialization actions that are available in the If you need to use custom initialization scripts, upload them to this shared
For further information about this shared bucket and the initialization |
tStatCatcher Statistics |
Select this check box to collect log data at the component level. |
Usage
Usage rule |
This component is used standalone in a subJob. |