tGSConfiguration
Job.
tGSConfiguration properties for Apache Spark Batch
These properties are used to configure tGSConfiguration running in the Spark Batch Job framework.
The Spark Batch
tGSConfiguration component belongs to the Storage family.
The component in this framework is available in all subscription-based Talend products with Big Data
and Talend Data Fabric.
Basic settings
When you use this component with Google Dataproc:
Google Storage bucket |
Enter the name of the bucket to be For example, if you enter |
When you use this component with other distributions:
Project identifier |
Enter the ID of your Google Cloud Platform project. If you are not certain about your project ID, check it in the Manage |
Google Storage bucket |
Enter the name of the bucket to be For example, if you enter |
Use P12 credentials file format |
When the Google credentials file to be used is in P12 format, select this |
Path to Google Credentials file |
Enter the path to the credentials file associated to the user account to If you use Talend Jobserver to run your |
Global Variables
Global Variables |
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see |
Usage
Usage rule |
This component is used standalone in a subJob to provide connection configuration to Google Storage for the whole Job. |
Spark Connection |
In the Spark
Configuration tab in the Run view, define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |
tGSConfiguration properties for Apache Spark Streaming
These properties are used to configure tGSConfiguration running
in the Spark Streaming Job framework.
The Spark Streaming
tGSConfiguration component
belongs to the Storage family.
This component is available in Talend Real Time Big Data Platform and Talend Data Fabric.
Basic settings
When you use this component with Google Dataproc:
Google Storage bucket |
Enter the name of the bucket to be For example, if you enter |
When you use this component with other distributions:
Project identifier |
Enter the ID of your Google Cloud Platform project. If you are not certain about your project ID, check it in the Manage |
Google Storage bucket |
Enter the name of the bucket to be For example, if you enter |
Use P12 credentials file format |
When the Google credentials file to be used is in P12 format, select this |
Path to Google Credentials file |
Enter the path to the credentials file associated to the user account to If you use Talend Jobserver to run your |
Global Variables
Global Variables |
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see |
Usage
Usage rule |
This component is used standalone in a subJob to provide connection configuration to Google Storage for the whole Job. |
Spark Connection |
In the Spark
Configuration tab in the Run view, define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |