tGSConfiguration
Job.
tGSConfiguration properties for Apache Spark Batch
These properties are used to configure tGSConfiguration running in the Spark Batch Job framework.
The Spark Batch
tGSConfiguration component belongs to the Storage family.
The component in this framework is available only if you have subscribed to one
of the
Talend
solutions with Big Data.
Basic settings
When you use this component with Google Dataproc:
|
Google Storage bucket |
Enter the name of the bucket to be used by the whole For example, if you enter |
When you use this component with other distributions:
|
Project identifier |
Enter the ID of your Google Cloud Platform project. If you are not certain about your project ID, check it in the Manage |
|
Google Storage bucket |
Enter the name of the bucket to be used by the whole For example, if you enter |
|
Use P12 credentials file format |
When the Google credentials file to be used is in P12 format, select this |
|
Path to Google Credentials file |
Enter the path to the credentials file associated to the user account to If you use Talend Jobserver to run your |
Global Variables
|
Global Variables |
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see |
Usage
|
Usage rule |
This component is used standalone in a Subjob to provide connection configuration to Google Storage for the whole Job. |
|
Spark Connection |
You need to use the Spark Configuration tab in
the Run view to define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |
tGSConfiguration properties for Apache Spark Streaming
These properties are used to configure tGSConfiguration running
in the Spark Streaming Job framework.
The Spark Streaming
tGSConfiguration component
belongs to the Storage family.
The component in this framework is available only if you have subscribed to Talend Real-time Big Data Platform or Talend Data
Fabric.
Basic settings
When you use this component with Google Dataproc:
|
Google Storage bucket |
Enter the name of the bucket to be used by the whole For example, if you enter |
When you use this component with other distributions:
|
Project identifier |
Enter the ID of your Google Cloud Platform project. If you are not certain about your project ID, check it in the Manage |
|
Google Storage bucket |
Enter the name of the bucket to be used by the whole For example, if you enter |
|
Use P12 credentials file format |
When the Google credentials file to be used is in P12 format, select this |
|
Path to Google Credentials file |
Enter the path to the credentials file associated to the user account to If you use Talend Jobserver to run your |
Global Variables
|
Global Variables |
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see |
Usage
|
Usage rule |
This component is used standalone in a Subjob to provide connection configuration to Google Storage for the whole Job. |
|
Spark Connection |
You need to use the Spark Configuration tab in
the Run view to define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |