tBigQueryConfiguration
for a Spark Job.
tBigQueryConfiguration properties for Apache Spark Batch
These properties are used to configure tBigQueryConfiguration running in the Spark Batch Job framework.
The Spark Batch
tBigQueryConfiguration component belongs to the Storage and the Databases families.
The component in this framework is available only if you have subscribed to one
of the
Talend
solutions with Big Data.
Basic settings
When you use this component with Google Dataproc:
BigQuery temp GCS path |
Enter the directory on Google Storage to temporarily When you use BigQuery with Dataproc, in |
When you use this component with the other distributions
Project identifier |
Enter the ID of your Google Cloud Platform project. If you are not certain about your project ID, check it in the Manage |
Path to Google Credentials file |
Enter the path to the credentials file associated to the user account to If you use Talend Jobserver to run your |
Use P12 credentials file format |
When the Google credentials file to be used is in P12 format, select this |
BigQuery temp GCS path |
Enter the directory on Google Storage to temporarily When you use BigQuery with Dataproc, in |
Usage
Usage rule |
This component is used standalone in a Subjob to provide |
Spark Connection |
You need to use the Spark Configuration tab in
the Run view to define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |
tBigQueryConfiguration properties for Apache Spark Streaming
These properties are used to configure tBigQueryConfiguration running
in the Spark Streaming Job framework.
The Spark Streaming
tBigQueryConfiguration component
belongs to the Storage and the Databases families.
The component in this framework is available only if you have subscribed to Talend Real-time Big Data Platform or Talend Data
Fabric.
Basic settings
When you use this component with Google Dataproc:
BigQuery temp GCS path |
Enter the directory on Google Storage to temporarily When you use BigQuery with Dataproc, in |
When you use this component with the other distributions
Project identifier |
Enter the ID of your Google Cloud Platform project. If you are not certain about your project ID, check it in the Manage |
Path to Google Credentials file |
Enter the path to the credentials file associated to the user account to If you use Talend Jobserver to run your |
Use P12 credentials file format |
When the Google credentials file to be used is in P12 format, select this |
BigQuery temp GCS path |
Enter the directory on Google Storage to temporarily When you use BigQuery with Dataproc, in |
Usage
Usage rule |
This component is used standalone in a Subjob to provide |
Spark Connection |
You need to use the Spark Configuration tab in
the Run view to define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |