tJDBCConfiguration
Stores connection information and credentials to be reused by other JDBC
components.
You configure the connection to a given database in
tJDBCConfiguration and configure the other JDBC related components to
reuse this configuration. At runtime, the Spark executors read this configuration in order
to connect to this database.
Depending on the Talend solution you
are using, this component can be used in one, some or all of the following Job
frameworks:
-
Spark Batch: see tJDBCConfiguration properties for Apache Spark Batch.
The component in this framework is available only if you have subscribed to one
of the
Talend
solutions with Big Data. -
Spark Streaming: see tJDBCConfiguration properties for Apache Spark Streaming.
The component in this framework is available only if you have subscribed to Talend Real-time Big Data Platform or Talend Data
Fabric.
tJDBCConfiguration properties for Apache Spark Batch
These properties are used to configure tJDBCConfiguration running in the Spark Batch Job framework.
The Spark Batch
tJDBCConfiguration component belongs to the Storage and the Databases families.
The component in this framework is available only if you have subscribed to one
of the
Talend
solutions with Big Data.
Basic settings
|
Property type |
Either Built-In or Repository. Built-In: No property data stored centrally.
Repository: Select the repository file where the |
|
JDBC URL |
Specify the JDBC URL of the database to be used. For example, the Available only for Spark V1.4. and onwards. |
|
Driver JAR |
Complete this table to load the driver JARs needed. To do this, click the |
|
Driver Class |
Enter the class name for the specified driver between double quotation marks. |
|
Username and Password |
Enter the authentication information to the database you need to connect To enter the password, click the […] button next to the Available only for Spark V1.4. and onwards. |
|
Additional JDBC parameters |
Specify additional connection properties for the database connection you are This field is not available if the Use an existing |
Advanced settings
|
Connection pool |
In this area, you configure, for each Spark executor, the connection pool used to control
|
|
Evict connections |
Select this check box to define criteria to destroy connections in the connection pool. The
|
Usage
|
Usage rule |
This component is used with no need to be connected to other components. The configuration in a tJDBCConfiguration This component, along with the Spark Batch component Palette it belongs to, appears only Note that in this documentation, unless otherwise |
|
Spark Connection |
You need to use the Spark Configuration tab in
the Run view to define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |
Related scenarios
For a scenario about how to use the same type of component in a Spark Batch Job, see Writing and reading data from MongoDB using a Spark Batch Job.
tJDBCConfiguration properties for Apache Spark Streaming
These properties are used to configure tJDBCConfiguration running in the Spark Streaming Job framework.
The Spark Streaming
tJDBCConfiguration component belongs to the Storage and the Databases families.
The component in this framework is available only if you have subscribed to Talend Real-time Big Data Platform or Talend Data
Fabric.
Basic settings
|
Property type |
Either Built-In or Repository. Built-In: No property data stored centrally.
Repository: Select the repository file where the |
||
|
JDBC URL |
Specify the JDBC URL of the database to be used. For example, the If you are using Spark V1.3, this URL should contain the authentication
information, such as:
|
||
|
Driver JAR |
Complete this table to load the driver JARs needed. To do this, click the |
||
|
Driver Class |
Enter the class name for the specified driver between double quotation marks. |
||
|
Username and Password |
Enter the authentication information to the database you need to connect To enter the password, click the […] button next to the Available only for Spark V1.4. and onwards. |
||
|
Additional JDBC parameters |
Specify additional connection properties for the database connection you are This field is not available if the Use an existing |
Advanced settings
|
Connection pool |
In this area, you configure, for each Spark executor, the connection pool used to control
|
|
Evict connections |
Select this check box to define criteria to destroy connections in the connection pool. The
|
Usage
|
Usage rule |
This component is used with no need to be connected to other components. The configuration in a tJDBCConfiguration This component, along with the Spark Streaming component Palette it belongs to, appears Note that in this documentation, unless otherwise explicitly stated, a scenario presents |
|
Spark Connection |
You need to use the Spark Configuration tab in
the Run view to define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |
Related scenarios
For a scenario about how to use the same type of component in a Spark Streaming Job, see
Reading and writing data in MongoDB using a Spark Streaming Job.