tJDBCConfiguration
Stores connection information and credentials to be reused by other JDBC
components.
You configure the connection to a given database in
tJDBCConfiguration and configure the other JDBC related components to
reuse this configuration. At runtime, the Spark executors read this configuration in order
to connect to this database.
If you use JDBC with Databricks on Azure, you must have a
Premium pricing workspace for your Databricks cluster. For further information about Azure
Databricks pricing, see Azure Databricks pricing.
Depending on the Talend
product you are using, this component can be used in one, some or all of the following
Job frameworks:
-
Spark Batch: see tJDBCConfiguration properties for Apache Spark Batch.
The component in this framework is available in all subscription-based Talend products with Big Data
and Talend Data Fabric. -
Spark Streaming: see tJDBCConfiguration properties for Apache Spark Streaming.
This component is available in Talend Real Time Big Data Platform and Talend Data Fabric.
tJDBCConfiguration properties for Apache Spark Batch
These properties are used to configure tJDBCConfiguration running in the Spark
Batch Job framework.
The Spark Batch
tJDBCConfiguration component belongs to the Storage and the Databases families.
The component in this framework is available in all subscription-based Talend products with Big Data
and Talend Data Fabric.
Basic settings
Property type |
Either Built-In or Repository. Built-In: No property data stored centrally.
Repository: Select the repository file where the |
||
JDBC URL |
The JDBC URL of the database to be used. For
|
||
Driver JAR |
Complete this table to load the driver JARs needed. To do For more information, see Importing a database driver. |
||
Driver Class |
Enter the class name for the specified driver between double |
||
Username and Password |
Enter the authentication information to the database you need To enter the password, click the […] button next to the If you are using Databricks, Available only for Spark V1.4. and onwards. |
||
Additional JDBC |
Specify additional connection properties for the database connection you are This field is not available if the Use an existing |
Advanced settings
Connection pool |
In this area, you configure, for each Spark executor, the connection pool used to control
|
Evict connections |
Select this check box to define criteria to destroy connections in the connection pool. The
|
Usage
Usage rule |
This component is used with no need to be connected to other The configuration in a tJDBCConfiguration component applies only on the JDBC related |
Spark Connection |
In the Spark
Configuration tab in the Run view, define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |
Related scenarios
For a scenario about how to use the same type of component in a Spark Batch Job, see Writing and reading data from MongoDB using a Spark Batch Job.
tJDBCConfiguration properties for Apache Spark
Streaming
These properties are used to configure tJDBCConfiguration running in the Spark
Streaming Job framework.
The Spark Streaming
tJDBCConfiguration component belongs to the Storage and the Databases families.
This component is available in Talend Real Time Big Data Platform and Talend Data Fabric.
Basic settings
Property type |
Either Built-In or Repository. Built-In: No property data stored centrally.
Repository: Select the repository file where the |
||
JDBC URL |
The JDBC URL of the database to be used. For
|
||
Driver JAR |
Complete this table to load the driver JARs needed. To do For more information, see Importing a database driver. |
||
Driver Class |
Enter the class name for the specified driver between double |
||
Username and Password |
Enter the authentication information to the database you need To enter the password, click the […] button next to the If you are using Databricks, Available only for Spark V1.4. and onwards. |
||
Additional JDBC |
Specify additional connection properties for the database connection you are This field is not available if the Use an existing |
Advanced settings
Connection pool |
In this area, you configure, for each Spark executor, the connection pool used to control
|
Evict connections |
Select this check box to define criteria to destroy connections in the connection pool. The
|
Usage
Usage rule |
This component is used with no need to be connected to other components. The configuration in a tJDBCConfiguration component applies only on the JDBC related |
Spark Connection |
In the Spark
Configuration tab in the Run view, define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |
Related scenarios
For a scenario about how to use the same type of component in a Spark Streaming Job, see
Reading and writing data in MongoDB using a Spark Streaming Job.