tSnowflakeOutputBulk
Writes incoming data to files generated in a folder. The folder
can be in an internal Snowflake stage, an Amazon Simple Storage Service (Amazon S3) bucket,
or an Azure container.
The tSnowflakeOutputBulk and tSnowflakeBulkExec components work together in a two-step process:
- The tSnowflakeOutputBulk component uploads incoming data to a storage location.
- The tSnowflakeBulkExec component loads the data from that storage location into a
Snowflake database table.
Because the upload and the load are separate steps, you can transform the data before it is
loaded into the database table. The tSnowflakeOutputBulkExec component, detailed in a separate section, combines the two steps.
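For the internal stage case, the two steps can be pictured as the pair of Snowflake statements they roughly correspond to: a PUT that uploads a local file to a stage folder, followed by a COPY INTO that loads the staged files into a table. The sketch below only builds the statement text. The helper names, the local file path, and the table name are hypothetical, and the statements the components actually issue may differ.

```python
def build_put_statement(local_file: str, stage_folder: str) -> str:
    """Step 1 (upload): PUT a local file into a Snowflake stage folder."""
    return f"PUT file://{local_file} {stage_folder}"


def build_copy_statement(table: str, stage_folder: str) -> str:
    """Step 2 (load): COPY the staged files into a database table."""
    return f"COPY INTO {table} FROM {stage_folder}"


# Hypothetical values, for illustration only.
stage = "@~/myfolder1/myfolder2"
put_sql = build_put_statement("/tmp/bulk/part_0001.csv", stage)
copy_sql = build_copy_statement("MY_TABLE", stage)

print(put_sql)
print(copy_sql)
```

Any transformation of the data would take place between the two statements, which is what keeping the steps separate allows.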
tSnowflakeOutputBulk Standard properties
These properties are used to configure tSnowflakeOutputBulk running in the Standard Job framework.
The Standard tSnowflakeOutputBulk component belongs to the Cloud family.
The component in this framework is available in all subscription-based Talend products.
This component is a specific version of a dynamic database connector. The properties related to database settings vary depending on your database
type selection. For more information about dynamic database connectors, see Dynamic database components.
Basic settings
Database |
Select a type of database from the list and click |
Property Type |
Select the way the connection details
This property is not available when another connection component is selected |
Connection Component |
Select the component that opens the database connection to be reused by this |
Account |
In the Account field, enter, in double quotation marks, the account name This field is available only when you |
Snowflake Region |
Select an AWS region or an Azure region from This field is available only when you |
User Id and Password |
Enter, in double quotation marks, your authentication
This field is available only when you |
Warehouse |
Enter, in double quotation marks, the name of the This field is available only when you |
Schema |
Enter, in double quotation marks, the name of the This field is available only when you |
Database |
Enter, in double quotation marks, the name of the This field is available only when you |
Schema and Edit Schema |
A schema is a row description. It defines the number of fields Built-In: You create and store the schema locally for this component Repository: You have already created the schema and stored it in the If the Snowflake data type to Click Edit
Note that if the input value of any This This |
Storage | Select the type of storage into which data will be uploaded.
|
Stage Folder |
Specify the Snowflake stage folder to store the data.
This field is available when you |
Region | Specify the region where the S3 bucket is located.
This field is available when you select |
Access Key and Secret Key |
Enter the authentication information required to connect to the Amazon S3 bucket to be used. To enter the password, click This field is available when you select |
Bucket | Enter the name of the bucket (in double quotation marks) to be used for storing data. This bucket must already exist. This field is available when you select |
Folder | Enter the name of the folder (in double quotation marks) in which you want to store data. This folder will be created if it does not exist at runtime. This property is available only when S3 or Azure is selected from the Storage drop-down list. |
Server-Side Encryption |
Select this check box to encrypt the files to be uploaded to the S3 bucket on the server side. This check box is selected by default. This field is available when you select |
Protocol | Select the protocol used to create an Azure connection.
This field is available when you select |
Account Name |
Enter the Azure storage account name (in double quotation marks). This field is available when you select |
Container | Enter the name (in double quotation marks) of the Azure container used for storing data. This field is available when you select |
SAS Token | Specify the SAS token to grant limited access to objects in your storage account. To enter the SAS token, click the This field is available when you select |
Advanced settings
Additional JDBC Parameters |
Specify additional connection properties for the database connection you are This field is available only when you |
Use Custom Snowflake Region |
Select this check box to specify a custom
Snowflake region. This option is available only when you select Use This Component from the Connection Component drop-down list in the Basic settings view.
For more information on Snowflake Region |
Login Timeout |
Specify the timeout period (in minutes) |
Role |
Enter, in double quotation marks, the default access This role must already exist and have been granted to the |
Use Custom Stage |
Select this check box to upload the data to the files generated in a folder under the stage. You also need to enter the path to the folder in the field provided. For example, to upload data to the files generated in myfolder1/myfolder2 under the stage, type "@~/myfolder1/myfolder2" in the field. This field is available when you Once selected, the Stage Folder |
Put Command Options |
Set parameters for the PUT command by selecting the following options from the drop-down list. The PUT command is provided by Snowflake. It uploads data to a Snowflake stage folder.
For information about the parameters of the PUT command, see the PUT command. This field is available when you |
Put Command Error Retry |
Specify the maximum data loading This field is available when you |
S3 Max Error Retry |
Specify the maximum data loading This field is available when you select |
Azure Max Error Retry |
Specify the maximum data loading This field is available when you select |
Use Custom S3 Connection Configuration |
Select this check box if you wish to use your custom S3 configuration. Option: select the parameter from the list. Value: enter This field is available when you select |
Non-empty Storage Folder Action |
Specify the action to be performed when the storage folder specified for uploading data is not empty.
|
Chunk Size (bytes) |
Specify the size for the files generated, which defaults to 52428800 bytes. With this option specified, This option can significantly |
Use Custom Local Folder | Specify a local folder as a temporary folder for holding the files generated. With this option selected, files for storing the incoming data are first generated in the specified local folder and are then moved to the specified storage after all the incoming data is uploaded. |
Number of file requests threads |
Specify the number of threads used for sending PUT requests in parallel when writing the data to the files. |
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at the Job level |
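As a rough illustration of the Chunk Size (bytes) setting, the following sketch groups rows into chunks that stay at or below a byte limit, using the documented default of 52428800 bytes. How the component actually splits the generated files is not described here. The function name, the newline accounting, and the rule that a row is never split are assumptions made for this example.

```python
from typing import Iterable, Iterator, List

# Documented default chunk size in bytes (50 * 1024 * 1024).
DEFAULT_CHUNK_SIZE = 52428800


def split_into_chunks(
    rows: Iterable[str], chunk_size: int = DEFAULT_CHUNK_SIZE
) -> Iterator[List[str]]:
    """Yield lists of rows whose UTF-8 size (plus newlines) fits in chunk_size.

    Assumption: a row is never split, so a single row larger than
    chunk_size ends up alone in its own chunk.
    """
    chunk: List[str] = []
    used = 0
    for row in rows:
        size = len(row.encode("utf-8")) + 1  # +1 for the newline terminator
        # Close the current chunk if adding this row would exceed the limit.
        if chunk and used + size > chunk_size:
            yield chunk
            chunk, used = [], 0
        chunk.append(row)
        used += size
    if chunk:
        yield chunk


# Tiny limit so the split is visible: each row counts as 4 bytes here.
chunks = list(split_into_chunks(["a,1", "b,2", "c,3"], chunk_size=8))
print(chunks)
```

With the small limit used here, the first two rows share a chunk and the third row starts a new one.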
Global Variables
NB_LINE |
The number of rows processed. This is an After variable and it returns an integer. |
NB_SUCCESS |
The number of rows successfully processed. This is an After variable and it returns an integer. |
NB_REJECT |
The number of rows rejected. This is an After variable and it returns an integer. |
ERROR_MESSAGE |
The error message generated by the component when an error occurs. This is an After variable and it returns a string. |
Usage
Usage rule |
This component is an end component of a data |
Related scenarios
For use cases related to tSnowflakeOutputBulk, see the following scenario: