tRedshiftOutputBulk
Prepares a delimited/CSV file that can be used by tRedshiftBulkExec to feed Amazon Redshift.
The tRedshiftOutputBulk and tRedshiftBulkExec components can be used together in a two step process
to load data to Amazon Redshift from a delimited/CSV file on Amazon S3. In the first
step, a delimited/CSV file is generated. In the second step, this file is used in the
INSERT statement used to feed Amazon Redshift. These two steps are fused together in the
tRedshiftOutputBulkExec component. The advantage of
using two separate steps is that the data can be transformed before it is loaded to
Amazon Redshift.
This component receives data from the preceding component,
generates a single delimited/CSV file and then uploads the file to
Amazon S3.
tRedshiftOutputBulk Standard properties
These properties are used to configure tRedshiftOutputBulk running in the Standard Job framework.
The Standard
tRedshiftOutputBulk component belongs to the Cloud and the Databases families.
The component in this framework is generally available.
Basic settings
|
Data file path at local |
Specify the local path to the file to be generated. Note that the file is generated on the same machine where the Studio |
|
Schema and Edit schema |
A schema is a row description. It defines the number of fields (columns) to |
|
|
Built-In: You create and store the |
|
|
Repository: You have already created |
|
|
Click Edit schema to make changes to the schema.
|
|
Compress the data file |
Select this check box and select a compression type from the list This check box disappears when the Append the |
|
Encrypt |
Select this check box to generate and upload the data file to Amazon By default, this check box is cleared and the data file will be For more information about the client-side and server-side encryption, |
|
Access Key |
Specify the Access Key ID that uniquely identifies an AWS Account. For |
|
Secret Key |
Specify the Secret Access Key, constituting the security credentials To enter the secret key, click the […] button next to |
|
Bucket |
Type in the name of the Amazon S3 bucket, namely the top level folder, |
|
Key |
Type in an object key to assign to the file uploaded to Amazon |
Advanced settings
|
Field Separator |
Enter the character used to separate fields. |
|
Text enclosure |
Select the character in a pair of which the fields are |
|
Delete local file after putting it to s3 |
Select this check box to delete the local file after being uploaded to |
|
Create directory if not exists |
Select this check box to create the directory specified in the |
|
Encoding |
Select an encoding type for the data in the file to be |
|
Config client |
Select this check box to configure client parameters for Amazon S3.
|
|
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at the Job |
Global Variables
|
Global Variables |
NB_LINE: the number of rows processed. This is an After
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see |
Usage
|
Usage rule |
This component is more commonly used with the tRedshiftBulkExec component to feed Amazon Redshift with |
Related scenario
For a related scenario, see Scenario: loading/unloading data from/to Amazon S3.