tRedshiftOutputBulkExec
Executes the Insert action on the data provided.
As a dedicated component, it offers performance gains during Insert
operations to Amazon Redshift.
The tRedshiftOutputBulk and tRedshiftBulkExec components can be used together in a two-step process
to load data to Amazon Redshift from a delimited/CSV file on Amazon S3. In the first
step, a delimited/CSV file is generated. In the second step, this file is used in the
COPY command that feeds Amazon Redshift. These two steps are fused together in the
tRedshiftOutputBulkExec component. The
advantage of using two separate steps is that the data can be transformed before it is
loaded to Amazon Redshift.
This component receives data from the preceding component, generates a single
delimited/CSV file, uploads the file to Amazon S3, and finally
loads the data from Amazon S3 into Amazon Redshift.
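To make the fused behavior concrete, the sketch below reproduces the stages by hand: generate a local CSV file, upload it to Amazon S3, and run a COPY command over JDBC. This is a minimal illustration and not the code the component generates; it assumes the AWS SDK for Java (v1) and the Redshift JDBC driver are on the classpath, and every credential, bucket, table, and host name is a placeholder.

```java
import java.io.File;
import java.io.PrintWriter;
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

import com.amazonaws.auth.AWSStaticCredentialsProvider;
import com.amazonaws.auth.BasicAWSCredentials;
import com.amazonaws.services.s3.AmazonS3;
import com.amazonaws.services.s3.AmazonS3ClientBuilder;

public class RedshiftBulkLoadSketch {

    public static void main(String[] args) throws Exception {
        String accessKey = "MY_ACCESS_KEY";      // placeholder
        String secretKey = "MY_SECRET_KEY";      // placeholder
        String bucket = "my-bucket";             // placeholder
        String key = "data/person.csv";          // placeholder

        // Step 1: generate a single delimited/CSV file locally
        // (what tRedshiftOutputBulk does on its own).
        File local = new File("/tmp/person.csv");
        try (PrintWriter out = new PrintWriter(local)) {
            out.println("1;Andrew");
            out.println("2;Bill");
        }

        // Step 2a: upload the file to Amazon S3.
        AmazonS3 s3 = AmazonS3ClientBuilder.standard()
                .withRegion("us-east-1")         // placeholder region
                .withCredentials(new AWSStaticCredentialsProvider(
                        new BasicAWSCredentials(accessKey, secretKey)))
                .build();
        s3.putObject(bucket, key, local);

        // Step 2b: load from S3 into Redshift with the COPY command
        // (what tRedshiftBulkExec does on its own).
        try (Connection conn = DriverManager.getConnection(
                "jdbc:redshift://myhost:5439/mydb", "myUser", "myPassword");
             Statement stmt = conn.createStatement()) {
            stmt.execute("COPY myschema.person FROM 's3://" + bucket + "/" + key + "'"
                    + " CREDENTIALS 'aws_access_key_id=" + accessKey
                    + ";aws_secret_access_key=" + secretKey + "'"
                    + " DELIMITER ';'");
        }
    }
}
```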
tRedshiftOutputBulkExec Standard properties
These properties are used to configure tRedshiftOutputBulkExec running in the Standard Job framework.
The Standard
tRedshiftOutputBulkExec component belongs to the Cloud and the Databases families.
The component in this framework is generally available.
Basic settings
Property Type
Either Built-In or Repository.
Built-In: No property data stored centrally.
Repository: Select the repository file where the properties are stored.
Use an existing connection
Select this check box and in the Component List drop-down list, select the desired connection component to reuse the connection details you already defined.
Host
Type in the IP address or hostname of the database server.

Port
Type in the listening port number of the database server.

Database
Type in the name of the database.

Schema
Type in the name of the schema.
Username and Password
Type in the database user authentication data. To enter the password, click the […] button next to the Password field, enter the password between double quotes in the dialog box that opens, and click OK to save the settings.
Additional JDBC Parameters
Specify additional JDBC properties for the connection you are creating. The properties are separated by ampersand (&) and each property is a key-value pair, for example, ssl=true&sslfactory=com.amazon.redshift.ssl.NonValidatingFactory for an SSL connection. A sketch of how these settings combine into a JDBC URL follows this table.
Table Name
Specify the name of the table to be written. Note that only one table can be written at a time.
Action on table
On the table defined, you can perform one of the following operations: None, Drop and create table, Create table, Create table if does not exist, Drop table if exists and create, or Clear table.
Schema and Edit schema
A schema is a row description. It defines the number of fields (columns) to be processed and passed on to the next component.
Built-In: You create and store the schema locally for this component only.
Repository: You have already created the schema and stored it in the Repository. You can reuse it in various projects and Job designs.
Click Edit schema to make changes to the schema.
Data file path at local
Specify the local path to the file to be generated. Note that the file is generated on the same machine where the Studio is installed or where the Job using this component is deployed.
Append the local file
Select this check box to append data to the specified local file if it already exists, instead of overwriting it.
Create directory if not exists
Select this check box to create the directory specified in the Data file path at local field if it does not exist.
Access Key
Specify the Access Key ID that uniquely identifies an AWS account. For further information on how to get your Access Key and Secret Key, see Getting Your AWS Access Keys.
Secret Key
Specify the Secret Access Key, constituting the security credentials in combination with the Access Key. To enter the secret key, click the […] button next to the Secret key field, enter it between double quotes in the dialog box that opens, and click OK to save the settings.
Bucket
Type in the name of the Amazon S3 bucket, namely the top level folder, to which the file is uploaded.
Key
Type in an object key to assign to the file uploaded to Amazon S3.
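As a rough illustration of how the connection fields above fit together, the sketch below builds a Redshift JDBC URL from the Host, Port, Database, and Additional JDBC Parameters values. All values, including the SSL parameter string, are examples rather than required settings.

```java
import java.sql.Connection;
import java.sql.DriverManager;

public class RedshiftConnectionSketch {

    // Placeholder values standing in for the Host, Port, Database,
    // Username/Password and Additional JDBC Parameters fields.
    public static Connection open() throws Exception {
        String host = "redshift-cluster.example.com";
        int port = 5439;
        String database = "mydb";
        String extraParams =
                "ssl=true&sslfactory=com.amazon.redshift.ssl.NonValidatingFactory";

        // Additional JDBC parameters are appended to the URL as a query string.
        String url = "jdbc:redshift://" + host + ":" + port + "/" + database
                + (extraParams.isEmpty() ? "" : "?" + extraParams);
        return DriverManager.getConnection(url, "myUser", "myPassword");
    }
}
```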
Advanced settings
Fields terminated by
Enter the character used to separate fields.
Enclosed by
Select the character in a pair of which the fields are enclosed.
Compressed by
Select this check box and select a compression type from the list displayed to compress the data file. This field disappears when the Append the local file check box is selected.
Encrypt
Select this check box to generate and upload the data file to Amazon S3 using client-side encryption. By default, this check box is cleared and the data file is uploaded to Amazon S3 using server-side encryption. For more information about client-side and server-side encryption, see Protecting Data Using Encryption in the Amazon S3 documentation.
Encoding
Select an encoding type for the data in the file to be generated.
Delete local file after putting it to s3
Select this check box to delete the local file after it is uploaded to Amazon S3.
Date format
Select an item from the list to specify the date format in the source data.
Time format
Select an item from the list to specify the time format in the source data.
Settings
Click the [+] button below the table to add rows, and set the parameters for the COPY command. For more information about the parameters, see http://docs.aws.amazon.com/redshift/latest/dg/r_COPY.html. A sketch of a COPY command using these options follows this table.
Config client
Select this check box to configure client parameters for Amazon S3.
tStatCatcher Statistics
Select this check box to gather the Job processing metadata at the Job level as well as at each component level.
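The advanced options above largely translate into parameters of the COPY command that performs the load. Under the same placeholder assumptions as earlier, the sketch below shows how a field delimiter, a compression type, and date/time formats might appear in a COPY statement; the option names are documented COPY parameters, but the exact statement the component builds may differ.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class RedshiftCopyOptionsSketch {

    public static void main(String[] args) throws Exception {
        // All identifiers and credentials are placeholders.
        String copy = "COPY myschema.person"
                + " FROM 's3://my-bucket/data/person.csv.gz'"
                + " CREDENTIALS 'aws_access_key_id=MY_ACCESS_KEY;"
                + "aws_secret_access_key=MY_SECRET_KEY'"
                + " DELIMITER ';'"                      // Fields terminated by
                + " GZIP"                               // Compressed by
                + " DATEFORMAT 'YYYY-MM-DD'"            // Date format
                + " TIMEFORMAT 'YYYY-MM-DD HH:MI:SS'";  // Time format

        try (Connection conn = DriverManager.getConnection(
                "jdbc:redshift://myhost:5439/mydb", "myUser", "myPassword");
             Statement stmt = conn.createStatement()) {
            stmt.execute(copy);
        }
    }
}
```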
Global Variables
ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string.
A Flow variable functions during the execution of a component, while an After variable functions after the execution of the component.
To fill up a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.
For further information about variables, see Talend Studio User Guide.
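For example, a downstream component such as a tJava can read the after variable once the component has run. The component label tRedshiftOutputBulkExec_1 below is illustrative, and globalMap is the map that Talend's generated Job code provides.

```java
// Inside a tJava placed after tRedshiftOutputBulkExec_1:
String error = (String) globalMap.get("tRedshiftOutputBulkExec_1_ERROR_MESSAGE");
if (error != null) {
    System.err.println("Bulk load failed: " + error);
}
```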
Usage
Usage rule
This component is mainly used when no particular transformation is required on the data to be loaded onto Amazon Redshift.
Dynamic settings
Click the [+] button to add a row in the table and fill the Code field with a context variable to choose your database connection dynamically from multiple connections planned in your Job. The Dynamic settings table is available only when the Use an existing connection check box is selected in the Basic settings view. Once a dynamic parameter is defined, the Component List box in the Basic settings view becomes unusable. For examples on using dynamic parameters, see Scenario: Reading data from databases through context-based dynamic connections and Scenario: Reading data from different MySQL databases using dynamically loaded connection parameters. For more information on Dynamic settings and context variables, see Talend Studio User Guide. A hypothetical setup is sketched below.
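As a hypothetical illustration of a dynamic parameter, the connection component to reuse can be named through a context variable that changes per execution context; all names here are invented for the example.

```java
// Hypothetical setup:
//   Contexts view:                 redshiftConn = "tRedshiftConnection_1" (Dev)
//                                  redshiftConn = "tRedshiftConnection_2" (Prod)
//   Dynamic settings, Code field:  context.redshiftConn
// In the generated Job code the dynamic parameter simply reads the variable:
String connectionComponent = context.redshiftConn;
```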
Related scenario
For a related scenario, see Scenario: loading/unloading data from/to Amazon S3.