tS3Output
Writes data into a given S3 filesystem.
The tS3Output component receives data processed by its preceding
component and writes the data into a given S3 filesystem. By default, the
format used to write the data is S3N (S3 Native Filesystem).
tS3Output MapReduce properties (deprecated)
These properties are used to configure tS3Output running in the MapReduce Job framework.
The MapReduce tS3Output component belongs to the MapReduce family.
The information in this section applies only to users who have subscribed to
Talend Data Fabric or to any Talend product with Big Data. It does not
apply to Talend Open Studio for Big Data users.
The MapReduce framework is deprecated from Talend 7.3 onwards. Use Talend Jobs for Apache Spark to accomplish your integration tasks.
Basic settings
Property type |
Either Built-In or Repository. |
 |
Built-In: No property data stored centrally. |
 |
Repository: Select the repository file where the The following fields are |
Schema and Edit |
A schema is a row description. It defines the number of fields Click Edit
|
 |
Built-In: You create and store the schema locally for this component |
 |
Repository: You have already created the schema and stored it in the |
Force S3 format instead of the recommended |
Select this check box to use the S3 format to write the data. Note that the S3N format allows you to write structured data while |
Bucket and Folder |
Enter the bucket name and its folder you need to use. You |
Access key and Secret |
Enter the authentication information required to connect to To enter the password, click the […] button next to the |
Type |
Select the type of the file to be processed. The type of the file may be:
|
Action |
Select an operation for writing data:
Create: Creates a file and writes data to it.
Overwrite: Overwrites the file |
Row separator |
The separator used to identify the end of a row. |
Field separator |
Enter character, string or regular expression to separate fields for the transferred |
Include Header |
Select this check box to output the header of the data. This option is not available for a Sequence file. |
Custom encoding |
You may encounter encoding issues when you process the stored data. In that Select the encoding from the list or select Custom This option is not available for a Sequence file. |
Compress the data |
Select the Compress the data check box to compress the Hadoop provides different compression formats that help reduce the space needed for |
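As a rough illustration of how the Row separator, Field separator and Include Header settings combine, the following minimal Java sketch renders rows as delimited text. The class and method names (DelimitedOutputSketch, render) are hypothetical and are not part of Talend. The sketch only mimics the general idea and does not write to S3.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not a Talend API: shows how a row separator, a field
// separator and an "include header" flag shape delimited text output.
final class DelimitedOutputSketch {

    static String render(List<String> header, List<List<String>> rows,
                         String fieldSeparator, String rowSeparator,
                         boolean includeHeader) {
        StringBuilder out = new StringBuilder();
        if (includeHeader) {
            // The header line lists the column names, joined like any data row.
            out.append(String.join(fieldSeparator, header)).append(rowSeparator);
        }
        for (List<String> row : rows) {
            // Each row ends with the row separator, including the last one.
            out.append(String.join(fieldSeparator, row)).append(rowSeparator);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String text = render(
                Arrays.asList("id", "name"),
                Arrays.asList(Arrays.asList("1", "Ada"), Arrays.asList("2", "Linus")),
                ";", "\n", true);
        System.out.print(text);
    }
}
```

With ";" as the field separator and "\n" as the row separator, the sample data is rendered as a header line followed by one line per row.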
Advanced settings
Advanced separator (for number) |
Select this check box to change the separator used for numbers. By default, the thousands separator is a comma (,) and the decimal separator is a period (.). This option is not available for a Sequence file. |
Use local timezone for date | Select this check box to use the local time zone of the machine on which your Job is executed. If you leave this check box cleared, UTC is automatically used to format the Date-type data. |
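The effect of these two advanced options can be sketched with standard Java formatting classes. This is an illustration under assumptions, not the code the component generates: the class name AdvancedFormatSketch, the number pattern and the date pattern are all chosen for the example.

```java
import java.text.DecimalFormat;
import java.text.DecimalFormatSymbols;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;
import java.util.TimeZone;

// Hypothetical helper, not a Talend API.
final class AdvancedFormatSketch {

    // Formats a number with caller-chosen thousands and decimal separators.
    static String formatNumber(double value, char thousands, char decimal) {
        DecimalFormatSymbols symbols = new DecimalFormatSymbols(Locale.ROOT);
        symbols.setGroupingSeparator(thousands);
        symbols.setDecimalSeparator(decimal);
        // The pattern itself always uses ',' and '.'; the symbols decide the output.
        return new DecimalFormat("#,##0.00", symbols).format(value);
    }

    // Formats a date in the machine's time zone, or in UTC when the flag is off.
    static String formatDate(Date date, boolean useLocalTimezone) {
        SimpleDateFormat format = new SimpleDateFormat("yyyy-MM-dd HH:mm:ss", Locale.ROOT);
        format.setTimeZone(useLocalTimezone
                ? TimeZone.getDefault()
                : TimeZone.getTimeZone("UTC"));
        return format.format(date);
    }

    public static void main(String[] args) {
        System.out.println(formatNumber(1234567.891, ',', '.')); // default separators
        System.out.println(formatNumber(1234567.891, '.', ',')); // swapped separators
        System.out.println(formatDate(new Date(0L), false));     // UTC
        System.out.println(formatDate(new Date(0L), true));      // depends on the machine
    }
}
```

The last call shows why the option matters: the same Date value produces different text on machines in different time zones, whereas the UTC form is stable.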
Global Variables
Global Variables |
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see |
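As a hedged sketch of how such a variable might be read in Java code: Talend Jobs commonly expose component variables through a map named globalMap, keyed by the component's unique name followed by the variable name. The plain HashMap below is only a stand-in for that map, and the component name tS3Output_1, the sample message and the helper class are assumptions made for the example.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not a Talend API: looks up a component's ERROR_MESSAGE
// in a map shaped like a Job's globalMap (assumed key: <component>_ERROR_MESSAGE).
final class GlobalVariableSketch {

    static String errorMessageOf(Map<String, Object> globalMap, String componentName) {
        Object value = globalMap.get(componentName + "_ERROR_MESSAGE");
        // The variable is absent (null) when the component reported no error.
        return value == null ? null : (String) value;
    }

    public static void main(String[] args) {
        // Stand-in for the Job's globalMap; the values here are made up.
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tS3Output_1_ERROR_MESSAGE", "sample error text");

        System.out.println(errorMessageOf(globalMap, "tS3Output_1"));
        System.out.println(errorMessageOf(globalMap, "tS3Output_2"));
    }
}
```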
Usage
Usage rule |
In a Once a Map/Reduce Job is opened in the workspace, tS3Output as well as the MapReduce family appears in the Palette of the Studio. Note that in this documentation, unless otherwise |
Hadoop Connection |
You need to use the Hadoop Configuration tab in the When you configure this connection for tS3Output, you need to select the Use Datanode hostname check box. This connection is effective on a per-Job basis. |
Related scenario
This component is used in a similar way to the other output components that write data
into a given filesystem. Note, however, that when you configure the Hadoop connection in the
Hadoop configuration tab of the Run view, you need to select the Use
Datanode hostname check box.