Component family |
MapReduce/Input |
|
Function |
The tS3Input component loads S3N-formatted (S3 Native This component, along with the MapReduce family it belongs to, appears only when you are |
|
Purpose |
tS3Input reads data from a given |
|
Basic settings |
Property type |
Either Built-in or Repository. |
|
|
Built-in: No property data stored |
|
|
Repository: Select the Repository |
Schema and Edit |
A schema is a row description. It defines the number of fields to be processed and passed on Click Edit schema to make changes to the schema. If the
|
|
Built-In: You create and store the schema locally for this |
||
Repository: You have already created the schema and |
||
Bucket and Folder |
Enter the bucket name and its folder in which you need to write data. You need to separate |
|
|
Access key and Secret |
Enter the authentication information required to connect to the Amazon S3 bucket to be To enter the password, click the […] button next to the |
File type |
Type |
Select the type of the file to be processed. The type of the file may be:
|
|
Row separator |
Enter the separator used to identify the end of a row. |
|
Field separator |
Enter character, string or regular expression to separate fields for the transferred |
Header |
Enter the number of rows to be skipped in the beginning of file. |
|
Custom encoding |
You may encounter encoding issues when you process the stored data. In that situation, select Select the encoding from the list or select Custom and This option is not available for a Sequence file. |
|
Advanced settings |
Advanced separator (for number) |
Select this check box to change the separator used for numbers. By This option is not available for a Sequence file. |
Trim all column |
Select this check box to remove the leading and trailing This option is not available for a Sequence file. |
|
Check column to trim |
This table is filled automatically with the schema being used. Select the check box(es) This option is not available for a Sequence file. |
|
|
Enable parallel execution |
Select this check box to perform high-speed data processing, by treating multiple data flows
|
Global Variables |
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see Talend Studio |
|
Usage |
In a Talend Map/Reduce Job, it is used as a start component and requires Once a Map/Reduce Job is opened in the workspace, tS3Input as well as the MapReduce family appears in the Palette of the Studio. Note that in this documentation, unless otherwise explicitly stated, a scenario presents |
|
Hadoop Connection |
You need to use the Hadoop Configuration tab in the This connection is effective on a per-Job basis. |