
Warning
This component will be available in the Palette of
the studio on the condition that you have subscribed to the relevant edition of one of
the Talend solutions with Big Data.
Component family |
FileScale |
Note that this component is deprecated. |
Function |
tFSSort sorts input data by sort |
|
Purpose |
tFSSort helps creating metrics |
|
Basic settings |
Schema type and Edit |
A schema is a row description, it defines the number of fields to be processed and Click Edit schema to make changes to the schema. If the
|
|
|
Repository: You have already |
|
|
Built-in: You create and store |
|
Property type |
Either Built-in or Repository. Since version 5.6, both the Built-In mode and the Repository mode are |
|
|
Built-in: No property data stored |
|
|
Repository: Select the repository |
|
Input File Name |
Name of the file holding the data you want to sort. |
|
Output File Name |
Name of the file where you want to write the sorted data. |
|
Record separator (char) |
Character, string or regular expression to separate records |
|
Field separator (char) |
Character, string or regular expression to separate fields in a |
|
Header |
Number of records to be skipped in the beginning of the |
|
Footer |
Number of records to be skipped at the end of the file. |
|
Criteria |
Click the plus button to add as many lines as required for the
Schema column: Click in the cell NoteThe order is essential as it determines the sorting Sort type: Click in the cell and Order type: Click in the cell and |
Advanced settings |
Generate FSLang File |
Select this check box to generate the FSLang file corresponding to |
|
Assign FileScale Path |
Select this check box and then click the three-dot button next to |
|
Specify Number of Process Child |
Select this check box and enter the number of child processes to |
|
Custom FileScale Parameter (separated by,) |
Enter the parameters for any specific operation you want to add to |
|
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at a |
Global Variables |
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see Talend Studio |
|
Usage |
This component handles files therefore it does not require input |
|
Limitation |
Limitation is imposed by limits of physical memory and CPU |
Warning
Make sure that you have unzipped and saved locally the FileScale
executable file delivered by
Talend
. You must define the path of this executable file in the
Advanced settings
view of
tFSSort
.
This scenario describes a Job that sort in very short time big amount of data in a
large scale file following two defined value entries.
In this scenario, we have already stored the input schemas of the large input file in
the repository. For more information about storing schema metadata in the Repository tree view, see Talend Studio User
Guide.
The input file contains 10 columns: id, surname, firstname, zipcode, city,
dateofbirth, streetname, streetnr, statecode, and
state.

-
In the Repository tree view, expand Metadata and the file node where you have stored the
input schemas and drop the relevant metadata onto the design workspace.The [Component] dialog box displays.

-
Select tFSSort from the list and click
OK to close the dialog box.The tFSSort component displays in the
workspase. -
Double-click tFSSort to display its Basic settings view.

All tFSSort property fields are automatically filled
in. If you did not define your input schemas locally in the repository, fill in the
details manually after selecting Built-in in the
Schema Type and Property
Type fields.
-
In the Output File Name, browse to the output
file you want to write the sorted data in. -
In the Criteria table, click the plus button
to add columns to the list and then select the schema columns you want to use as
base for the sorting operation.In this scenario, we want to sort the data according to the
city and surname columns. We want
to group the data alphabetically in an ascending order. -
In the Sort type and Order type columns, click in the cell and select alpha and asc for
the sort type and order type respectively. -
Click the Advanced settings tab to display
the advanced settings view and then select the Assign
FileScale Path check box to display the FileScale Path field and browse to the filescale executable
file. -
Save your Job and press F6 to execute
it.

A progress bar displays below the tFSSort component
in the design workspace to show the completed percentage of the sorting operation. This
progress bar will make it evident how the huge input data is sorted at high
speed.
When the percentage progress bar reaches 100%, the sorted data according to first the
city name and second the surname is written in the defined output file.