August 17, 2023

tFSFilterRows – Docs for ESB 5.x

tFSFilterRows

tFSFilterRow_icon32.png

Warning

This component will be available in the Palette of the studio on the condition that you have
subscribed to the relevant edition of one of the Talend solutions
with Big Data.

tFSFilterRows Properties

Component family

FileScale

Note that this component is deprecated.

Function

tFSFilterRows enables you to
define a row filter on one table. This component has real-time
capabilities for filtering large scale files. To optimize
performance, the component usually sorts data before processing
it.

Purpose

Helps parametrizing row filters for large scale source data
against set conditions.

Basic settings

Schema type and Edit
Schema

A schema is a row description, it defines the number of fields to
be processed and passed on to the next component. The schema is
either Built-in or stored remotely
in the Repository.

Click Edit schema to make changes to the schema. If the
current schema is of the Repository type, three options are
available:

  • View schema: choose this option to view the
    schema only.

  • Change to built-in property: choose this option
    to change the schema to Built-in for local
    changes.

  • Update repository connection: choose this option to change
    the schema stored in the repository and decide whether to propagate the changes to
    all the Jobs upon completion. If you just want to propagate the changes to the
    current Job, you can select No upon completion and
    choose this schema metadata again in the [Repository
    Content]
    window.

 

 

Repository: You have already
created the schema and stored it in the Repository. You can reuse it
in various projects and job flowcharts. Related topic: see
Talend Studio
User Guide.

 

 

Built-in: You create and store
the schema locally for this component only. Related topic: see
Talend Studio
User Guide.

 

Property type

Either Built-in or Repository.

Since version 5.6, both the Built-In mode and the Repository mode are
available in any of the Talend solutions.

 

 

Built-in: No property data stored
centrally.

 

 

Repository: Select the repository
file where Properties are stored. The fields that follow are
pre-filled in using the fetched data.

 

Input File Name

Name of the file holding the data you want to filter.

 

Output File Name

Name of the file where you want to write the filtered data.

 

Record separator (char)

Character, string or regular expression to separate records
(lines).

 

Field separator (char)

Character, string or regular expression to separate fields in a
record.

 

Header

Number of records to be skipped in the beginning of the
file.

 

Footer

Number of records to be skipped at the end of the file.

 

Logical operator used to combine
conditions

In the case you want to combine simple filtering and advanced
mode, select the operator to combine both modes.

 

Conditions

Click the plus button to add as many conditions as needed.

The conditions are performed one after the other for each
row.

Schema column: Click in the cell
and select the column of the schema the function is to be operated
on.

Function: click in the cell and
select the function on the list.

Operator: Click in the cell and
select the operator to bind the schema column with the value.

Value: Type in the filtered value,
between quotes if need be.

 

Use advanced mode

Select this check box when the operation you want to perform
cannot be carried out through the standard functions offered.

In the Advanced text field, type
in the regular expression as required.

Advanced settings

Generate FSLang File

Select this check box to generate the FSLang file corresponding to
your Job and click the three-dot button next to the FSLang File Name field to specify its
path and its name.

 

Assign FileScale Path

Select this check box and then click the three-dot button next to
the FileScale Path field to select
the FileScale program executable file required to execute the
component.

 

Specify Number of Process Child

Select this check box and enter the number of child processes to
use for carrying out the aggregation.

 

Sort results

Select this check box to sort the results.

 

Custom FileScale Parameter (separated
by,)

Enter the parameters for any specific operation you want to add to
the FileScale executable call.

 

tStatCatcher Statistics

Select this check box to gather the Job processing metadata at a
Job level as well as at each component level.

Global Variables

ERROR_MESSAGE: the error message generated by the
component when an error occurs. This is an After variable and it returns a string. This
variable functions only if the Die on error check box is
cleared, if the component has this check box.

A Flow variable functions during the execution of a component while an After variable
functions after the execution of the component.

To fill up a field or expression with a variable, press Ctrl +
Space
to access the variable list and choose the variable to use from it.

For further information about variables, see Talend Studio
User Guide.

Usage

This component handles files therefore it does not require input
and output data flows. It is used to filter rows in large scale
files.

Limitation

Limitation is imposed by limits of physical memory and CPU
architectures. For example, total length of processed files cannot
exceed file system limit for LargeFile support (maximum value of 64
signed bits).

Related Scenarios

For related scenarios, see: Scenario: Combining filtering and sorting processes in a large scale file.


Document get from Talend https://help.talend.com
Thank you for watching.
Subscribe
Notify of
guest
0 Comments
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x