tFSFilterRows

Warning

This component is available in the Palette of the Studio only if you have
subscribed to the relevant edition of one of the Talend solutions
with Big Data.

tFSFilterRows Properties

Component family	FileScale	Note that this component is deprecated.
Function	tFSFilterRows enables you to define a row filter on one table. This component has real-time capabilities for filtering large-scale files. To optimize performance, the component usually sorts data before processing it.
Purpose	Helps you parameterize row filters for large-scale source data against set conditions.
Basic settings	Schema type and Edit Schema	A schema is a row description. It defines the number of fields to be processed and passed on to the next component. The schema is either Built-in or stored remotely in the Repository. Click Edit schema to make changes to the schema. If the current schema is of the Repository type, three options are available: View schema: choose this option to view the schema only. Change to built-in property: choose this option to change the schema to Built-in for local changes. Update repository connection: choose this option to change the schema stored in the repository and decide whether to propagate the changes to all the Jobs upon completion. If you only want to propagate the changes to the current Job, select No upon completion and choose this schema metadata again in the [Repository Content] window.
		Repository: You have already created the schema and stored it in the Repository. You can reuse it in various projects and job flowcharts. Related topic: see Talend Studio User Guide.
		Built-in: You create and store the schema locally for this component only. Related topic: see Talend Studio User Guide.
	Property type	Either Built-in or Repository. Since version 5.6, both the Built-In mode and the Repository mode are available in any of the Talend solutions.
		Built-in: No property data stored centrally.
		Repository: Select the repository file where Properties are stored. The fields that follow are pre-filled in using the fetched data.
	Input File Name	Name of the file holding the data you want to filter.
	Output File Name	Name of the file where you want to write the filtered data.
	Record separator (char)	Character, string or regular expression to separate records (lines).
	Field separator (char)	Character, string or regular expression to separate fields in a record.
	Header	Number of records to be skipped at the beginning of the file.
	Footer	Number of records to be skipped at the end of the file.
	Logical operator used to combine conditions	If you want to combine simple filtering with advanced mode, select the operator used to combine both modes.
	Conditions	Click the plus button to add as many conditions as needed. The conditions are evaluated one after the other for each row. Schema column: Click in the cell and select the schema column to which the function applies. Function: Click in the cell and select the function from the list. Operator: Click in the cell and select the operator to bind the schema column with the value. Value: Type in the value to filter on, between quotes if need be.
	Use advanced mode	Select this check box when the operation you want to perform cannot be carried out through the standard functions offered. In the Advanced text field, type in the regular expression as required.
Advanced settings	Generate FSLang File	Select this check box to generate the FSLang file corresponding to your Job and click the three-dot button next to the FSLang File Name field to specify its path and its name.
	Assign FileScale Path	Select this check box and then click the three-dot button next to the FileScale Path field to select the FileScale program executable file required to execute the component.
	Specify Number of Process Child	Select this check box and enter the number of child processes to use for carrying out the filtering.
	Sort results	Select this check box to sort the results.
	Custom FileScale Parameter (separated by,)	Enter the parameters for any specific operation you want to add to the FileScale executable call.
	tStatCatcher Statistics	Select this check box to gather the Job processing metadata at a Job level as well as at each component level.
Global Variables	ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. This variable functions only if the Die on error check box is cleared, if the component has this check box. A Flow variable functions during the execution of a component while an After variable functions after the execution of the component. To fill up a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it. For further information about variables, see Talend Studio User Guide.
Usage	This component handles files directly; therefore, it does not require input and output data flows. It is used to filter rows in large-scale files.
Limitation	Limitations are imposed by physical memory and CPU architecture. For example, the total length of processed files cannot exceed the file system limit for LargeFile support (the maximum value of a signed 64-bit integer).
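
The Header, Footer, separator, Conditions and advanced-mode settings in the table above can be illustrated with a small sketch. It is not the FileScale implementation and it does not use FSLang; it is plain Python written only to show how the settings might interact. The function name filter_rows and all its parameter names are hypothetical. The sketch also assumes that separators are literal strings, that the simple conditions are all required to hold (AND), and that the logical operator combines the simple result with the advanced regular expression.

```python
import re

def filter_rows(text, record_sep="\n", field_sep=";", header=0, footer=0,
                conditions=(), advanced=None, combine="and"):
    """Return the records of `text` that pass the filter (illustrative only)."""
    # Split into records and drop empty ones (e.g. after a trailing separator).
    records = [r for r in text.split(record_sep) if r != ""]
    # Skip `header` records at the start and `footer` records at the end.
    end = max(header, len(records) - footer)
    kept = []
    for record in records[header:end]:
        fields = record.split(field_sep)
        # Assumption: every simple condition must hold for the row.
        simple_ok = all(cond(fields) for cond in conditions)
        if advanced is None:
            keep = simple_ok
        else:
            # Advanced mode: a regular expression tested against the record.
            advanced_ok = re.search(advanced, record) is not None
            if combine == "and":
                keep = simple_ok and advanced_ok
            else:
                keep = simple_ok or advanced_ok
        if keep:
            kept.append(record)
    return kept

sample = "id;name;age\n1;Ann;34\n2;Bob;17\n3;Cid;52\nTOTAL;3;\n"
adults = filter_rows(
    sample,
    header=1,                                  # skip the column-name record
    footer=1,                                  # skip the TOTAL record
    conditions=[lambda f: int(f[2]) >= 18],    # simple condition on column 3
    advanced=r"^\d+;[A-C]",                    # advanced-mode expression
    combine="and",
)
print(adults)
```

With combine="and", a row is kept only when both the simple condition and the regular expression match; with "or", either one is enough. The real component works on files named in Input File Name and Output File Name rather than on in-memory strings.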

Related Scenarios

Source: Talend documentation, https://help.talend.com
