tPigFilterColumns
Selects data or filters out data from a relation based on defined filter
conditions.
The tPigFilterColumns selects one or
more columns from a relation based on defined condition.
tPigFilterColumns Standard properties
These properties are used to configure tPigFilterColumns running in the Standard Job framework.
The Standard
tPigFilterColumns component belongs to the Big Data and the Processing families.
The component in this framework is available when you are using one of the Talend solutions with Big Data.
Basic settings
|
Schema and Edit |
A schema is a row description. It defines the number of fields (columns) to Click Edit schema to make changes to the schema.
|
|
|
Built-in: The schema will be |
|
|
Repository: The schema already |
Advanced settings
|
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at the |
Global Variables
|
Global Variables |
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see |
Usage
|
Usage rule |
This component is commonly used as intermediate step together with |
|
Prerequisites |
The Hadoop distribution must be properly installed, so as to guarantee the interaction
For further information about how to install a Hadoop distribution, see the manuals |
|
Limitation |
Knowledge of Pig script is required. |
Related Scenario
For a tPigFilterColumns related scenario, see Scenario: Joining two files based on an exact match and saving the result to a local file of tPigJoin.