Component family |
File/Management |
|
Function |
Compares two files and provides comparison data (based on a |
|
Purpose |
Helps control the data quality of files being |
|
Basic settings |
Schema and Edit |
A schema is a row description that defines the number of fields to be processed. The schema of this component is read-only. |
|
File to compare |
Filepath to the file to be checked. |
|
Reference file |
Filepath to the file that the comparison is based on. |
|
If differences are detected, display and |
Type in a message to be displayed in the Run console based on the result of the comparison. |
|
Print to console |
Select this check box to display the message. |
Advanced settings |
Encoding |
Select the encoding from the list or select Custom and define it manually. This field is |
|
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at a |
Usage |
This component can be used as a standalone component but it is |
|
Global Variables |
DIFFERENCE: the result of the comparison. This is a Flow
ERROR_MESSAGE: the error message generated by the
A Flow variable functions during the execution of a component while an After variable
To fill up a field or expression with a variable, press Ctrl +
For further information about variables, see Talend Studio |
|
Connections |
Outgoing links (from this component to another): Row: Main. Trigger: On Subjob Ok; On Subjob
Incoming links (from one component to this one): Row: Main; Reject; Trigger: Run if; On Subjob Ok; On
For further information regarding connections, see |
|
Log4j |
The activity of this component can be logged using the log4j feature. For more information on this feature, see Talend Studio User
For more information on the log4j logging levels, see the Apache documentation at http://logging.apache.org/log4j/1.2/apidocs/org/apache/log4j/Level.html. |
|
Limitation |
n/a |
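As a rough illustration of how the DIFFERENCE variable might be read from a later component, the sketch below simulates the globalMap with a plain HashMap so that it can run outside the Studio. The key tFileCompare_1_DIFFERENCE is an assumption modelled on the tFileUnarchive_1_CURRENT_FILEPATH pattern used in this document, and the class and method names are hypothetical. The sketch only reports the raw value, because this page does not state whether true means that the files differ or that they are identical.

```java
import java.util.HashMap;
import java.util.Map;

public class GlobalVariableSketch {

    // Assumed key, following the <component>_<VARIABLE> naming pattern.
    static final String KEY = "tFileCompare_1_DIFFERENCE";

    // Returns a printable description of the variable without interpreting it.
    static String describe(Map<String, Object> globalMap) {
        Object raw = globalMap.get(KEY);
        if (raw == null) {
            // The component has not run yet, or the key name is wrong.
            return KEY + " is not set";
        }
        return KEY + " = " + raw;
    }

    public static void main(String[] args) {
        // Stand-in for the globalMap that a generated Job provides.
        Map<String, Object> globalMap = new HashMap<>();
        System.out.println(describe(globalMap));

        // Simulate the component storing its comparison result.
        globalMap.put(KEY, Boolean.TRUE);
        System.out.println(describe(globalMap));
    }
}
```

Inside a real Job, the HashMap would be replaced by the globalMap that the generated code already exposes, and the value should be checked against a pair of known identical files before relying on its meaning.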
This scenario describes a Job that unarchives a file and compares it to a reference
file to make sure it has not changed. The output of the comparison is stored in a
delimited file and a message is displayed in the console.
- Drag and drop the following components: tFileUnarchive, tFileCompare,
  and tFileOutputDelimited.
- Link the tFileUnarchive to the tFileCompare with an Iterate connection.
- Connect the tFileCompare to the output component using a Main row link.
- In the tFileUnarchive component Basic settings, fill in the path to the
  archive to unzip.
- In the Extraction Directory field, fill in the destination folder for the
  unarchived file.
- In the tFileCompare Basic settings, set the File to compare. Press
  Ctrl+Space to display the list of global variables. Select
  $_globals{tFileUnarchive_1}{CURRENT_FILEPATH} or
  ((String)globalMap.get("tFileUnarchive_1_CURRENT_FILEPATH")),
  according to the language you work with, to fetch the file path from the
  tFileUnarchive component.
- Set the Reference file on which to base the comparison.
- In the message fields, set the messages you want to see if the files differ
  or if the files are identical, for example: "[job " + JobName + "]
  Files differ".
- Select the Print to Console check box so that the defined message is
  displayed at the end of the execution.
- The schema is read-only and contains standard information data. Click
  Edit schema to have a look at it.
- Then set the output component as usual, with semicolons as data
  separators.
- Save your Job and press F6 to run it.
The defined message is displayed in the console and the output file contains the
schema information data.
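The scenario can be approximated outside the Studio with a short, standalone Java sketch. It is not the code that tFileCompare generates. It only illustrates the idea of comparing a file against a reference, printing a message, and writing one semicolon-delimited row. The class name, the method names, the byte-for-byte comparison strategy, and the four output columns are assumptions made for illustration. The component's actual read-only schema is not listed on this page.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

public class FileCompareSketch {

    // Returns true when the two files do not contain identical bytes.
    // Both files are loaded fully into memory, so this suits small files only.
    static boolean filesDiffer(Path fileToCompare, Path referenceFile) throws IOException {
        byte[] checked = Files.readAllBytes(fileToCompare);
        byte[] reference = Files.readAllBytes(referenceFile);
        return !Arrays.equals(checked, reference);
    }

    // Builds one semicolon-delimited row; the column choice is hypothetical.
    static String toRow(Path fileToCompare, Path referenceFile, boolean differ, String message) {
        return String.join(";", fileToCompare.toString(), referenceFile.toString(),
                String.valueOf(differ), message);
    }

    public static void main(String[] args) throws IOException {
        // Temporary files stand in for the unarchived file and the reference file.
        Path checked = Files.createTempFile("checked", ".txt");
        Path reference = Files.createTempFile("reference", ".txt");
        Files.write(checked, "id;name\n1;a\n".getBytes(StandardCharsets.UTF_8));
        Files.write(reference, "id;name\n1;b\n".getBytes(StandardCharsets.UTF_8));

        boolean differ = filesDiffer(checked, reference);
        String message = "[job demo] " + (differ ? "Files differ" : "Files are identical");
        // Plays the role of the Print to console option.
        System.out.println(message);

        // Plays the role of the delimited output component.
        Path output = Files.createTempFile("comparison", ".csv");
        String row = toRow(checked, reference, differ, message) + "\n";
        Files.write(output, row.getBytes(StandardCharsets.UTF_8));
    }
}
```

Reading both files completely keeps the sketch short. For large archives, a streamed comparison that stops at the first differing byte would be a more reasonable choice.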