July 30, 2023

tFileCompare – Docs for ESB 7.x

tFileCompare

Compares two files and provides comparison data based on a read-only
schema.

tFileCompare Standard properties

These properties are used to configure tFileCompare running in the Standard Job framework.

The Standard tFileCompare component belongs to the File family.

The component in this framework is available in all Talend products.

Basic settings

Schema and Edit schema

A schema is a row description. It defines the number of fields to be processed and
passed on to the next component.

The schema of this component is read-only.

File to compare

Filepath to the file to be checked.

Warning: Use absolute path (instead of relative path) for
this field to avoid possible errors.

Reference file

Filepath to the reference file that the comparison is based on.

Warning: Use absolute path (instead of relative path) for
this field to avoid possible errors.

If differences are detected, display and If no difference detected, display

Type in a message to be displayed in the Run console based on the result of the
comparison.

Print to console

Select this check box to display the message.
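
The settings above can be pictured with a small plain-Java sketch. It is only an illustration of the idea, not the code that the component generates: the class name FileCompareSketch, the methods filesDiffer and messageFor, and the line-by-line comparison strategy are all assumptions made for this example. The encoding parameter plays the role of the Encoding field described under Advanced settings.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Objects;

// Hypothetical helper class; the name and methods are not part of Talend.
class FileCompareSketch {

    /** Returns true when the two files differ, reading both line by line. */
    static boolean filesDiffer(Path fileToCompare, Path referenceFile, Charset encoding)
            throws IOException {
        try (BufferedReader a = Files.newBufferedReader(fileToCompare, encoding);
             BufferedReader b = Files.newBufferedReader(referenceFile, encoding)) {
            String lineA;
            String lineB;
            do {
                lineA = a.readLine();
                lineB = b.readLine();
                if (!Objects.equals(lineA, lineB)) {
                    return true; // mismatching line, or one file ended before the other
                }
            } while (lineA != null);
            return false; // both files ended together without a mismatch
        }
    }

    /** Picks the console message that matches the comparison result. */
    static String messageFor(boolean difference, String ifDifferent, String ifIdentical) {
        return difference ? ifDifferent : ifIdentical;
    }

    public static void main(String[] args) throws IOException {
        // Temporary files stand in for the File to compare and Reference file paths.
        Path file = Files.createTempFile("compare", ".txt");
        Path reference = Files.createTempFile("reference", ".txt");
        Files.write(file, "a;b\n".getBytes(StandardCharsets.UTF_8));
        Files.write(reference, "a;b\n".getBytes(StandardCharsets.UTF_8));

        boolean difference = filesDiffer(file, reference, StandardCharsets.UTF_8);
        System.out.println(messageFor(difference, "Files differ", "Files are identical"));
    }
}
```

Because this sketch reads text lines, it treats files that differ only in their line endings as identical. A byte-level comparison would be stricter.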

Advanced settings

Encoding

Select the encoding from the list or select Custom and define it manually. This field is
compulsory for DB data handling.

tStatCatcher Statistics

Select this check box to gather the Job processing metadata at a
Job level as well as at each component level.

Global Variables

DIFFERENCE: the result of the comparison. This is a Flow
variable and it returns a boolean.

ERROR_MESSAGE: the error message generated by the
component when an error occurs. This is an After variable and it returns a string. This
variable functions only if the Die on error check box is
cleared, if the component has this check box.

A Flow variable functions during the execution of a component while an After variable
functions after the execution of the component.

To fill in a field or expression with a variable, press Ctrl + Space
to access the variable list and choose the variable to use from it.

For further information about variables, see Talend Studio User Guide.
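
As a rough illustration of how these variables might be read in Java, the sketch below uses an ordinary HashMap as a stand-in for the globalMap that a Job provides. The key names tFileCompare_1_DIFFERENCE and tFileCompare_1_ERROR_MESSAGE are assumptions: they follow the component-name-plus-variable pattern of the tFileUnarchive_1_CURRENT_FILEPATH key used in the scenario on this page, and tFileCompare_1 is an assumed component name. The class GlobalVariableSketch and its helper methods are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper class; inside a Job you would read globalMap directly.
class GlobalVariableSketch {

    /** Reads a Boolean entry, treating a missing entry as false. */
    static boolean readFlag(Map<String, Object> globalMap, String key) {
        return Boolean.TRUE.equals(globalMap.get(key));
    }

    /** Reads a String entry; returns null when the entry is absent. */
    static String readText(Map<String, Object> globalMap, String key) {
        return (String) globalMap.get(key);
    }

    public static void main(String[] args) {
        // Stand-in for the globalMap available inside a Job (assumed key names).
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tFileCompare_1_DIFFERENCE", Boolean.TRUE);

        boolean difference = readFlag(globalMap, "tFileCompare_1_DIFFERENCE");
        String error = readText(globalMap, "tFileCompare_1_ERROR_MESSAGE");
        System.out.println("difference=" + difference + ", error=" + error);
    }
}
```

Checking with Boolean.TRUE.equals avoids a NullPointerException when the variable has not been set yet, for example when an After variable is read before the component has finished.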

Usage

Usage rule

This component can be used as a standalone component, but it is
usually linked to an output component to gather the log data.

Connections

Outgoing links (from this component to another):

Row: Main.

Trigger: On Subjob Ok; On Subjob
Error; Run if; On Component Ok; On Component Error.

Incoming links (from one component to this one):

Row: Main; Reject;
Iterate.

Trigger: Run if; On Subjob Ok; On
Subjob Error; On Component Ok; On Component Error; Synchronize;
Parallelize.

For further information regarding connections, see Talend Studio User Guide.

Comparing unzipped files

This scenario describes a Job that unarchives a file and compares it to a reference file
to make sure it did not change. The output of the comparison is stored in a delimited
file and a message is displayed in the console.

tFileCompare_1.png

Procedure

  1. Drag and drop the following components: tFileUnarchive, tFileCompare,
    and tFileOutputDelimited.
  2. Link the tFileUnarchive to the tFileCompare with an Iterate connection.
  3. Connect the tFileCompare to the output
    component, using a Main row link.
  4. In the tFileUnarchive component Basic settings, fill in the path to the archive to
    unzip.
  5. In the Extraction Directory field, fill in
    the destination folder for the unarchived file.
  6. In the tFileCompare Basic settings, set the File to compare field. Press
    Ctrl+Space to display the list of global variables. Select
    $_globals{tFileUnarchive_1}{CURRENT_FILEPATH} or
    ((String)globalMap.get("tFileUnarchive_1_CURRENT_FILEPATH")),
    according to the language you work with, to fetch the file path from the
    tFileUnarchive component.

    tFileCompare_2.png

  7. Set the Reference file field to the file to base the comparison on.
  8. In the message fields, set the messages you want to see if the files differ
    or if the files are identical, for example:
    "[job " + JobName + "] Files differ".
  9. Select the Print to console check box so that the defined message is
    displayed at the end of the execution.
  10. The schema is read-only and contains standard information data. Click
    Edit schema to have a look at it.

    tFileCompare_3.png

  11. Set up the output component as usual, with a semicolon as the field
    separator.
  12. Save your Job and press F6 to run it.

    tFileCompare_4.png

    The message you set is displayed in the console and the output shows the schema
    information data.

    tFileCompare_5.png
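
For readers who want to see the data flow of this scenario outside the Studio, here is a rough plain-Java sketch of the same three stages: unarchive, compare each extracted file to the reference, and write a semicolon-delimited row. It is not what the Job generates. The class UnzipAndCompareSketch, its method names, the byte-for-byte comparison, and the four output columns (file path, reference path, difference flag, message) are all assumptions chosen for illustration; the component's actual read-only schema is the one shown by Edit schema.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.nio.file.StandardOpenOption;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical stand-in for the tFileUnarchive -> tFileCompare -> output chain.
class UnzipAndCompareSketch {

    /** Extracts every regular file of the archive into targetDir. */
    static List<Path> unarchive(Path archive, Path targetDir) throws IOException {
        List<Path> extracted = new ArrayList<>();
        Path root = targetDir.toAbsolutePath().normalize();
        try (ZipInputStream zip = new ZipInputStream(Files.newInputStream(archive))) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                Path out = root.resolve(entry.getName()).normalize();
                if (entry.isDirectory() || !out.startsWith(root)) {
                    continue; // skip folders and entries that would escape targetDir
                }
                Files.createDirectories(out.getParent());
                Files.copy(zip, out, StandardCopyOption.REPLACE_EXISTING);
                extracted.add(out);
            }
        }
        return extracted;
    }

    /** Byte-for-byte comparison; an assumed stand-in for the component's logic. */
    static boolean differs(Path file, Path reference) throws IOException {
        return !Arrays.equals(Files.readAllBytes(file), Files.readAllBytes(reference));
    }

    /** Compares each extracted file and appends one semicolon-separated row per file. */
    static void run(Path archive, Path targetDir, Path reference, Path output)
            throws IOException {
        for (Path current : unarchive(archive, targetDir)) { // mirrors the Iterate link
            boolean differ = differs(current, reference);
            String message = differ ? "Files differ" : "Files are identical";
            System.out.println(message); // mirrors Print to console
            String row = String.join(";", current.toString(), reference.toString(),
                    String.valueOf(differ), message); // hypothetical columns
            Files.write(output, Collections.singletonList(row), StandardCharsets.UTF_8,
                    StandardOpenOption.CREATE, StandardOpenOption.APPEND);
        }
    }
}
```

A call such as UnzipAndCompareSketch.run(archive, extractionDir, reference, output) processes one archive. Passing absolute paths matches the warning given for the File to compare and Reference file fields.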


Document obtained from Talend: https://help.talend.com