Scenario: Doing a fuzzy match on two columns and outputting the main and
rejected data (deprecated)
This scenario applies only to a subscription-based Talend Platform solution or Talend Data Fabric.
This scenario describes a five-component Job aiming at: first checking the edit distance
between the IdClient column of an input file against the data of the
reference input file, and second checking all emails by their pronunciation in the
Email column against the data of the reference input file. The outputs
of these two matching types are written in two separate files.
In this scenario, we have already stored the input schemas of the input and reference
files in the Repository. For more information about storing schema metadata in the Repository tree view, see
Talend Studio User
Guide.
Document get from Talend https://help.talend.com
Thank you for watching.
Subscribe
Login
0 Comments