Creating the Job
-
Drop the following components from the Palette onto the
design workspace: tFileInputDelimited,
tMatchPairing, tLogRow and
tFileOutputDelimited (x2).
-
Connect tFileInputDelimited to
tMatchPairing using the Main link.tFileInputDelimited reads the source file and sends
data to the next component. -
Connect tMatchPairing to the output file components
using the Pairs and Unique rows
links, and to tLogRow using the Exact
duplicates link.tMatchPairing pre-analyzes the data, computes pairs
of suspect duplicates, unique rows and exact duplicates and generates a pairing
model to be used with tMatchPredict
Document get from Talend https://help.talend.com
Thank you for watching.
Subscribe
Login
0 Comments