Ongil Fuzzy Match – Talend Custom Components

Ongil Fuzzy Match
Name: Ongil Fuzzy Match
Icon: Ongil Fuzzy Match
Author: ongilpvtltd
Resources:
Download:
Install Instructions:
Example: Coming soon…
Features:

To illustrate, consider a data integration job in which contact details from multiple sources have been combined. The ‘Address’ field in the integrated file could contain


“TB ELECTRONICS (ENTERPRISE 1)PTE. LTD.1 KONG MO KIO ELECTRONICS PARK ROAD#01-01 TB ENGINEERING HUB567710 SINGAPORE ”
and
“TB Electronics (Enterprise 1) PTE L#01-01 TB Engineering Hub1 Kong Mo Kio Electronics Park Road56771 SINGAPORE”

in two different records. It is clear that these two entries point to the same address, but how can this be detected?
The OngilSmartFuzzyMatch component will identify these two entries as possible duplicates and group the corresponding records under one cluster. It also provides a similarity score showing how similar the values are, with 1 indicating an exact match and 0 indicating no similarity. Multiple fields can be selected to identify similar rows. For instance, in addition to the ‘address’ field, a ‘first name’ field can be added, and OngilSmartFuzzyMatch will consider the values in both fields when identifying similar rows.
The results are presented in two files: “clean.xlsx”, which contains all unique records (records that have no similarity with other records), and “duplicates.xlsx”, which contains all similar/duplicate records grouped into clusters (referenced by cluster_id). All records in one cluster are similar to each other.
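The behaviour described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the component's implementation: the source does not say which matching algorithm OngilSmartFuzzyMatch uses, so the sketch substitutes `difflib.SequenceMatcher` from the Python standard library. The helper names (`normalize`, `similarity`, `row_similarity`, `cluster_rows`), the averaging of per-field scores, the 0.6 threshold, and the first names in the sample rows are all assumptions made for the example; the two long addresses are the ones quoted above.

```python
from difflib import SequenceMatcher


def normalize(value):
    """Lowercase and collapse whitespace so formatting differences matter less."""
    return " ".join(value.lower().split())


def similarity(a, b):
    """Score in [0, 1]: 1 for identical normalized strings, 0 for no overlap."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def row_similarity(row_a, row_b, fields):
    """Combine per-field scores by averaging (an assumed combination rule)."""
    scores = [similarity(row_a[f], row_b[f]) for f in fields]
    return sum(scores) / len(scores)


def cluster_rows(rows, fields, threshold=0.6):
    """Split rows into unique records and duplicate records tagged with cluster_id."""
    parent = list(range(len(rows)))  # union-find forest over row indices

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    # Link every pair of rows whose combined score reaches the threshold.
    for i in range(len(rows)):
        for j in range(i + 1, len(rows)):
            if row_similarity(rows[i], rows[j], fields) >= threshold:
                parent[find(j)] = find(i)

    groups = {}
    for i in range(len(rows)):
        groups.setdefault(find(i), []).append(i)

    clean, duplicates, cluster_id = [], [], 0
    for members in groups.values():
        if len(members) == 1:
            clean.append(rows[members[0]])
        else:
            cluster_id += 1
            for i in members:
                duplicates.append({**rows[i], "cluster_id": cluster_id})
    return clean, duplicates


# Sample data: the addresses come from the text above; the names are made up.
rows = [
    {"first_name": "Wei Ling",
     "address": "TB ELECTRONICS (ENTERPRISE 1)PTE. LTD.1 KONG MO KIO ELECTRONICS "
                "PARK ROAD#01-01 TB ENGINEERING HUB567710 SINGAPORE "},
    {"first_name": "Wei Ling",
     "address": "TB Electronics (Enterprise 1) PTE L#01-01 TB Engineering Hub1 "
                "Kong Mo Kio Electronics Park Road56771 SINGAPORE"},
    {"first_name": "Oliver", "address": "12 Baker Street London"},
]

clean, duplicates = cluster_rows(rows, ["first_name", "address"])
print("clean:", clean)
print("duplicates:", duplicates)
```

Here `clean` plays the role of “clean.xlsx” and `duplicates` the role of “duplicates.xlsx”. Note one simplification: the sketch links rows transitively, so two rows can share a cluster through a common neighbour even if their own pairwise score is below the threshold, whereas the component states that all records in a cluster are similar to each other.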

Overview:

Ongil Fuzzy Match finds duplicates in your data by checking multiple columns and combining the results, so it detects duplicate rows rather than just duplicate column values.

Ongil Fuzzy Match sample

[Three screenshots of Ongil Fuzzy Match v1.0.0; images not included]

Release Notes:

Release version: 1.0.0 – 2019-08-12 06:04:43

Compatible:


Document obtained from Talend Exchange