tPersonator
Verifying, moving and appending contact data.
This component is the result of Talend's collaboration with Melissa Data, one of the world leaders in global address
validation.
For more information about Melissa Data
and its software tools, visit the Melissa Data website.
Using the Personator™ Consumer Web Service, the tPersonator component verifies, corrects and adds data to
enrich your database. You can use several tPersonator components to perform one or more
actions.
The component in this framework is available in Talend Data Management Platform, Talend Big Data Platform, Talend Real Time Big Data Platform, Talend Data Services Platform, Talend MDM Platform and in Talend Data Fabric.
Setting up the Genderization Policy parameter of tPersonator
The behavior of the Genderization Policy
parameter, also known as gender aggression, depends on the Gender Population parameter. The following table shows the results
of the tPersonator component according to the
options that you set:
Gender Population | Genderization Policy | Male (Always) | Male (Often) | Male (Commonly) | Neutral (Always) | Female (Commonly) | Female (Often) | Female (Always) |
---|---|---|---|---|---|---|---|---|
Mixed | Neutral | M | M | N | N | N | F | F |
Mixed | Conservative | M | N | N | N | N | N | F |
Mixed | Aggressive | M | M | M | N | F | F | F |
Male | Neutral | M | M | M | N | N | F | F |
Male | Conservative | M | M | N | N | N | N | F |
Male | Aggressive | M | M | M | M | N | F | F |
Female | Neutral | M | M | N | N | F | F | F |
Female | Conservative | M | N | N | N | N | F | F |
Female | Aggressive | M | M | N | F | F | F | F |
The column headings give the gender of the first name. In the cells, M stands for male, N for neutral and F for female.
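To make the table above easier to check against your own settings, here is a minimal Java sketch that encodes it as a lookup. It is an illustration only and not part of the tPersonator component or the Melissa Data API: the class name, the method name and the column keys such as FEMALE_COMMONLY are hypothetical.

```java
import java.util.List;
import java.util.Map;

// Illustrative lookup of the genderization table; not a tPersonator API.
class GenderizationTable {

    // Column order of the table, from "always male" to "always female".
    // These keys are hypothetical names chosen for this sketch.
    static final List<String> FIRST_NAME_GENDERS = List.of(
        "MALE_ALWAYS", "MALE_OFTEN", "MALE_COMMONLY", "NEUTRAL_ALWAYS",
        "FEMALE_COMMONLY", "FEMALE_OFTEN", "FEMALE_ALWAYS");

    // Key: "<Gender Population>/<Genderization Policy>".
    // Value: one result letter (M, N or F) per column of the table.
    static final Map<String, String> ROWS = Map.of(
        "Mixed/Neutral", "MMNNNFF",
        "Mixed/Conservative", "MNNNNNF",
        "Mixed/Aggressive", "MMMNFFF",
        "Male/Neutral", "MMMNNFF",
        "Male/Conservative", "MMNNNNF",
        "Male/Aggressive", "MMMMNFF",
        "Female/Neutral", "MMNNFFF",
        "Female/Conservative", "MNNNNFF",
        "Female/Aggressive", "MMNFFFF");

    // Returns 'M', 'N' or 'F' for the given combination of options.
    static char resolve(String population, String policy, String firstNameGender) {
        String row = ROWS.get(population + "/" + policy);
        int column = FIRST_NAME_GENDERS.indexOf(firstNameGender);
        if (row == null || column < 0) {
            throw new IllegalArgumentException("Unknown combination of options");
        }
        return row.charAt(column);
    }

    public static void main(String[] args) {
        // A first name that is commonly female, in a mixed population.
        System.out.println(resolve("Mixed", "Aggressive", "FEMALE_COMMONLY"));   // prints F
        System.out.println(resolve("Mixed", "Conservative", "FEMALE_COMMONLY")); // prints N
    }
}
```

The two calls in main show the effect of the policy: with the same population and first name, the Aggressive policy assigns a gender where the Conservative policy stays neutral.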
tPersonator output schema
The tPersonator component generates
read-only columns that are all of type String.
Here is the list of the output columns in the original order:
- md_RecordID
- md_AddressDeliveryInstallation
- md_AddressExtras
- md_AddressKey
- md_AddressLine1
- md_AddressLine2
- md_AddressLockBox
- md_AddressPostDirection
- md_AddressPreDirection
- md_AddressPrivateMailboxName
- md_AddressPrivateMailboxRange
- md_AddressRouteService
- md_AddressStreetName
- md_AddressStreetSuffix
- md_AddressSuiteName
- md_AddressSuiteNumber
- md_AddressTypeCode
- md_AreaCode
- md_CBSACode
- md_CBSADivisionCode
- md_CBSADivisionLevel
- md_CBSADivisionTitle
- md_CBSALevel
- md_CBSATitle
- md_CarrierRoute
- md_CensusBlock
- md_CensusKey
- md_CensusTract
- md_City
- md_CityAbbreviation
- md_CompanyName
- md_CongressionalDistrict
- md_CountryCode
- md_CountryName
- md_CountyFIPS
- md_CountyName
- md_CountySubdivisionCode
- md_CountySubdivisionName
- md_DateOfBirth
- md_DateOfDeath
- md_DeliveryIndicator
- md_DeliveryPointCheckDigit
- md_DeliveryPointCode
- md_DemographicsGender
- md_DemographicsResults
- md_DomainName
- md_ElementarySchoolDistrictCode
- md_ElementarySchoolDistrictName
- md_EmailAddress
- md_Gender
- md_Gender2
- md_HouseholdIncome
- md_Latitude
- md_LengthOfResidence
- md_Longitude
- md_MailboxName
- md_MaritalStatus
- md_MelissaAddressKey
- md_NameFirst
- md_NameFirst2
- md_NameFull
- md_NameLast
- md_NameLast2
- md_NameMiddle
- md_NameMiddle2
- md_NamePrefix
- md_NamePrefix2
- md_NameSuffix
- md_NameSuffix2
- md_NewAreaCode
- md_Occupation
- md_OwnRent
- md_PhoneCountryCode
- md_PhoneCountryName
- md_PhoneExtension
- md_PhoneNumber
- md_PhonePrefix
- md_PhoneSuffix
- md_PlaceCode
- md_Plus4
- md_PostalCode
- md_PresenceOfChildren
- md_PrivateMailBox
- md_RecordExtras
- md_Reserved
- md_Salutation
- md_SecondarySchoolDistrictCode
- md_SecondarySchoolDistrictName
- md_StateDistrictLower
- md_StateDistrictUpper
- md_StateName
- md_Suite
- md_TopLevelDomain
- md_UTC
- md_UnifiedSchoolDistrictCode
- md_UnifiedSchoolDistrictName
- md_UrbanizationName
- md_Results
tPersonator Standard properties
These properties are used to configure tPersonator
running in the Standard Job framework.
The standard tPersonator component belongs
to the Data Quality family.
Basic Settings
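Schema and Edit schema |
A schema is a row description. It defines the number of fields (columns) to be processed and passed on to the next component.
Click Sync columns to retrieve the schema from the previous component connected in the Job.
Select the Schema type: Built-In or Repository.
Click Edit schema to make changes to the schema. If the current schema is of the Repository type, the following options are available:
View schema: choose this option to view the schema only.
Change to built-in property: choose this option to change the schema to Built-in for local changes.
Update repository connection: choose this option to change the schema stored in the repository.
The output schema contains read-only columns. For more information, see the list of the output columns in the tPersonator output schema section. |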
Input mapping | Associate the Personator field with the Input column. |
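Actions | Select the actions to perform: Check Action, Verify Action, Move Action and Append Action.
Depending on the action, some inputs are mandatory. See the table of mandatory inputs. |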
Centric Hint | Available if you select Append Action or Verify Action. Select the type of reference data, for example Address. |
Append Options | Available if you select Append Action.
Select one append option, for example Always. |
Address Options | Diacritics: auto, on or off. Set to on to return French characters. If set to auto, these characters are returned only if they are present in your database.
Advanced Address Correction and Use Preferred City: select these check boxes to enable the corresponding options. |
Name Options |
Set the following options:
Name Hint
Middle Name Logic
Salutation Format
Gender Population: Mixed, Male or Female.
Genderization Policy: Neutral, Conservative or Aggressive.
Correct First Name
Standardize Company |
Email Options |
|
Address Output Groups |
Basic (Default): Select to return the basic address.
Address Details: Select to return the detailed address information.
Plus4: Select to return the +4 code.
PrivateMailBox: Select to return the private mailbox.
Suite: Select to return the suite.
Parsed Address: Select to return the parsed address. |
Geographic Output Groups |
Census: Select to return census information.
Census2: Select to return more census information.
GeoCode: Select to return the geocode information, such as the latitude and longitude. |
Other Output Groups | Demographic Basic: Select to return a comma-delimited string containing all the demographics results.
Name: Select to return the name information.
Parsed Email: Select to return the parsed email address.
Parsed Phone: Select to return the parsed phone number. |
Action | Mandatory inputs |
---|---|
Check | The database must contain at least one of the following:
|
Verify | The database must contain at least two of the following:
Note: If the database contains only names and company names, you cannot perform the verify action because the results would not be accurate enough. |
Move | The database must contain at least one of the following:
|
Append | The mandatory inputs depend on the data to append.
To append a name or company name, the database must contain at least one of the following:
To append an address, the database must contain at least one of the following:
To append a phone number, the database must contain at least one of the following:
To append an email, the database must contain at least one of the following:
|
Advanced Settings
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at the Job level as well as at each component level. |
License Key |
To enter a license key, click the […] button next to the field. |
Number of Retries |
Define the number of retries before the Job fails. |
Timeout in Seconds |
Define the timeout period, in seconds. |
Cache Directory |
Browse to the cache directory. |
Batch Request Size (1-100) |
Define the number of messages to be delivered in each batch. |
Multi-Threading | Select to use more than one thread in the same Job to handle the responses from the Melissa Data service. |
Thread Count (1-10) | Define the maximum number of threads. |
Show Debug Console Output |
Select to show the debug console output. |
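To illustrate what the Batch Request Size setting means in practice, the following minimal Java sketch splits a list of records into batches of a given size. It is an assumption-based illustration, not the component's actual implementation: the class name BatchSketch and the method toBatches are hypothetical, and only the 1-100 range is taken from the setting above.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: shows how records could be grouped into batches.
class BatchSketch {

    // Splits the records into consecutive batches of at most batchSize items.
    // The 1-100 range mirrors the "Batch Request Size (1-100)" setting.
    static <T> List<List<T>> toBatches(List<T> records, int batchSize) {
        if (batchSize < 1 || batchSize > 100) {
            throw new IllegalArgumentException("Batch size must be between 1 and 100");
        }
        List<List<T>> batches = new ArrayList<>();
        for (int start = 0; start < records.size(); start += batchSize) {
            int end = Math.min(start + batchSize, records.size());
            // Copy the sub-list so that each batch is independent of the input list.
            batches.add(new ArrayList<>(records.subList(start, end)));
        }
        return batches;
    }

    public static void main(String[] args) {
        List<String> records = List.of("r1", "r2", "r3", "r4", "r5");
        // With a batch size of 2, five records give three batches.
        System.out.println(toBatches(records, 2)); // prints [[r1, r2], [r3, r4], [r5]]
    }
}
```

A smaller batch size means more requests for the same number of records, while a larger one sends more records in each request.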
Scenario: Verifying and enriching a database
The Job in this scenario uses the tPersonator component to ensure the
quality of a customer database and to enrich this
database.
The input file contains the following data:
- Company name
- Address
- State
- Zip code
- Full name
- Phone number
- Date of company foundation
- Date of birth
Setting up the Job
- Drop the following components from the Palette onto the design workspace: tFileInputDelimited, tPersonator and tLogRow.
- Connect the components together using the Main links.

Configuring the tFileInputDelimited component
- Double-click tFileInputDelimited to open its Basic settings view.
- Select Built-in as Property Type and Schema.
- Click […] next to Edit schema.
- Click the [+] button to add the columns.
- In File name/Stream, browse to the input file.
Configuring the tPersonator component
- Double-click the tPersonator component to open its Basic settings view.
- Select Built-in as Schema.
- To check the schema, click the […] button next to Edit schema.
- In Input Mapping, click the [+] button to associate the tPersonator fields with the input data:
  - CompanyName with Company
  - AddressLine1 with Address
  - State with State
  - PostalCode/ZipCode with Zip
  - FullName with FullName
  - PhoneNumber with PhoneNumber
- Select Check Action, Append Action, Verify Action and Move Action.
- In Centric Hint, select Address.
- In Append Options, select Always.
- In Address Options:
  - Diacritics: Select auto.
  - Select the Advanced Address Correction and Use Preferred City check boxes.
- In Name Options, select the following options:
  - Name Hint: Varying
  - Middle Name Logic: Parse Logic
  - Salutation Format: Formal
  - Gender Population: Mixed
  - Genderization Policy: Aggressive
  - Correct First Name and Standardize Company check boxes
- Select all the check boxes in Email Options, Address Output Groups, Geographic Output Groups and Other Output Groups.
- Click the Advanced settings tab:
  - Clear the tStatCatcher Statistics check box.
  - Enter a License Key.
  - Enter the Number of Retries: 5.
  - Enter the Timeout in Seconds: 100.
  - Browse to the Cache Directory.
  - Enter the Batch Request Size (1-100): 2.
  - Select the Multi-Threading check box.
  - Enter the Thread Count: 3.
  - Select the Show Debug Console Output check box.
Configuring the tLogRow component
- Double-click the tLogRow component to open its Basic settings view.
- In the Mode area, select Table (print values in cells of a table).
Saving and executing the Job
Save the Job, then press F6 to execute the Job.
The database is enriched and the results are displayed on the
console. Here is an example of the output columns that have been enriched.

As you can see, result codes are indicated in the last column, md_Results. For more information on these result codes, see
the result code description table.
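If you process the output further in Java, you may want to split the content of md_Results into individual codes. The following minimal sketch assumes that the column holds a comma-separated list of codes. This delimiter, the class name ResultCodes and the sample value are assumptions made for illustration, so check them against your own output.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Illustrative helper; assumes md_Results is a comma-separated list of codes.
class ResultCodes {

    // Splits the raw md_Results value into trimmed, non-empty codes.
    static List<String> parse(String mdResults) {
        if (mdResults == null || mdResults.trim().isEmpty()) {
            return List.of();
        }
        return Arrays.stream(mdResults.split(","))
            .map(String::trim)
            .filter(code -> !code.isEmpty())
            .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Sample value made up for this sketch.
        System.out.println(parse("AS01, NS01,")); // prints [AS01, NS01]
    }
}
```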