August 17, 2023

tMelissaDataAddress – Docs for ESB 5.x

tMelissaDataAddress

tMelissaDataAddress_icon32_white.png

Warning

This component will be available in the Palette of
Talend Studio on the condition that you have subscribed to one of
the Talend Platform products.

This address management component is the result of Talend collaboration with Melissa
Data, one of the world leaders for global address validation.

For more information about the enterprise and its software tools, visit http://www.melissadata.com/.

tMelissaDataAddress properties

Component family

Data Quality

 

Function

tMelissaDataAddress validates, corrects and
standardizes Canadian and United States addresses. It iterates on each row and reads
all input addresses against a MelissaData Data file.

tMelissaDataAddress uses the July 2012 release
of the MelissaData AddressObject library.

Purpose

tMelissaDataAddress verifies that an address is
a properly formatted address and corrects any formatting or spelling errors in each
row.

Basic settings

Schema and Edit schema

A schema is a row description, it defines the number of fields to be processed
and passed on to the next component. The schema is either Built-in or stored remotely in the Repository.

Since version 5.6, both the Built-In mode and the Repository mode are
available in any of the Talend solutions.

 

 

Built-in: You create the schema and store it
locally for this component only. Related topic: see Talend Studio User
Guide
.

 

 

Repository: You have already created the schema
and stored it in the Repository. You can reuse it in various projects and job
designs. Related topic: see Talend Studio User
Guide
.

 

Choose Company column

Select from the list the company name you want to analyze.

 

Choose Address1 column

Select from the list the first address column you want to analyze.

 

Choose Address2 column

Select from the list the second address column you want to analyze.

 

Choose City column

Select from the list the city column you want to analyze.

 

Choose State column

Select from the list the state column you want to analyze.

 

Choose Postal column

Select from the list the postal code column you want to analyze.

 

Specify your MelissaData license

Enter the MelissaData license key provided by MelissaData when you order the
Data Quality Suite or the Address Object API.

This software key unlocks the full functionality of Address Object.

For more information, visit http://www.melissadata.com/ and
download the Reference Guide for Address Object from the Support Center of
MelissaData.

 

Specify your MelissaData DataFile folder

Set the path to the MelissaData Data folder provided by MelissaData and
installed locally.

You must order and download the Data Quality Suite or the Address Object API
from http://www.melissadata.com/.
tMelissaDataAddress uses the July 2012 release of
the MelissaData AddressObject library.

Advanced settings

tStatCatcher
Statistics

Select this check box to collect log data at the component level.

Global Variables

ERROR_MESSAGE: the error message generated by the
component when an error occurs. This is an After variable and it returns a string. This
variable functions only if the Die on error check box is
cleared, if the component has this check box.

A Flow variable functions during the execution of a component while an After variable
functions after the execution of the component.

To fill up a field or expression with a variable, press Ctrl +
Space
to access the variable list and choose the variable to use from it.

For further information about variables, see Talend Studio
User Guide.

Usage

This component is an intermediary step. It requires an input flow as well as an
output.

Limitation

n/a

Scenario: Editing addresses against a MelissaData data file

This scenario describes a three-component Job that:

  • uses the tFixedFlowInput component to generate the
    address data to be analyzed,

  • uses the tMelissaDataAddress component to analyze the
    input schema and validate, correct and standardize the US addresses generated by the
    tFixedFlowInput component,

  • uses a tLogRow component to output the correct
    formatted addresses on the console.

use_case_tmelissadataaddress.png

Setting up the Job

  1. Drop the following components from the Palette onto
    the design workspace: tFixedFlowInput, tMelissaDataAddress and tLogRow.

  2. Connect the three components together using the Main links.

Configuring the input component

  1. Double-click tFixedFlowInput to open its Basic settings view in the Component tab.

    use_case_tmelissadataaddress2.png
  2. Create the schema through the Edit Schema
    button.

    use_case_tmelissadataaddress3.png

    Click the plus button to add the following columns to your input schema:
    company, address1,
    address2, city, postal
    and state. These columns are mandatory for the tMelissaDataAddress component.

  3. Click OK.

  4. In the Number of rows field, set the number of rows
    as 1.

  5. In the Mode area, select the Use Inline Content (delimited file) option, and set the row and field
    separators in the corresponding fields.

  6. In the Content table, enter the address data you
    want to analyze, for example:

Configuring the tMelissaDataAddress component

  1. Double-click tMelissaDataAddress to display the
    Basic settings view and define the component
    properties.

    use_case_tmelissadataaddress4.png
  2. Click Sync columns to retrieve the schema from the
    preceding component.

  3. Click the Edit schema button to view the input and
    output schema and do any modifications in the output schema, if necessary.

    use_case_tmelissadataaddress5.png

    In the output schema of this component there are many output standard columns that
    are read-only. These output columns return for example the standard company and city
    names, up to two street address lines, two-letter abbreviation for the state and country
    names, the postal zip code and the results codes.

  4. Click OK to close the dialog box.

  5. In each of the address detail fields, select from the list the column that holds the
    corresponding address detail; that is the company name, the first and second addresses,
    the city and state names and finally the postal code.

  6. In the Specify your MelissaData license field, set
    your license key provided by MelissaData when you order the Data Quality Suite or the
    Address Object API.

  7. In the Specify your MelissaData DataFile folder
    field, set the path to the MelissaData data folder provided by MelissaData and installed
    locally.

Setting a JVM argument and finalizing the Job

  1. Double-click the tLogRow component to display the
    Basic settings view and define the component
    properties.

  2. Click the Run tab and then in the open view click
    Advanced Settings.

    use_case_tmelissadataaddress7.png
  3. Select the Use specific JVM arguments check box and
    then click New.

  4. In the pop-up window, set the following JVM argument:
    Djava.library.path=<path/to/libmdAddrJavaWrapper.dll/folder/>.

    In this argument, you must indicate the folder where the MelissaData AddressObject
    library, called libmdAddrJavaWrapper.so on Linux or
    libmdAddrJavaWrapper.dll on Windows, is installed.

    Without the correct JVM argument setting, the following error is to be expected:
    java.lang.Error: java.lang.UnsatisfiedLinkError.

  5. Save your Job and press F6 to execute it.

    The tMelissaDataAddress reads the input address
    rows, corrects and formats the addresses and gives the result in a kind of
    “standardized” address output rows.

    use_case_tmelissadataaddress6.png

    In addition to verifying and standardizing an address, tMelissaDataAddress will also match street names against a zip code, match
    geographic data to zip code and city information and finally parse street addresses and
    return all these results via different output columns. The above capture shows only some
    of the output columns written by the tMelissaDataAddress component.

    These output columns return for example the standard company and city names, up to
    two street address lines, two-letter abbreviation for the state and country names and
    the postal zip code.

    They also return some result code. These codes are written in comma-delimited lists.
    Each code consists of two letters followed by two numbers. These codes indicate
    different statuses and errors. For example, the AC02 code means
    that the state name is corrected based on the combination of city name and zip code, and
    the AS01 code means that the street address is valid and
    deliverable.

    For a complete list of the meaning of the result codes and for further information
    about all the output columns, see the Address Object Reference Guide you can download
    from the Support Center of MelissaData athttp://www.melissadata.com/.


Document get from Talend https://help.talend.com
Thank you for watching.
Subscribe
Notify of
guest
0 Comments
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x