July 30, 2023

tELTGreenplumMap – Docs for ESB 7.x

tELTGreenplumMap

Uses the tables provided as input to feed the parameter in the built statement. The
statement can include inner or outer joins to be implemented between tables or between
one table and its aliases.

The three ELT Greenplum components are closely
related, in terms of their operating conditions. These components should be used to
handle Greenplum DB schemas to generate Insert statements, including clauses, which are
to be executed in the DB output table defined.

Helps you to build the SQL statement graphically, using the table
provided as input.

tELTGreenplumMap Standard properties

These properties are used to configure tELTGreenplumMap running in the Standard Job framework.

The Standard
tELTGreenplumMap component belongs to the ELT family.

The component in this framework is available in all Talend
products
.

Basic settings

Use an existing connection

Select this check box and in the Component List click the relevant connection component to
reuse the connection details you already defined.

Note: When a Job contains the parent Job and the child Job, if you
need to share an existing connection between the two levels, for example, to share the
connection created by the parent Job with the child Job, you have to:

  1. In the parent level, register the database connection
    to be shared in the Basic
    settings
    view of the connection component which creates that very database
    connection.

  2. In the child level, use a dedicated connection
    component to read that registered database connection.

For an example about how to share a database connection
across Job levels, see

Talend Studio
User Guide
.

ELT Greenplum Map Editor

The ELT Map editor allows you to define the output schema and make a
graphical build of the SQL statement to be executed. The column names of
schema can be different from the column names in the database.

Style link

Select the way in which links are displayed.

Auto: By default, the links between the
input and output schemas and the Web service parameters are in the form
of curves.

Bezier curve: Links between the schema
and the Web service parameters are in the form of curve.

Line: Links between the schema and the
Web service parameters are in the form of straight lines.

This option slightly optimizes performance.

Property type

Either Built-in or Repository.

 

Built-in: No property data stored
centrally.

 

Repository: Select the Repository
file where Properties are stored. The following fields are pre-filled in
using fetched data.

Host

Database server IP address.

Port

Listening port number of DB server.

Database

Name of the database.

Username and Password

DB user authentication data.

To enter the password, click the […] button next to the
password field, and then in the pop-up dialog box enter the password between double quotes
and click OK to save the settings.

Advanced settings

Additional JDBC parameters

Specify additional connection properties for the DB connection you are
creating. This option is not available if you have selected the
Use an existing connection check
box in the Basic settings.

tStatCatcher Statistics

Select this check box to gather the Job processing metadata at a Job
level as well as at each component level.

Global Variables

Global Variables

ERROR_MESSAGE: the error message generated by the
component when an error occurs. This is an After variable and it returns a string. This
variable functions only if the Die on error check box is
cleared, if the component has this check box.

A Flow variable functions during the execution of a component while an After variable
functions after the execution of the component.

To fill up a field or expression with a variable, press Ctrl +
Space
to access the variable list and choose the variable to use from it.

For further information about variables, see
Talend Studio

User Guide.

Usage

Usage rule

tELTGreenplumMap is used along with
tELTGreenplumInput and tELTGreenplumOutput. Note that the Output
link to be used with these components must correspond strictly to the
syntax of the table name.

Note:

Note that the ELT components do not handle actual data flow but
only schema information.

Dynamic settings

Click the [+] button to add a row in the table
and fill the Code field with a context
variable to choose your database connection dynamically from multiple
connections planned in your Job. This feature is useful when you need to
access database tables having the same data structure but in different
databases, especially when you are working in an environment where you
cannot change your Job settings, for example, when your Job has to be
deployed and executed independent of Talend Studio.

The Dynamic settings table is
available only when the Use an existing
connection
check box is selected in the Basic settings view. Once a dynamic parameter is
defined, the Component List box in the
Basic settings view becomes unusable.

For examples on using dynamic parameters, see Reading data from databases through context-based dynamic connections and Reading data from different MySQL databases using dynamically loaded connection parameters. For more information on Dynamic
settings
and context variables, see Talend Studio
User Guide.

Mapping data using a simple implicit join

In this scenario, a tELTGreenplumMap
component is deployed to retrieve the data from the source table employee_by_statecode, compares its statecode column against the
table statecode, and then maps the desired columns from the two tables
to the output table employee_by_state.

Before the Job execution, the three tables, employee_by_statecode, statecode and employee_by_state look like:

tELTGreenplumMap_1.png

Dropping components

tELTGreenplumMap_2.png

  1. Add the following components from the Palette
    to the workspace:

    • tGreenplumConnection
    • two tELTGreenplumInput
    • tELTGreenplumMap
    • tELTGreenplumOutput
    • tGreenplumCommit
    • tGreenplumInput
    • tLogRow
  2. Rename the following components:

    • tGreenplumConnection to
      connect_to_greenplum_host
    • two tELTGreenplumInput to
      employee+statecode and
      statecode
    • tELTGreenplumMap to
      match+map
    • tELTGreenplumOutput to
      map_data_output
    • tGreenplumCommit to
      commit_to_host
    • tGreenplumInput to
      read_map_output_table
    • tLogRow to show_map_data
  3. Connect the components in the Job:

    • link tGreenplumConnection to tELTGreenplumMap using an OnSubjobOk trigger
    • link tELTGreenplumMap to tGreenplumCommit using an OnSubjobOk trigger
    • link tGreenplumCommit to tGreenplumInput using an OnSubjobOk trigger
    • link tGreenplumInput to tLogRow using a Row
      > Main
      connection
    The two tELTGreenplumInput
    components and tELTGreenplumOutput will be
    linked to tELTGreenplumMap later once the
    relevant tables have been defined.

Configuring the components

  1. Double-click tGreenplumConnection to open
    its Basic settings view in the Component tab.

    tELTGreenplumMap_3.png

    1. In the Host and
      Port fields, enter
      the context variables for the Greenplum server.
    2. In the Database field,
      enter the context variable for the Greenplum database.
    3. In the Username and Password fields, enter the context variables for
      the authentication credentials.
    For more information on context variables, see
    Talend Studio User
    Guide.
  2. Double-click employee+statecode to open its
    Basic settings view in the Component tab.

    tELTGreenplumMap_4.png

    1. In the Default table name field, enter
      the name of the source table, namely employee_by_statecode.
    2. Click the […] button next to the
      Edit schema field to open the schema
      editor.

      tELTGreenplumMap_5.png

    3. Click the [+] button
      to add three columns, namely id, name and statecode, with the data type as INT4, VARCHAR,
      and INT4 respectively.
    4. Click OK to close the
      schema editor.
    5. Link employee+statecode to tELTGreenplumMap using the output employee_by_statecode.
  3. Double-click statecode to open its Basic settings view in the Component tab.

    tELTGreenplumMap_6.png

    1. In the Default table
      name
      field, enter the name of the lookup table, namely statecode.
  4. Click the […] button next to the Edit schema field to open the schema editor.

    tELTGreenplumMap_7.png

    1. Click the [+] button
      to add two columns, namely state and statecode, with the data type as VARCHAR and INT4 respectively.
    2. Click OK to close the
      schema editor.
    3. Link statecode to
      tELTGreenplumMap
      using the output statecode.
  5. Click tELTGreenplumMap to open its Basic settings view in the Component tab.

    tELTGreenplumMap_8.png

    1. Select the Use an existing
      connection
      check box.
  6. Click the […] button next to the ELT Greenplum Map Editor field to open the map
    editor.

    tELTGreenplumMap_9.png

  7. Click the [+] button on the upper left corner to open the table
    selection box.

    tELTGreenplumMap_10.png

    1. Select tables employee_by_statecode and statecode in sequence and click Ok. The tables appear on the left panel of the
      editor.
  8. On the upper right corner, click the [+]
    button to add an output table, namely employee_by_state.

    1. Click Ok to close the
      map editor.
  9. Double-click tELTGreenplumOutput to open its
    Basic settings view in the Component tab.

    tELTGreenplumMap_11.png

    1. In the Default table
      name
      field, enter the name of the output table, namely employee_by_state.
  10. Click the […] button next to the Edit schema field to open the schema editor.

    tELTGreenplumMap_12.png

    1. Click the [+] button
      to add three columns, namely id, name and state, with the data type as INT4, VARCHAR,
      and VARCHAR respectively.
    2. Click OK to close the
      schema editor.
    3. Link tELTGreenplumMap
      to tELTGreenplumOutput
      using the table output employee_by_state.
    4. Click OK on the
      pop-up window below to retrieve the schema of tELTGreenplumOutput.

      tELTGreenplumMap_13.png

    ow the map editor’s output table employee_by_state shares the same schema as that of tELTGreenplumOutput.
  11. Double-click tELTGreenplumMap to open the map
    editor.

    D.
    Drop the columns id and name from table employee_by_statecode as well as the column statecode from table statecode to their counterparts in the output table employee_by_state.
    Click Ok to close the map editor.
    1. Drop the column statecode from table employee_by_statecode to its counterpart
      of the table statecode, looking for the records in the two tables that have the
      same statecode values.
  12. Double-click tGreenplumInput to open its
    Basic settings view in the Component tab.

    tELTGreenplumMap_14.png

    1. Select the Use an existing
      connection
      check box.
    2. In the Table name
      field, enter the name of the source table, namely employee_by_state.
    3. In the Query field,
      enter the query statement, namely "SELECT * FROM "employee_by_state"".
  13. Double-click tLogRow to open its Basic settings view in the Component tab.

    tELTGreenplumMap_15.png

    1. In the Mode area,
      select Table (print values in cells
      of a table
      for a better display.

Executing the Job

  1. Press Ctrl+S to save the Job.
  2. Press F6 to run the Job.

    tELTGreenplumMap_16.png

    As shown above, the desired employee records have been written to the table
    employee_by_state, presenting clearer
    geographical information about the employees.

Document get from Talend https://help.talend.com
Thank you for watching.
Subscribe
Notify of
guest
0 Comments
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x