The three ELT Greenplum components are closely related, in terms of their operating
conditions. These components should be used to handle Greenplum DB schemas to generate
Insert statements, including clauses, which are to be executed in the DB output table
defined.
|
Component family |
ELT/Map/Greenplum |
|
|
Function |
Helps you to build the SQL statement graphically, using the table |
|
|
Purpose |
Uses the tables provided as input, to feed the parameter in the |
|
|
Basic settings |
Use an existing connection |
Select this check box and in the Component List click the NoteWhen a Job contains the parent Job and the child Job, if you need to share an existing
For an example about how to share a database connection across Job levels, see |
|
|
ELT Greenplum Map Editor |
The ELT Map editor allows you to define the output schema and make |
|
|
Style link |
Select the way in which links are displayed.
Auto: By default, the links between
Bezier curve: Links between the
Line: Links between the schema and This option slightly optimizes performance. |
|
|
Property type |
Either Built-in or Repository. Since version 5.6, both the Built-In mode and the Repository mode are |
|
|
|
Built-in: No property data stored |
|
|
|
Repository: Select the Repository |
|
|
Host |
Database server IP address. |
|
|
Port |
Listening port number of DB server. |
|
|
Database |
Name of the database. |
|
|
Username and |
DB user authentication data. To enter the password, click the […] button next to the |
|
Advanced settings |
Additional JDBC parameters |
Specify additional connection properties for the DB connection you |
|
|
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at a |
|
Dynamic settings |
Click the [+] button to add a row in the table and fill The Dynamic settings table is available only when the For more information on Dynamic settings and context |
|
|
Global Variables |
ERROR_MESSAGE: the error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see Talend Studio |
|
|
Usage |
tELTGreenplumMap is used along NoteNote that the ELT components do not handle actual data flow |
|
You can also restrict the Select statement based on a Where clause and/or other clauses such
as Group By, Order By, etc. by clicking the Add filter
row button at the top of the output table in the map editor.
To add a restriction based on a Where clause, click the Add
filter row button and select Add a WHERE
clause from the popup menu.
To add a restriction based on Group By, Order By etc., click the Add filter row button and select Add an
other(GROUP…) clause from the popup menu.

Make sure that all input components are linked correctly to the ELT Map component to
be able to implement all inclusions, joins and clauses.
In this scenario, a tELTGreenplumMap component is
deployed to retrieve the data from the source table employee_by_statecode, compares its statecode column against the table statecode, and then maps the desired columns from the two tables to the
output table employee_by_state.
Before the Job execution, the three tables, employee_by_statecode, statecode and
employee_by_state look like:

-
Drop tGreenplumConnection, tELTGreenplumInput (two), tELTGreenplumMap, tELTGreenplumOutput, tGreenplumCommit, tGreenplumInput and tLogRow
from the Palette onto the workspace. -
Rename tGreenplumConnection as connect_to_greenplum_host, two tELTGreenplumInput components as employee+statecode and statecode, tELTGreenplumMap
as match+map, tELTGreenplumOutput as map_data_output, tGreenplumCommit as commit_to_host, tGreenplumInput as read_map_output_table and tLogRow as show_map_data. -
Link tGreenplumConnection to tELTGreenplumMap using an OnSubjobOk trigger.
Link tELTGreenplumMap to tGreenplumCommit using an OnSubjobOk trigger.
Link tGreenplumCommit to tGreenplumInput using an OnSubjobOk trigger.
-
Link tGreenplumInput to tLogRow using a Row >
Main connection.The two tELTGreenplumInput components and
tELTGreenplumOutput will be linked to
tELTGreenplumMap later once the
relevant tables have been defined.
-
Double-click tGreenplumConnection to
open its Basic settings view in the
Component tab.
In the Host and Port fields, enter the context variables for the Greenplum
server.In the Database field, enter the context
variable for the Greenplum database.In the Username and Password fields, enter the context variables for the
authentication credentials.For more information on context variables, see Talend Studio User Guide.
-
Double-click employee+statecode to open
its Basic settings view in the Component tab.
In the Default table name field, enter
the name of the source table, namely employee_by_statecode.Click the […] button next to the
Edit schema field to open the schema
editor.
Click the [+] button to add three
columns, namely id,
name and statecode, with the
data type as INT4, VARCHAR, and INT4 respectively.Click OK to close the schema
editor.Link employee+statecode to tELTGreenplumMap using the output employee_by_statecode.
-
Double-click statecode to open its
Basic settings view in the Component tab.
In the Default table name field, enter
the name of the lookup table, namely statecode. -
Click the […] button next to the
Edit schema field to open the schema
editor.
Click the [+] button to add two columns,
namely state and
statecode, with the data type as VARCHAR and INT4 respectively.Click OK to close the schema
editor.Link statecode to tELTGreenplumMap using the output statecode.
-
Click tELTGreenplumMap to open its
Basic settings view in the Component tab.
Select the Use an existing connection
check box. -
Click the […] button next to the
ELT Greenplum Map Editor field to open
the map editor.
-
Click the [+] button on the upper left
corner to open the table selection box.
Select tables employee_by_statecode and
statecode in sequence and click
Ok.The tables appear on the left panel of the editor.
-
On the upper right corner, click the [+]
button to add an output table, namely employee_by_state.Click Ok to close the map editor.
-
Double-click tELTGreenplumOutput to open
its Basic settings view in the Component tab.
In the Default table name field, enter
the name of the output table, namely employee_by_state. -
Click the […] button next to the
Edit schema field to open the schema
editor.
Click the [+] button to add three
columns, namely id,
name and state, with the data
type as INT4, VARCHAR, and VARCHAR
respectively.Click OK to close the schema
editor.Link tELTGreenplumMap to tELTGreenplumOutput using the table output
employee_by_state.Click OK on the pop-up window below to
retrieve the schema of tELTGreenplumOutput.
Now the map editor’s output table employee_by_state shares the same schema as that of
tELTGreenplumOutput. -
Double-click tELTGreenplumMap to open the
map editor.Drop the column statecode from table
employee_by_statecode to its
counterpart of the table statecode,
looking for the records in the two tables that have the same statecode values.Drop the columns id and name from table employee_by_statecode as well as the column statecode from table statecode to their counterparts in the output table
employee_by_state.Click Ok to close the map editor.
-
Double-click tGreenplumInput to open its
Basic settings view in the Component tab.
Select the Use an existing connection
check box.In the Table name field, enter the name
of the source table, namely employee_by_state.In the Query field, enter the query
statement, namely"SELECT * FROM "employee_by_state"". -
Double-click tLogRow to open its
Basic settings view in the Component tab.
In the Mode area, select Table (print values in cells of a table for a
better display.
For related scenarios, see:
-
A related scenario using subquery: Scenario: Mapping data using a subquery.
