
Component family |
XML |
|
Function |
Validates an input XML file or an input XML flow against an XSD |
|
Purpose |
Helps at controlling data and structure quality of the file or |
|
Basic settings |
Mode |
From this dropdown list, select: – File, to validate an input – Flow, to validate an input |
|
Schema and Edit |
A schema is a row description, it defines the number of fields to The schema of this component is read-only. It contains standard |
NoteFile mode only |
XSD file |
Filepath to the reference XSD file. HTTP URL also supported, e.g. |
NoteFile mode only |
XML file |
Filepath to the XML file to be validated. |
NoteFile mode only |
If XML is valid, display If XML is |
Type in a message to be displayed in the Run console based on the result of the |
NoteFile mode only |
Print to console |
Select this check box to display the validation message. |
NoteFlow mode only |
Allocate |
Specify the column or columns to be validated and the path to the |
Advanced settings |
Encoding |
Enter the encoding type between quotes. |
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at a |
|
Global Variables |
ERROR_MESSAGE: the error message generated by the DIFFERENCE: the result of the validation. This is a Flow VALID: the validation result. This is a Flow variable and XSD_ERROR_MESSAGE: the xsd error message generated by the A Flow variable functions during the execution of a component while an After variable To fill up a field or expression with a variable, press Ctrl + For further information about variables, see Talend Studio |
|
Usage |
When used in File mode, this |
|
Limitation |
n/a |
This scenario describes a Job that validates an XML column in an input file against a
reference XSD file and outputs the log information for the invalid rows of the column
into a delimited file. For the tXSDValidator use case
that validates an XML file, see Scenario: Validating XML files.
-
Drop a tFileInputDelimited component, a
tXSDValidator component, and two FileOutputDelimited components from the Palette to the design workspace. -
Double-click the tFileInputDelimited to open
its Component view and set its
properties: -
Use the Built-In
property type for this scenario.Browse to the input file, and define the number of rows to be skipped in the
beginning of the file.Use a Built-In schema
for this scenario. This means that it is available for this Job only.Click Edit schema and edit the schema according to the input file. In this
scenario, the input file has only two columns: ID and
ShipmentInfo. The ShipmentInfo
column is an XML column and needs to be validated. -
On your design workspace, connect the tFileInputDelimited component to the tXSDValidator component using a Row > Main link.
-
Double-click the tXSDValidator component, and
set its properties: -
From the Mode dropdown list, select Flow Mode.
Use a Built-In
schema for this scenario. Click Sync
columns to retrieve the schema from the preceding component. To
view or modify the schema, click the three-dot button next to Edit schema.Add a line in the Allocate table by clicking
the plus button. The name of the first column of the input file automatically
appears in the Input Column field. Click in the
field and select the column you want to validate.In the XSD File field, fill in the path to
your reference XSD file. -
On your design workspace, connect the tXSDValidator component to one tFileOutputDelimited component using a Row > Main link to output
the information about valid XML rows. -
Connect the tXSDValidator component to the
other tFileOutputDelimited component using a
Row > Rejects link to output the information about invalid XML
rows. -
Double-click each of the two tFileOutputDelimited components and configure the component
properties.In the File Name field, enter or, if you want
to use an existing output file, browse to the output file path. -
Select Built-In from the Schema list and click Sync
columns to retrieve the schema from the preceding
component. -
Save your Job and press F6 to run it.

The output files contain the validation information about the valid and invalid XML
rows of the specified column respectively.