tAzureAdlsGen2Output
Uploads incoming data to an ADLS Gen2 file system of an Azure storage
account in the specified format.
tAzureAdlsGen2Output Standard properties
These properties are used to configure tAzureAdlsGen2Output running in the Standard Job framework.
The Standard
tAzureAdlsGen2Output component belongs to the Cloud family.
The component in this framework is available in all subscription-based Talend products with Big Data
and Talend Data Fabric.
Basic settings
Property Type |
Select the way the connection details
|
Schema and Edit schema |
A schema is a row description. It defines the number of fields
Click Edit
schema to make changes to the schema. Note: If you
make changes, the schema automatically becomes built-in.
|
Sync |
Click this button to retrieve the |
Authentication method |
Select one of the following
authentication method from the drop-down list.
|
Account |
Enter the name of the Data Lake |
Endpoint |
Enter the Azure Storage service The combination of the account name and the |
Shared |
Enter the key associated with the This field is available if you select Shared key from Authentication method drop-down list. |
SAS |
Enter your account SAS token. You can This field is available if you select |
Check |
Click this button to validate the |
Filesystem |
Enter the name of the target Blob You can also click the … button to |
Blobs |
Enter the path to the target |
Format |
Set the format for the incoming data. |
Field |
Set the field delimiter. You can |
Record |
Set the record separator. You can |
Text |
Enter the character used to enclose |
Escape |
Enter the character of the row to be |
Header |
Select this check box to insert a header row to the data. The schema column |
File |
Select the file encoding from the |
Advanced settings
tStatCatcher Statistics |
Select this check box to gather the Job processing metadata at the Job level |
Max batch |
Set the maximum number of lines Do not change the default |
Blob |
Enter a string as the name prefix for the Blob files generated. The name of a |
Global Variables
ERROR_MESSAGE |
The error message generated by the component when an error occurs. This |
NB_LINE |
The number of rows successfully processed. This is an After variable and it returns |
Usage
Usage rule |
This component is usually used as an end component of a Job or |
Accessing Azure ADLS Gen2 storage
This scenario demonstrates the use of the
tAzureAdlsGen2Output and tAzureAdlsGen2Input
components. In the first subJob, a tFixedFlowInput component passes
data to tAzureAdlsGen2Output, which then uploads the data to Azure ADLS
Gen2 storage; in the second subJob, tAzureAdlsGen2Input reads the data
and passes it to tLogRow.
1 2 3 4 5 6 |
1;James 2;Josephine 3;Donette 4;Simona 5;Mitsue 6;Leota |
This scenario requires an Azure storage user
account with permissions for reading and writing files.
Optionally, you can monitor the data using Microsoft Azure Storage
Explorer, a utility for managing your Azure storage resources. Check Azure Storage Explorer for related information.
Accessing Azure ADLS Gen2 storage: establishing the Job
-
Create a standard Job and drop tFixedFlowInput,
tAzureAdlsGen2Output, tAzureAdlsGen2Input, and tLogRow onto the workspace. -
Connect tFixedFlowInput and
tAzureAdlsGen2Output using the Row > Main link. -
Connect tAzureAdlsGen2Input and
tLogRow using the Row > Main link. -
Connect tFixedFlowInput and
tAzureAdlsGen2Input using the RowTrigger > OnSubjobOk link.
Accessing Azure ADLS Gen2 storage: setting up the Job
-
In the Basic settings
view of tFixedFlowInput:- Click the Edit
schema button and add two columns: id (type Integer) and name (type String); - Select Use Inline Content(delimited
file) and enter the following into the Content field.
1234561;James2;Josephine3;Donette4;Simona5;Mitsue6;Leota - Leave other options as they are.
- Click the Edit
-
In the Basic settings
view of tAzureAdlsGen2Output:- Click the Edit
schema button and add two columns: id (type Integer) and name (type String); - Provide your Azure storage user account credentials in
the Authentication method,
Account name, Endpoint suffix, and Shared key. - Validate your Azur storage user account by clicking
Check connection. - Enter the name of an existing Blob container in Filesystem. You can also click … to the right of this field and
select the Blob container from the list in the dialog box. - In Blobs Path,
enter the name of the directory where you want to put the data. - Select CSV for Format; Semicolon for
Field Delimiter; and CRLF for Record
Separator. Select the Header option. - Leave other options as they are.
- Click the Edit
-
In the Advanced settings
view of tAzureAdlsGen2Input, enter the
prefix for the Blob files generated in the Blob Template
Name field (data- in this example). -
Do exact the same described in step 2 for the
tAzureAdlsGen2Input component. Be sure to propagate the schema to the subsequent
component when prompted. -
In the Basic settings view of tLogRow:
- Select Table (print values in cells of a
table). - Leave other options as they are.
- Select Table (print values in cells of a
Accessing Azure ADLS Gen2 storage: executing the Job
- Press F6 to run the Job.
-
Check the result in the Run console.
-
(Option) Check the Blob file generated using Microsoft Azure Storage Explorer. See Get started with Storage Explorer
for related information.