tBigQueryOutputBulk
Creates a .txt or .csv file for large volumes of data
so that you can process the file according to your needs before transferring it to
Google BigQuery.
The tBigQueryOutputBulk and tBigQueryBulkExec components are generally used together as parts of a two-step
process. In the first step, an output file is generated. In the second step, this file is
used to feed a dataset. These two steps are fused together in the tBigQueryOutput component, detailed in a separate section. The advantage of
using two separate components is that the data can be transformed before it is loaded into the
dataset.
This component writes the given data into a .txt or
.csv file, ready to be transferred to Google BigQuery.
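The two-step process described above can be illustrated outside of Talend with a minimal Python sketch. It is not the code the component generates (Talend Jobs compile to Java), and the function names `stage_rows` and `transform_staged_file` are hypothetical. The sketch only shows why a staging file is useful: the file can be filtered or otherwise transformed before a separate step loads it into the dataset. The load step itself is left out.

```python
import csv


def stage_rows(rows, path):
    """Step 1 (illustrative): write rows to a .csv staging file.

    Returns the number of rows written.
    """
    count = 0
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        for row in rows:
            writer.writerow(row)
            count += 1
    return count


def transform_staged_file(src_path, dst_path, keep_row):
    """Optional intermediate step: filter the staged file before loading.

    keep_row is a caller-supplied predicate that receives each row as a list
    of strings. Returns the number of rows kept.
    """
    kept = 0
    with open(src_path, newline="", encoding="utf-8") as src, \
            open(dst_path, "w", newline="", encoding="utf-8") as dst:
        writer = csv.writer(dst)
        for row in csv.reader(src):
            if keep_row(row):
                writer.writerow(row)
                kept += 1
    return kept
```

In this sketch, a call such as `transform_staged_file("staged.csv", "cleaned.csv", lambda row: row[1] != "")` would drop rows whose second field is empty before the cleaned file is handed to the loading step.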
tBigQueryOutputBulk Standard properties
These properties are used to configure tBigQueryOutputBulk running in the Standard Job framework.
The Standard tBigQueryOutputBulk component belongs to the Big Data family.
The component in this framework is generally available.
Basic settings
Schema and Edit schema |
A schema is a row description. It defines the number of fields (columns). Click Edit schema to make changes to the schema.
|
Property type |
Built-In: You create and store the |
|
Repository: You have already created |
File name |
Browse, or enter the path to the .txt or .csv file |
Append |
Select the check box to write new data at the end of the existing |
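As a rough illustration of the Append option, the following Python sketch contrasts appending with overwriting. The function name `write_rows` is hypothetical, and the behavior when the check box is cleared (the existing file being overwritten) is an assumption, since the description above does not state it.

```python
import csv


def write_rows(path, rows, append=False):
    """Write rows to a .csv file.

    append=True adds the rows at the end of the existing file.
    append=False is ASSUMED to replace the existing content.
    """
    mode = "a" if append else "w"
    with open(path, mode, newline="", encoding="utf-8") as handle:
        csv.writer(handle).writerows(rows)
```

Calling `write_rows(path, rows, append=True)` repeatedly accumulates rows in the same file, which is the pattern to use when several runs feed one staging file.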
Advanced settings
Field Separator |
Enter a character, string, or regular expression to separate fields for the transferred |
Create directory if not exists |
Select this check box to create the directory you defined in the |
Custom the flush buffer size |
Enter the number of rows to be processed before the memory is freed. |
Check disk space |
Select this check box to throw an exception during execution |
Encoding |
Select the encoding from the list or select Custom and define it manually. This field is compulsory for database |
tStatCatcher Statistics |
Select this check box to collect the log data at the component |
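The advanced settings above map onto familiar file-writing concerns. The Python sketch below is only an approximation under stated assumptions, not the component's implementation: the function name `write_delimited_file` and all of its parameters are hypothetical, the disk-space check uses a caller-supplied threshold because the exact condition the component tests is not given here, and `csv.writer` accepts only a single-character delimiter, so string or regular-expression separators are not covered.

```python
import csv
import os
import shutil


def write_delimited_file(rows, path, field_separator=";", create_dirs=True,
                         flush_every=1000, min_free_bytes=0, encoding="utf-8"):
    """Write rows to a delimited file; returns the number of rows written."""
    directory = os.path.dirname(os.path.abspath(path))

    # "Create directory if not exists": create the target directory on demand.
    if create_dirs:
        os.makedirs(directory, exist_ok=True)

    # "Check disk space" (assumed semantics): fail early if too little is free.
    if shutil.disk_usage(directory).free < min_free_bytes:
        raise OSError("not enough free disk space in " + directory)

    count = 0
    # "Encoding": the file is written with the requested character encoding.
    with open(path, "w", newline="", encoding=encoding) as handle:
        # "Field Separator": single-character delimiter only in this sketch.
        writer = csv.writer(handle, delimiter=field_separator)
        for row in rows:
            writer.writerow(row)
            count += 1
            # "Custom the flush buffer size": flush after every N rows.
            if count % flush_every == 0:
                handle.flush()
    return count
```

A smaller `flush_every` value pushes data to disk more often at the cost of more I/O calls, which mirrors the trade-off behind the flush buffer size setting.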
Global Variables
Global Variables |
NB_LINE: the number of rows read by an input component or
ERROR_MESSAGE: the error message generated by the
A Flow variable functions during the execution of a component while an After variable
To fill up a field or expression with a variable, press Ctrl +
For further information about variables, see |
Usage
Usage rule |
This is an output component which needs the data provided by its |
Related Scenario
For a related topic, see Scenario: Writing data in BigQuery