tMapRStreamsInputAvro
these messages. Only MapR V5.2 onwards is supported by this component.
tMapRStreamsInputAvro properties for Apache Spark Streaming
These properties are used to configure tMapRStreamsInputAvro running in the Spark Streaming Job framework.
The Spark Streaming
tMapRStreamsInputAvro component belongs to the Messaging family.
The streaming version of this component is available in Talend Real Time Big Data Platform and in
Talend Data Fabric.
Basic settings
Schema and Edit |
A schema is a row description. It defines the number of fields Note that the schema of this component is read-only. It stores the |
Starting offset |
Select the starting point from which the messages of a topic are consumed. In MapR Streams, the increasing ID number of a message is called offset. When a new consumer group starts, from this list, you can select Note that the consumer group takes into account only the offset-committed messages to Each consumer group has its own counter to remember the position of a message it has
|
Topic name |
Enter the name of the topic from which tMapRStreamsInput |
Set number of records per second to read from |
Enter this number within double quotation marks to limit the size of each batch to be sent For example, if you put 100 and the batch value you If you leave this check box clear, the component tries to read all the available messages |
Advanced settings
Consumer properties |
Add the MapR Streams consumer properties you need to customize to this table. For further information about the consumer properties you can define in this table, see |
Use hierarchical mode |
Select this check box to map the binary (including hierarchical) Avro schema to the Once selecting it, you need set the following parameter(s):
|
Usage
Usage rule |
This component is used as a start component and requires an output link. |
Spark Connection |
In the Spark
Configuration tab in the Run view, define the connection to a given Spark cluster for the whole Job. In addition, since the Job expects its dependent jar files for execution, you must specify the directory in the file system to which these jar files are transferred so that Spark can access these files:
This connection is effective on a per-Job basis. |
Prerequisites |
The Hadoop distribution must be properly installed, so as to guarantee the interaction
For further information about how to install a Hadoop distribution, see the manuals |
Related scenarios
No scenario is available for the Spark Streaming version of this component
yet.