Arranging data flow for the KMeans Job
-
In the
Integration
perspective of the Studio, create an empty Job from the Job Designs node in the Repository tree view.For further information about how to create a Job, see
Talend Open Studio for Big Data Getting Started
Guide
. - In the workspace, enter the name of the component to be used and select this component from the list that appears.
-
Connect tFileInputDelimited to tReplicate using the Row >
Main link. - Do the same to connect tReplicate to tModelEncoder and then tModelEncoder to tKMeansModel.
- Repeat the operations to connect tReplicate to tPredict and then tPredict to tFileOutputDelimited.
- Leave tHDFSConfiguration as it is.
Document get from Talend https://help.talend.com
Thank you for watching.
Subscribe
Login
0 Comments