官术网_书友最值得收藏!

Chapter 5. Controlling the Flow of Data

In the previous chapters, you learned to transform your data in many ways. Now suppose you collect results from a survey. You receive several files with the data and those files have different formats. You have to merge those files somehow, and generate a unified view of the information. Not only that, you want to remove the rows of data whose content is irrelevant. Finally, based on the rows that interest you, you want to create another file with some statistics. This kind of requirement is very common, but requires more background in PDI.

In this chapter, you will learn how to implement this kind of task with Kettle. In particular, we will cover the following topics:

  • Copying and distributing rows
  • Splitting the stream based on conditions
  • Merging streams

You will also apply these concepts in the treatment of invalid data.

主站蜘蛛池模板: 潮州市| 蒙城县| 凉城县| 滕州市| 伊宁市| 万盛区| 江西省| 新蔡县| 芦山县| 五台县| 上犹县| 滁州市| 鸡东县| 长垣县| 昌乐县| 东明县| 上林县| 平南县| 宁夏| 鹤庆县| 榆中县| 黎川县| 钦州市| 沙田区| 古交市| 明光市| 班戈县| 绥江县| 洮南市| 沭阳县| 阳东县| 关岭| 乌拉特中旗| 麦盖提县| 临漳县| 巍山| 巴南区| 盐津县| 岐山县| 庆城县| 宜城市|