官术网_书友最值得收藏!

Finding the sum of all even numbers in an RDD of integers

Let's first create an RDD of integers as follows:

scala>valintRDD = sc.parallelize(Array(1,4,5,6,7,10,15))
intRDD: org.apache.spark.rdd.RDD[Int] = ParallelCollectionRDD[0] at parallelize at <console>:24

The next step is to filter all the even elements in this RDD. So, we will execute a filter operation on the RDD, as follows:

scala>valevenNumbersRDD=intRDD.filter(i => (i%2==0))
evenNumbersRDD: org.apache.spark.rdd.RDD[Int] = MapPartitionsRDD[3] at filter at <console>:26

The preceding operation will fetch those elements for which 2 is a factor, that is, even elements .

Now, we will sum all the elements of this RDD as follows:

scala>val sum =evenNumbersRDD.sum
sum: Double = 20.0
scala> sum
res5: Double = 20.0

It is shown in the following screenshot:

主站蜘蛛池模板: 江阴市| 当涂县| 伊春市| 晋城| 襄城县| 高邑县| 五华县| 诸城市| 林西县| 渝中区| 美姑县| 墨玉县| 唐山市| 延长县| 开江县| 育儿| 项城市| 奎屯市| 台北县| 西吉县| 正镶白旗| 宜丰县| 金溪县| 舟山市| 铜陵市| 获嘉县| 科技| 江北区| 禄丰县| 兰州市| 武定县| 延长县| 库伦旗| 交口县| 嘉鱼县| 伊川县| 思南县| 阿拉善右旗| 河间市| 建宁县| 衡东县|