官术网_书友最值得收藏!

Reading data from files

For this recipe, we will create an RDD by reading a local file in PySpark. To create RDDs in Apache Spark, you will need to first install Spark as noted in the previous chapter. You can use the PySpark shell and/or Jupyter notebook to run these code samples. Note that while this recipe is specific to reading local files, a similar syntax can be applied for Hadoop, AWS S3, Azure WASBs, and/or Google Cloud Storage:

主站蜘蛛池模板: 松滋市| 平武县| 澄城县| 朝阳区| 西乌| 交口县| 汉中市| 波密县| 崇左市| 双牌县| 北辰区| 株洲市| 左云县| 红河县| 无为县| 南丹县| 双江| 白山市| 三江| 六安市| 郧西县| 济源市| 屯门区| 岳西县| 高邮市| 哈密市| 启东市| 利川市| 永平县| 屏边| 巴林左旗| 临桂县| 柳江县| 封开县| 辰溪县| 尉氏县| 卫辉市| 新民市| 滁州市| 宕昌县| 集贤县|