
What you need for this book

The practical exercises in this book are demonstrated on virtual machines (VMs) from Cloudera, Hortonworks, or MapR, or on a prebuilt Spark for Hadoop distribution, so that you can get started easily. The same exercises can also be run on a larger cluster.

Prerequisites for using virtual machines on your laptop:

  • RAM: 8 GB or more
  • CPU: At least two virtual CPUs
  • The latest VMware Player or Oracle VirtualBox for Windows or Linux
  • The latest Oracle VirtualBox or VMware Fusion for Mac
  • Virtualization enabled in BIOS
  • Browser: Chrome 25+, IE 9+, Safari 6+, or Firefox 18+ recommended (HDP Sandbox will not run on IE 10)
  • PuTTY
  • WinSCP

The Python and Scala programming languages are used throughout the chapters, with a greater focus on Python. Readers are assumed to have a basic programming background in Java, Scala, Python, SQL, or R, along with basic Linux experience. Prior working experience with Big Data environments on Hadoop platforms will give you a head start in building Spark applications.
