官术网_书友最值得收藏!

What is Spark?

According to Apache, Spark is a fast and general engine for large-scale data processing. This is actually a really good summary of what it's all about. If you have a really massive dataset that can represent anything - weblogs, genomics data, you name it - Spark can slice and dice that data up. It can distribute the processing among a huge cluster of computers, taking a data analysis problem that's just too big to run on one machine and divide and conquer it by splitting it up among multiple machines.

主站蜘蛛池模板: 新沂市| 汤阴县| 台安县| 岳普湖县| 正定县| 和田县| 洛南县| 丰城市| 东丰县| 永清县| 呼和浩特市| 昭平县| 延庆县| 化德县| 全椒县| 平邑县| 铁力市| 七台河市| 永寿县| 河源市| 叶城县| 壶关县| 邯郸市| 寻乌县| 咸宁市| 沁阳市| 东城区| 英超| 永平县| 宜丰县| 山阳县| 南安市| 民乐县| 海南省| 揭阳市| 新建县| 稷山县| 收藏| 沅陵县| 舒兰市| 祁阳县|