Apache Spark Quick Start Guide
Apache Spark is a flexible framework that allows processing of batch and real-time data. Its unified engine has made it quite popular for big data use cases. This book will help you get started with Apache Spark 2.0 and write big data applications for a variety of use cases. It will also introduce you to Apache Spark, one of the most popular big data processing frameworks. Although this book is intended to help you get started with Apache Spark, it also focuses on explaining the core concepts. This practical guide provides a quick start to the Spark 2.0 architecture and its components. It teaches you how to set up Spark on your local machine. As we move ahead, you will be introduced to resilient distributed datasets (RDDs) and DataFrame APIs, and their corresponding transformations and actions. Then, we move on to the life cycle of a Spark application and learn about the techniques used to debug slow-running applications. You will also go through Spark's built-in modules for SQL, streaming, machine learning, and graph analysis. Finally, the book will lay out the best practices and optimization techniques that are key for writing efficient Spark applications. By the end of this book, you will have a sound fundamental understanding of the Apache Spark framework and you will be able to write and optimize Spark applications.
Latest chapters
- Leave a review - let other readers know what you think
- Other Books You May Enjoy
- Summary
- Speculative execution
- Code generation
- Join performance
Brand: 中圖公司
Listed: 2021-07-02 12:29:34
Publisher: Packt Publishing
The digital rights to this book are provided by 中圖公司, which has authorized 上海閱文信息技術有限公司 to produce and distribute it.