官术网_书友最值得收藏!

Spark MLlib

MLlib ALS algorithm takes the training data of the RDD type, that is, Distributed Datasets [Rating] and trains a model, which is a MatrixFactorizationModel object.

RDD  is a special data type supported by Spark. The RDD format is immutable, and they run on clusters and can operate in Parallel. One can perform on an RDD class.

The technique we are using here is known as . Let's assume User A likes Product A, Product B, and Product C and rated them with a score. Then, let's assume User B likes Product B, Product C, and Product D and gave a similar rating to the score User A gave for Product B and Product C. Now, using Collaborative Filtering, one can find out what User A would rate for Product D or what User B would rate for Product A as we have some commonality between User A and User B--they both rated Product B and Product C similarly.

主站蜘蛛池模板: 宽甸| 鹰潭市| 化隆| 内丘县| 双牌县| 嘉峪关市| 建宁县| 南澳县| 贞丰县| 集贤县| 天柱县| 重庆市| 盐池县| 乌兰县| 罗源县| 铅山县| 灵台县| 嵩明县| 洪雅县| 临湘市| 湘潭市| 江阴市| 珲春市| 深泽县| 陆良县| 凤台县| 稷山县| 洪雅县| 阿荣旗| 永登县| 许昌县| 中牟县| 寿阳县| 万盛区| 晋中市| 霍邱县| 留坝县| 金华市| 开远市| 海城市| 巫溪县|