Spark MLlib

The MLlib ALS algorithm takes its training data as an RDD, that is, a Resilient Distributed Dataset, of Rating objects (RDD[Rating]) and trains a model, which is returned as a MatrixFactorizationModel object.
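A minimal sketch of that training step using the PySpark flavor of the MLlib API may make the data flow easier to see. The sample ratings, the helper name train_als_model, and the values chosen for rank, iterations, and regularization are assumptions made for illustration only; pyspark is imported inside the function so the snippet can be loaded on a machine without Spark.

```python
# Hypothetical sample data: (user_id, product_id, rating) triples.
# Users 1 and 2 and products 10, 20, 30, 40 are made up for this sketch.
RAW_RATINGS = [
    (1, 10, 5.0), (1, 20, 4.0), (1, 30, 3.0),
    (2, 20, 4.0), (2, 30, 3.0), (2, 40, 5.0),
]


def train_als_model(sc, raw_ratings, rank=2, iterations=10, reg=0.01):
    """Train an MLlib ALS model from (user, product, rating) triples.

    `sc` is an existing SparkContext. The hyperparameter defaults are
    illustrative assumptions, not recommended settings.
    """
    # Imported lazily so this module loads even where pyspark is absent.
    from pyspark.mllib.recommendation import ALS, Rating

    # Build the RDD of Rating objects that ALS expects as training data.
    ratings = sc.parallelize(raw_ratings).map(
        lambda r: Rating(int(r[0]), int(r[1]), float(r[2]))
    )
    # ALS.train returns a MatrixFactorizationModel.
    return ALS.train(ratings, rank, iterations=iterations, lambda_=reg)
```

With a local Spark installation, one would create a context with SparkContext("local[*]", "als-demo"), call train_als_model(sc, RAW_RATINGS), and then ask the returned model for an estimate with model.predict(1, 40). With so few ratings the estimate is not meaningful; the point is only the shape of the input and output.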

An RDD is a special data type supported by Spark. RDDs are immutable, are distributed across the nodes of a cluster, and can be operated on in parallel. One can perform transformations and actions on an RDD.
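Immutability in practice means that an operation such as map or filter returns a new RDD and leaves the original untouched. The sketch below, written against the PySpark RDD API, chains two such operations; the function name double_then_keep_large and the threshold value are hypothetical.

```python
def double_then_keep_large(rdd, threshold=4):
    """Double every element, then keep values above `threshold`.

    Both map and filter return a new RDD; `rdd` itself is not modified.
    The threshold default is an arbitrary value chosen for illustration.
    """
    return rdd.map(lambda x: x * 2).filter(lambda x: x > threshold)


# Usage with a real SparkContext (assumes a local Spark installation):
#   from pyspark import SparkContext
#   sc = SparkContext("local[*]", "rdd-demo")
#   numbers = sc.parallelize([1, 2, 3, 4])
#   result = double_then_keep_large(numbers)
#   result.collect()    # triggers the computation on the new RDD
#   numbers.collect()   # the source RDD still holds its original elements
```

Nothing is computed until collect is called; up to that point Spark only records how the new RDD is derived from the old one.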

The technique we are using here is known as Collaborative Filtering. Let's assume User A likes Product A, Product B, and Product C and rated each of them with a score. Then, let's assume User B likes Product B, Product C, and Product D and gave Product B and Product C ratings similar to the ones User A gave them. Now, using Collaborative Filtering, one can estimate how User A would rate Product D or how User B would rate Product A, because there is some commonality between User A and User B: they both rated Product B and Product C similarly.
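The User A and User B scenario can be worked through in a few lines of plain Python. This is a neighborhood-style sketch that weights other users' ratings by cosine similarity; it is not the matrix factorization that MLlib ALS performs, and the rating values are invented for the example.

```python
from math import sqrt

# Hypothetical scores; only the overlap on Product B and Product C matters.
ratings = {
    "User A": {"Product A": 5.0, "Product B": 4.0, "Product C": 3.0},
    "User B": {"Product B": 4.0, "Product C": 3.0, "Product D": 5.0},
}


def cosine_similarity(u, v):
    """Cosine similarity of two users over the products both have rated."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[p] * v[p] for p in common)
    norm_u = sqrt(sum(u[p] ** 2 for p in common))
    norm_v = sqrt(sum(v[p] ** 2 for p in common))
    return dot / (norm_u * norm_v)


def predict(all_ratings, user, product):
    """Similarity-weighted average of other users' ratings for `product`."""
    numerator = 0.0
    denominator = 0.0
    for other, other_ratings in all_ratings.items():
        if other == user or product not in other_ratings:
            continue
        sim = cosine_similarity(all_ratings[user], other_ratings)
        numerator += sim * other_ratings[product]
        denominator += abs(sim)
    # None signals that no other user provides evidence for this product.
    return numerator / denominator if denominator else None


print(predict(ratings, "User A", "Product D"))
print(predict(ratings, "User B", "Product A"))
```

Because the two users agree exactly on Product B and Product C in this toy data, each one simply inherits the other's rating for the product they have not seen. With more users, the estimate becomes a blend weighted toward the most similar raters.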
