官术网_书友最值得收藏!

Spark MLlib

MLlib ALS algorithm takes the training data of the RDD type, that is, Distributed Datasets [Rating] and trains a model, which is a MatrixFactorizationModel object.

RDD  is a special data type supported by Spark. The RDD format is immutable, and they run on clusters and can operate in Parallel. One can perform on an RDD class.

The technique we are using here is known as . Let's assume User A likes Product A, Product B, and Product C and rated them with a score. Then, let's assume User B likes Product B, Product C, and Product D and gave a similar rating to the score User A gave for Product B and Product C. Now, using Collaborative Filtering, one can find out what User A would rate for Product D or what User B would rate for Product A as we have some commonality between User A and User B--they both rated Product B and Product C similarly.

主站蜘蛛池模板: 马关县| 林甸县| 易门县| 都江堰市| 镇赉县| 开远市| 竹北市| 无棣县| 庆阳市| 天全县| 九龙城区| 象山县| 宁海县| 平和县| 澄迈县| 罗江县| 班戈县| 昭觉县| 汉阴县| 茂名市| 五大连池市| 华容县| 肇庆市| 原平市| 左贡县| 济宁市| 彭泽县| 邛崃市| 新沂市| 噶尔县| 泰宁县| 娱乐| 拜城县| 鹿泉市| 靖安县| 咸丰县| 海口市| 新宾| 获嘉县| 苍山县| 宕昌县|