官术网_书友最值得收藏!

Looking into the basics of the Apriori algorithm

The Apriori algorithm is part of our affinity analysis methodology and deals specifically with finding frequent itemsets within the data. The basic procedure of Apriori builds up new candidate itemsets from previously discovered frequent itemsets. These candidates are tested to see if they are frequent, and then the algorithm iterates as explained here:

  1. Create initial frequent itemsets by placing each item in its own itemset. Only items with at least the minimum support are used in this step.
  2. New candidate itemsets are created from the most recently discovered frequent itemsets by finding supersets of the existing frequent itemsets.
  3. All candidate itemsets are tested to see if they are frequent. If a candidate is not frequent then it is discarded. If there are no new frequent itemsets from this step, go to the last step.
  4. Store the newly discovered frequent itemsets and go to the second step.
  5. Return all of the discovered frequent itemsets.

This process is outlined in the following workflow:

主站蜘蛛池模板: 宜兴市| 密云县| 沁阳市| 彭州市| 招远市| 星座| 巴楚县| 安远县| 深水埗区| 昭平县| 富裕县| 高邮市| 巫溪县| 新龙县| 榆林市| 罗源县| 冕宁县| 黄骅市| 周口市| 元谋县| 依兰县| 长春市| 金秀| 敖汉旗| 黔江区| 抚顺市| 元氏县| 永登县| 湖南省| 大宁县| 青海省| 宕昌县| 光泽县| 青阳县| 和政县| 静宁县| 襄垣县| 兰考县| 北海市| 盖州市| 什邡市|