官术网_书友最值得收藏!

The process

The CRISP-DM process was designed specifically for the data mining. However, it is flexible and thorough enough that it can be applied to any analytical project, whether it is predictive analytics, data science, or machine learning. Don't be intimidated by the numerous list of tasks as you can apply your judgment to the process and adapt it for any real-world situation. The following figure provides a visual representation of the process and shows the feedback loops, which facilitate its flexibility:

The process

Figure from CRISP-DM 1.0, Step-by-step data mining guide

The process has the following six phases:

  • Business Understanding
  • Data Understanding
  • Data Preparation
  • Modeling
  • Evaluation
  • Deployment

For an in-depth review of the entire process with all of its tasks and subtasks, you can examine the paper by SPSS, CRISP-DM 1.0, step-by-step data mining guide, available at https://the-modeling-agency.com/crisp-dm.pdf.

I will discuss each of the steps in the process, covering the important tasks. However, it will not be in the detailed level of the guide, but more high level. We will not skip any of the critical details but focus more on the techniques that one can apply to the tasks. Keep in mind that the process steps will be used in the later chapters as a framework in the actual application of the machine learning methods in general and the R code specifically.

主站蜘蛛池模板: 咸丰县| 林芝县| 长岭县| 防城港市| 浦北县| 黄陵县| 民权县| 德阳市| 屯昌县| 珠海市| 镇平县| 南靖县| 罗甸县| 会宁县| 两当县| 上栗县| 通州区| 新建县| 黔西县| 洛南县| 修武县| 和硕县| 新密市| 平罗县| 华安县| 科技| 绥芬河市| 武功县| 平舆县| 新密市| 浮山县| 福清市| 江阴市| 朔州市| 富蕴县| 隆尧县| 五莲县| 辽阳县| 英超| 新竹市| 开平市|