官术网_书友最值得收藏!

Summary

In this chapter, we introduced data mining using Python. If you were able to run the code in this section (note that the full code is available in the supplied code package), then your computer is set up for much of the rest of the book. Other Python libraries will be introduced in later chapters to perform more specialized tasks.

We used the IPython Notebook to run our code, which allows us to immediately view the results of a small section of the code. This is a useful framework that will be used throughout the book.

We introduced a simple affinity analysis, finding products that are purchased together. This type of exploratory analysis gives an insight into a business process, an environment, or a scenario. The information from these types of analysis can assist in business processes, finding the next big medical breakthrough, or creating the next artificial intelligence.

Also, in this chapter, there was a simple classification example using the OneR algorithm. This simple algorithm simply finds the best feature and predicts the class that most frequently had this value in the training dataset.

Over the next few chapters, we will expand on the concepts of classification and affinity analysis. We will also introduce the scikit-learn package and the algorithms it includes.

主站蜘蛛池模板: 永顺县| 平江县| 白朗县| 时尚| 重庆市| 龙山县| 黎城县| 西林县| 沿河| 莆田市| 永仁县| 旅游| 崇文区| 徐州市| 江永县| 长寿区| 无为县| 深泽县| 岗巴县| 将乐县| 祁连县| 疏勒县| 莲花县| 西青区| 浠水县| 白朗县| 馆陶县| 抚州市| 邹城市| 荣昌县| 双柏县| 东辽县| 平凉市| 普兰店市| 夏津县| 新安县| 阿尔山市| 高唐县| 洛南县| 扶余县| 台州市|