Preface

 

"It has been said that you don't really understand something until you have taught it to someone else. The truth is that you don't really understand it until you have taught it to a computer; that is, implemented it as an algorithm."

   

— Donald Knuth

As Don Knuth so wisely said, the best way to understand something is to implement it. This book will help you understand some of the most important algorithms in data science by showing you how to implement them in the Java programming language.

The algorithms and data management techniques presented here are often categorized under the general fields of data science, data analytics, predictive analytics, artificial intelligence, business intelligence, knowledge discovery, machine learning, data mining, and big data. We have included many that are relatively new, surprisingly powerful, and quite exciting. For example, the ID3 classification algorithm, the K-means and K-medoids clustering algorithms, Amazon's recommender system, and Google's PageRank algorithm now affect nearly everyone who uses an electronic device on the web.

We chose the Java programming language because it is one of the most widely used languages and because of the reasons that make it so: it is freely available everywhere; it is object-oriented; it has excellent support systems, such as powerful integrated development environments; its documentation system is efficient and very easy to use; and there is a multitude of open source libraries from third parties that support essentially all implementations that a data analyst is likely to use. It's no coincidence that systems such as MongoDB, which we study in Chapter 11, Big Data Analysis with Java, provide official Java drivers.
