官术网_书友最值得收藏!

Chapter 2. Manipulating Data with Breeze

Data science is, by and large, concerned with the manipulation of structured data. A large fraction of structured datasets can be viewed as tabular data: each row represents a particular instance, and columns represent different attributes of that instance. The ubiquity of tabular representations explains the success of spreadsheet programs like Microsoft Excel, or of tools like SQL databases.

To be useful to data scientists, a language must support the manipulation of columns or tables of data. Python does this through NumPy and pandas, for instance. Unfortunately, there is no single, coherent ecosystem for numerical computing in Scala that quite measures up to the SciPy ecosystem in Python.

In this chapter, we will introduce Breeze, a library for fast linear algebra and manipulation of data arrays as well as many other features necessary for scientific computing and data science.

主站蜘蛛池模板: 平果县| 五指山市| 信丰县| 错那县| 平泉县| 宣化县| 元阳县| 仪征市| 华亭县| 榆林市| 融水| 金湖县| 吴堡县| 内丘县| 阳西县| 广宁县| 平乡县| 科尔| 鄢陵县| 辽源市| 新竹市| 霍林郭勒市| 罗源县| 辰溪县| 叶城县| 沾化县| 宣武区| 楚雄市| 富裕县| 拉孜县| 永兴县| 云阳县| 青州市| 科技| 裕民县| 威海市| 桃江县| 阳原县| 巴林左旗| 盈江县| 瑞丽市|