官术网_书友最值得收藏!

Manipulating Data with the Pandas Library

In the next few portions of the book, we are going to get our hands dirty by building the various kinds of recommender systems that were introduced in chapter one. However, before we do so, it is important that we know how to handle, manipulate, and analyze data efficiently in Python.

The datasets we'll be working with will be several megabytes in size. Historically, Python has never been well-known for its speed of execution. Therefore, analyzing such huge amounts of data using vanilla Python and the built-in data structures it provides us is simply impossible.

In this chapter, we're going to get ourselves acquainted with the pandas library, which aims to overcome the aforementioned limitations, making data analysis in Python extremely efficient and user-friendly. We'll also introduce ourselves to the Movies Dataset that we're going to use to build our recommenders as well as use pandas to extract some interesting facts and narrate the history of movies using data.

Disclaimer:
If you are already familiar with the pandas library, you may skip this chapter and move on to the next, Building an IMDB Top 250 Clone with p andas.

主站蜘蛛池模板: 富蕴县| 手游| 石嘴山市| 历史| 通渭县| 昂仁县| 崇仁县| 景谷| 曲靖市| 古交市| 加查县| 铜鼓县| 社会| 鄂州市| 邢台市| 黄山市| 丽江市| 辛集市| 闽侯县| 留坝县| 宁安市| 长岛县| 安乡县| 康马县| 东辽县| 章丘市| 文登市| 重庆市| 斗六市| 个旧市| 丹江口市| 亚东县| 泾阳县| 沙坪坝区| 上栗县| 辽宁省| 泽州县| 镇康县| 贺州市| 宕昌县| 桂东县|