官术网_书友最值得收藏!

Summary

In this chapter, we went on a tour of the how and why of pandas, data manipulation/analysis, and science. This started with an overview of why pandas exists, what functionality it contains, and how it relates to concepts of data manipulation, analysis, and data science.

Then we covered a process for data analysis to set a framework for why certain functions exist in pandas. These include retrieving data, organizing and cleaning it up, doing exploration, and then building a formal model, presenting your findings, and being able to share and reproduce the analysis.

Next, we covered several concepts involved in data and statistical modeling. This included covering many common analysis techniques and concepts, so as to introduce you to these and make you more familiar when they are explored in more detail in subsequent chapters.

pandas is also a part of a larger Python ecosystem of libraries that are useful for data analysis and science. While this book will focus only on pandas, there are other libraries that you will come across and that were introduced so you are familiar with them when they crop up.

We are ready to begin using pandas. In the next chapter, we will begin to ease ourselves into pandas, starting with obtaining a Python and pandas environment, an overview of Jupyter notebooks, and then getting a quick introduction to pandas Series and DataFrame objects before delving into them im more depth in subsequent elements of pandas.

主站蜘蛛池模板: 丰都县| 田林县| 云林县| 濮阳县| 花垣县| 文登市| 沿河| 加查县| 平和县| 岚皋县| 柳河县| 金阳县| 吉隆县| 咸阳市| 榆林市| 平陆县| 开江县| 张家川| 浦江县| 嵩明县| 昔阳县| 延庆县| 连山| 曲阜市| 富蕴县| 花莲市| 塘沽区| 高台县| 新闻| 洪雅县| 石屏县| 陕西省| 弥渡县| 鹤峰县| 吉首市| 从化市| 土默特右旗| 横峰县| 五河县| 平度市| 泽州县|