官术网_书友最值得收藏!

Computing descriptive statistics

In this section, we will review methods for obtaining descriptive statistics from data that is stored in a pandas DataFrame. We will use the pandas library to compute statistics from the data. So, let's jump right in!

DataFrames come equipped with many methods for computing common descriptive statistics for the data they contain. This is one of the advantages of storing data in DataFrames—working with data stored this way is easy. Getting common descriptive statistics, such as the mean, the median, the standard deviation, and more, is easy for data that is present in DataFrames. There are methods that can be called in order to quickly compute each of these. We will review several of these methods now.

If you want a basic set of descriptive statistics, just to get a sense of the contents of the DataFrame, consider using the describe() method. It includes the mean, standard deviation, an account of how much data there is, and the five-number summary built in.

Sometimes, the statistic that you want isn't a built-in DataFrame method. In this case, you will write a function that works for a pandas series, and then apply that function to each column using the apply() method.

主站蜘蛛池模板: 嘉义县| 黑龙江省| 昆明市| 礼泉县| 新晃| 青川县| 柘荣县| 商南县| 文昌市| 石棉县| 惠安县| 嘉义县| 苍山县| 岑巩县| 长治县| 翼城县| 蓬安县| 读书| 封开县| 梅河口市| 江口县| 曲麻莱县| 敖汉旗| 金山区| 焦作市| 北海市| 雷山县| 成都市| 汝州市| 淮滨县| 赤壁市| 海南省| 新邵县| 龙门县| 黄山市| 龙游县| 宝清县| 斗六市| 许昌市| 无极县| 德令哈市|