Fitting aggregated data to the gamma distribution

The gamma distribution can be used to model the size of insurance claims, rainfall, and the inter-spike intervals of neurons in the brain. The PDF of the gamma distribution is defined by shape k and scale θ as follows:

$$ f(x; k, \theta) = \frac{x^{k-1} e^{-x/\theta}}{\theta^{k}\,\Gamma(k)}, \qquad x > 0,\; k > 0,\; \theta > 0 $$

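There is also a definition that uses an inverse scale (rate) parameter β = 1/θ. SciPy's gamma is parameterized by a shape a and a scale, so a rate has to be passed as scale = 1/β. The mean and variance of the gamma distribution are described by (3.3) and (3.4):

$$ \mu = k\theta \tag{3.3} $$

$$ \sigma^2 = k\theta^2 \tag{3.4} $$

As you can see, we can estimate the shape parameter from the mean and variance using simple algebra: dividing the square of (3.3) by (3.4) gives k = μ²/σ².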
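
The algebra can be checked numerically. For a gamma distribution with shape k and scale θ, the mean is kθ and the variance is kθ², so k = mean²/variance and θ = variance/mean. The following is a minimal sketch of that calculation; the helper name gamma_moments_to_params and the example values (mean 6, variance 12) are made up for illustration and are not part of the recipe:

```python
from scipy.stats import gamma


def gamma_moments_to_params(mean, var):
    """Method-of-moments estimates (shape k, scale theta) for a gamma distribution.

    Hypothetical helper for illustration only.
    """
    k = mean ** 2 / var   # shape: mean^2 / variance
    theta = var / mean    # scale: variance / mean
    return k, theta


# Illustrative values, not taken from the rain data
k, theta = gamma_moments_to_params(mean=6.0, var=12.0)
rate = 1.0 / theta  # the inverse scale used by the alternative definition

# Round trip through SciPy: the shape is the first argument, scale is a keyword
m, v = gamma.stats(k, scale=theta, moments='mv')
print(k, theta, rate, float(m), float(v))
```

The round trip through gamma.stats should recover the mean and variance we started from, which confirms that the shape and scale are passed to SciPy in the expected way.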

How to do it...

Let's fit aggregates for the rain data for January to the gamma distribution:

  1. Start with the following imports:
    from scipy.stats import gamma
    import matplotlib.pyplot as plt
    import dautil as dl
    import pandas as pd
    from IPython.display import HTML
  2. Load the data and select aggregates for January:
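    # Monthly means; on pandas older than 0.18, drop .mean() because
    # resample() aggregated by mean implicitly
    rain = dl.data.Weather.load()['RAIN'].resample('M').mean().dropna()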
    rain = dl.ts.groupby_month(rain)
    rain = rain.get_group(1)
  3. Derive a value for k from the mean and variance of the distribution, and use it to fit the data:
    dist = dl.stats.Distribution(rain, gamma)
    
    a = (dist.mean() ** 2)/dist.var()
    shape, loc, scale = dist.fit(a)
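
If dautil is not available, step 3 can be approximated with SciPy alone. The following is a rough sketch rather than the book's implementation: it uses synthetic gamma-distributed values in place of the January rain aggregates (the true shape and scale, the sample size, and the seed are arbitrary assumptions), and it fixes the location at zero with floc=0, which the recipe itself does not do:

```python
import numpy as np
from scipy.stats import gamma

# Synthetic stand-in for the January aggregates (assumed values)
rng = np.random.default_rng(42)
true_shape, true_scale = 2.0, 30.0
sample = rng.gamma(shape=true_shape, scale=true_scale, size=2000)

# Method-of-moments estimate of the shape: mean^2 / variance
a0 = sample.mean() ** 2 / sample.var()

# Maximum likelihood fit. a0 is passed as the starting estimate for the
# shape; floc=0 holds the location at zero (an assumption made here for
# strictly positive data, not something the recipe specifies).
shape, loc, scale = gamma.fit(sample, a0, floc=0)

print('moment estimate of shape:', a0)
print('fitted shape, loc, scale:', shape, loc, scale)
```

With the location held at zero, the fitted shape times the fitted scale should match the sample mean, which is a quick sanity check on the result.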

The rest of the code is similar to the code in Fitting data to the exponential distribution. Refer to the following screenshot for the end result (the code is in the fitting_gamma.ipynb file in this book's code bundle):

See also
