官术网_书友最值得收藏!

The chi-square distribution

The chi-square statistics are defined by the following formula:

The chi-square distribution

Here, n is the size of the sample, s is the standard deviation of the sample, and σ is the standard deviation of the population.

If we repeatedly take samples and define the chi-square statistics, then we can form a chi-square distribution, which is defined by the following probability density function:

The chi-square distribution

Here, Y0 is a constant that depends on the number of degrees of freedom, Χ2 is the chi-square statistic, v = n - 1 is the number of degrees of freedom, and e is a constant equal to the base of the natural logarithm system.

Y0 is defined so that the area under the chi-square curve is equal to one.

The chi-square distribution

Chi-square for the goodness of fit

The Chi-square test can be used to test whether the observed data differs significantly from the expected data. Let's take the example of a dice. The dice is rolled 36 times and the probability that each face should turn upwards is 1/6. So, the expected distribution is as follows:

>>> expected = np.array([6,6,6,6,6,6])

The observed distribution is as follows:

>>> observed = observed = np.array([7, 5, 3, 9, 6, 6])

The null hypothesis in the chi-square test is that the observed value is similar to the expected value.

The chi-square can be performed using the chisquare function in the SciPy package:

>>> stats.chisquare(observed,expected)
(3.333333333333333, 0.64874235866759344)

The first value is the chi-square value and the second value is the p-value, which is very high. This means that the null hypothesis is valid and the observed value is similar to the expected value.

主站蜘蛛池模板: 延长县| 邵武市| 巴青县| 出国| 乳源| 崇州市| 文成县| 霍城县| 怀集县| 鄂伦春自治旗| 突泉县| 凤山县| 太原市| 高州市| 巴楚县| 河津市| 山东省| 海南省| 蒙山县| 桂林市| 禹州市| 潞西市| 陇西县| 江达县| 清新县| 东兰县| 吉首市| 蓬安县| 获嘉县| 信丰县| 方城县| 扬州市| 瓦房店市| SHOW| 托里县| 河北省| 渝中区| 麻城市| 临桂县| 上蔡县| 唐海县|