Statistical inference

What developer has not, at some point in his or her career, had to create sample or test data? For example, I've often created a simple script that generates a random number (bounded by the number of possible options or choices) and then uses that number as the selected option in my test recordset. This might work well for data development, but for statistics and data science it is not sufficient.
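The kind of script described above can be sketched roughly as follows. This is only an illustration: the function name `make_test_records`, the option list, and the seed are hypothetical and not taken from any particular project.

```python
import random

def make_test_records(options, n, seed=None):
    """Build n test records by picking a random option index for each one."""
    rng = random.Random(seed)  # a seeded generator keeps test data repeatable
    records = []
    for _ in range(n):
        # The random number is bounded by the number of possible options.
        index = rng.randrange(len(options))
        records.append(options[index])
    return records

options = ["red", "green", "blue"]  # hypothetical choices
records = make_test_records(options, 10, seed=42)
print(records)
```

Every option is equally likely here, which is exactly why this approach falls short for statistics: nothing about the generated values reflects how real data is distributed.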

To create sample data (or a sample population), the data scientist will use a process called statistical inference, which is the process of deducing properties of an underlying distribution by analyzing the data you have or are trying to generate data for. The process is sometimes called inferential statistical analysis, and it includes testing hypotheses and deriving estimates.

When the data scientist determines that a recordset should be larger than it actually is, the recordset is assumed to be a sample from a larger population, and the data scientist then uses statistical inference to make up the difference.
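One way to picture "making up the difference" is to estimate a distribution from the observed records and then draw additional records from it. The sketch below assumes the values are roughly normally distributed, which is an assumption made for illustration and would need to be checked against real data. The observed values and the target size are made up.

```python
from statistics import NormalDist

# Hypothetical observed recordset (for example, eight measurements).
observed = [4.9, 5.1, 5.0, 5.3, 4.7, 5.2, 4.8, 5.0]

# Infer the parameters of the underlying distribution from the sample.
# ASSUMPTION: the population is approximately normal.
fitted = NormalDist.from_samples(observed)
print(f"estimated mean={fitted.mean:.3f}, stdev={fitted.stdev:.3f}")

# Draw enough additional records to reach the desired recordset size.
target_size = 20
extra = fitted.samples(target_size - len(observed), seed=1)
expanded = observed + extra
print(len(expanded))
```

Unlike the uniform random picker, the generated records follow the shape inferred from the observed data, so the expanded recordset stays plausible as a sample from the same population.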

The data or recordset in use is referred to by the data scientist as the observed data. Inferential statistics can be contrasted with descriptive statistics, which is concerned only with the properties of the observed data and does not assume that the recordset came from a larger population.
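The contrast can be made concrete with a small example. The first part below only describes the observed data; the second part makes a claim about the larger population by estimating a 95% confidence interval for its mean. The data is hypothetical, and the interval uses a normal approximation, which is an assumption; for a sample this small, a t-distribution would usually be preferred.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev

# Hypothetical observed data.
observed = [4.9, 5.1, 5.0, 5.3, 4.7, 5.2, 4.8, 5.0]

# Descriptive statistics: properties of the observed data only.
sample_mean = mean(observed)
sample_sd = stdev(observed)
print(f"sample mean={sample_mean:.3f}, sample stdev={sample_sd:.3f}")

# Inferential statistics: an estimate about the larger population.
# ASSUMPTION: normal approximation for the sampling distribution of the mean.
z = NormalDist().inv_cdf(0.975)  # two-sided 95% critical value
margin = z * sample_sd / sqrt(len(observed))
ci = (sample_mean - margin, sample_mean + margin)
print(f"95% CI for population mean: ({ci[0]:.3f}, {ci[1]:.3f})")
```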