- Hands-On Data Science with R
- Vitor Bianchi Lanzetta Nataraj Dasgupta Ricardo Anjoleto Farias
- 313字
- 2021-06-10 19:12:31
Measures of central tendency
What if you had to describe the center of a distribution within a single number? Most people would appeal to one of these three estimators: mean, median, or mode. Those are probably the most popular measures of central tendency. Let's begin by sampling data from an arbitrary distribution. Get into your R console and try the following code:
set.seed(10)
small_sample <- rnorm(n = 10, mean = 10, sd = 5)
big_sample <- rnorm(n = 10^5, mean = 10, sd = 5)
The first line is setting the seed number to work with our random number generator (RNG). Every time there's a need to rely on a pseudo-random process, the set.seed() function will make sure your code is reproducible (at least at some level). By setting it to 10 you will get the same numbers that I'm getting from the preceding code lines.
The two last lines are sorting pseudo-random numbers from a normally distributed variable. Call the rnom() function to sort variables from a normal distribution. Choose the number of observations sorted by adjusting the n parameter. Modify the mean and sd parameters if you want a mean and standard deviation different from 0 and 1 respectively.
In the real world, you will hardly know for sure what underlying process is ruling your data, but here we do know beforehand that our numbers come from a normally distributed variable with a mean of 10 and a standard deviation of 5 units. We gathered two samples. The one called small_sample has only 10 observations, while big_sample sums up to 100,000 observations. Even though both come from similar distributions we will see how estimates behave with respect to sample sizes.
- 樂高機器人:WeDo編程與搭建指南
- Oracle SOA Governance 11g Implementation
- LabVIEW虛擬儀器從入門到測控應用130例
- 離散事件系統建模與仿真
- STM32G4入門與電機控制實戰:基于X-CUBE-MCSDK的無刷直流電機與永磁同步電機控制實現
- 現代機械運動控制技術
- AutoCAD 2012中文版繪圖設計高手速成
- Pentaho Analytics for MongoDB
- Learning ServiceNow
- 強化學習
- 青少年VEX IQ機器人實訓課程(初級)
- 三菱FX/Q系列PLC工程實例詳解
- 貫通開源Web圖形與報表技術全集
- 計算機硬件技術基礎(第2版)
- Oracle 11g Anti-hacker's Cookbook