官术网_书友最值得收藏!

tapply

Now, suppose instead of prices for 2018 only, we have prices for these items for 2017, 2016, and 2015 as well. This new data frame is defined as follows:

all_prices = data.frame(items = rep(c("potato", "rice", "oil"), 4), 
jan_price = c(10, 20, 30, 10, 18, 25, 9, 17, 24, 9, 19,27),
mar_price = c(11, 22, 33, 13, 25, 32, 12, 21, 33, 15, 27,39),
june_price = c(20, 25, 33, 21, 24, 40, 17, 22, 27, 13, 18,23)
)
all_prices

The output for the preceding lines of code can be seen as follows:

Now suppose we want to take the mean price of different items for very March in all years. We can do this by using tapply(numerical_variable, categorical_variable, function). So, we will need to convert the items column of the all_prices data frame to a categorical variable to take the mean price.

tapply(all_prices$mar_price, factor(all_prices$items), mean)

This gives us a mean March price for oil, potato, and rice in all years, as follows:

Note the use of factor() to convert the items column to a factor variable.

There are other apply functions, but that's it for now, folks. We will introduce new functions as and when it will be necessary as we proceed to new chapters for geospatial analysis.

To install a new package, we need to write install.packages("package_name"), and to use any package, we need to write load.packages("package_name").

主站蜘蛛池模板: 襄城县| 略阳县| 茌平县| 武穴市| 涿鹿县| 衡山县| 松桃| 大城县| 玉环县| 平顶山市| 寿宁县| 建湖县| 东乡| 浠水县| 恭城| 冷水江市| 武城县| 阿拉尔市| 平凉市| 金堂县| 洪江市| 铁岭市| 石河子市| 合作市| 南涧| 海盐县| 唐山市| 靖宇县| 卢龙县| 蒙自县| 灵山县| 惠水县| 滁州市| 西和县| 揭西县| 衡东县| 理塘县| 楚雄市| 建始县| 瑞安市| 奉节县|