
Chapter 2. Introduction to R Programming Language and Statistical Environment

In Chapter 1, The era of "Big Data", you became familiar with the most useful Big Data terminology and a small selection of typical tools applied to unusually large or complex data sets. You also gained essential insights into how R was developed and how it became the leading statistical computing environment and programming language favored by technology giants and the best universities in the world. In this chapter, you will learn some of the most important R functions from the base R installation and from well-known third-party packages used for data crunching, transformation, and analysis. More specifically, in this chapter you will:

  • Understand the landscape of available R data structures
  • Be guided through a number of R operations allowing you to import data from standard and proprietary data formats
  • Carry out essential data cleaning and processing activities such as subsetting, aggregating, creating contingency tables, and so on
  • Inspect the data by implementing a selection of Exploratory Data Analysis techniques such as descriptive statistics
  • Apply basic statistical methods to estimate the correlation between two variables (Pearson's r), model relationships among several variables (multiple regression), and test for differences between the means of two groups (t-tests) or more than two groups (Analysis of Variance, ANOVA)
  • Be introduced to more advanced data modeling tasks like logistic and Poisson regressions