官术网_书友最值得收藏!

Cleaning data

When working with data, you can generally expect to find human errors, missing entries, and numerical outliers. These types of errors usually need to be corrected, handled, or removed to prepare a dataset for analysis.

In Chapter 5, Manipulating Text Data - An Introduction to Regular Expressions, I will demonstrate how to use regular expressions, a tool to identify, extract, and modify patterns in text data. Chapter 5, Manipulating Text Data - An Introduction to Regular Expressions, includes a project to use regular expressions to extract street names.

In Chapter 6Cleaning Numerical Data - An Introduction to R and Rstudio, I will demonstrate how to use RStudio to conduct two common tasks for cleaning numerical data: outlier detection and NA handling.

主站蜘蛛池模板: 新沂市| 依兰县| 长海县| 吴江市| 霍林郭勒市| 江阴市| 大兴区| 大方县| 万州区| 台前县| 双柏县| 鄂托克前旗| 长治市| 宝应县| 宜春市| 丁青县| 绿春县| 东至县| 永福县| 凤山市| 卫辉市| 长沙县| 明星| 屏山县| 海伦市| 集安市| 襄樊市| 桑日县| 梁山县| 泰和县| 鄂托克前旗| 利川市| 武隆县| 苏州市| 中江县| 涡阳县| 高州市| 高淳县| 河东区| 蓬溪县| 中西区|