官术网_书友最值得收藏!

Introduction

In the previous chapter, we covered how to integrate data from various data sources. However, simply collecting data is not enough; you also have to ensure the quality of the collected data. If the quality of data used is insufficient, the results of the analysis may be misleading due to biased samples or missing values. Moreover, if the collected data is not well structured and shaped, you may find it hard to correlate and investigate the data. Therefore, data preprocessing and preparation is an essential task that you must perform prior to data analysis.

Those of you familiar with how SQL operates may already understand how to use databases to process data. For example, SQL allows users to add new records with the insert operation, modify data with the update operation, and remove records with the delete operation. However, we do not need to move collected data back to the database; R already provides more powerful and convenient preprocessing functions and packages. In this chapter, we will cover how simple it is to perform data preprocessing in R.

主站蜘蛛池模板: 陆丰市| 华坪县| 韶关市| 永和县| 阿勒泰市| 榆林市| 庄浪县| 吐鲁番市| 石嘴山市| 青川县| 桦川县| 和田县| 咸宁市| 抚顺县| 旅游| 都匀市| 崇阳县| 江源县| 柞水县| 宜兴市| 禄劝| 禹城市| 镇雄县| 天气| 永定县| 张家口市| 大新县| 晴隆县| 康保县| 临武县| 堆龙德庆县| 昭通市| 绥宁县| 黄陵县| 渑池县| 新田县| 平武县| 承德市| 育儿| 晋城| 林西县|