Data cleaning

Data cleaning, also known as data cleansing or data scrubbing, is a process consisting of the following steps:

  1. Identifying inaccurate, incomplete, irrelevant, or corrupted data to remove it from further processing
  2. Parsing data, extracting information of interest, or validating whether a string of data is in an acceptable format
  3. Transforming data into a common representation, for example, a single character encoding (UTF-8), numeric type (int32), time scale, or normalized range
  4. Transforming data into a common data schema; for instance, if we collect temperature measurements from different types of sensors, we might want them to have the same structure
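
The four steps above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the sample readings, the field names (`temp_c`, `temperature_f`, `ts`, `time`), the target schema (`sensor_id`, `celsius`, `time_utc`), and the helper `clean_record` are all hypothetical and do not come from any particular sensor or library.

```python
import re
from datetime import datetime, timezone

# Hypothetical raw readings from two sensor types (made-up data).
RAW = [
    {"sensor": "a1", "temp_c": "21.5", "ts": "2024-01-01T12:00:00+00:00"},
    {"sensor": "b7", "temperature_f": "70.7", "time": 1704110400},
    {"sensor": "a2", "temp_c": "n/a", "ts": "2024-01-01T12:05:00+00:00"},
    {"sensor": "b8", "temperature_f": "", "time": 1704110700},
]

# Accepts plain decimal numbers such as "21.5" or "-3".
NUMBER = re.compile(r"^-?\d+(\.\d+)?$")


def clean_record(record):
    """Return the record in the common schema, or None if it is unusable."""
    # Step 2: pick out the fields of interest for each sensor type.
    if "temp_c" in record:
        text, unit = record["temp_c"], "C"
        when = datetime.fromisoformat(record["ts"])
    elif "temperature_f" in record:
        text, unit = record["temperature_f"], "F"
        when = datetime.fromtimestamp(record["time"], tz=timezone.utc)
    else:
        return None

    # Steps 1 and 2: validate the format and drop unusable values.
    text = text.strip()
    if not NUMBER.match(text):
        return None

    # Step 3: convert to a common unit (Celsius) and time scale (UTC).
    value = float(text)
    if unit == "F":
        value = (value - 32.0) * 5.0 / 9.0

    # Step 4: emit every reading with the same structure.
    return {
        "sensor_id": record["sensor"],
        "celsius": round(value, 2),
        "time_utc": when.astimezone(timezone.utc).isoformat(),
    }


cleaned = [r for r in map(clean_record, RAW) if r is not None]
for row in cleaned:
    print(row)
```

Here the two readings with missing or malformed temperatures are discarded, and the remaining ones share one structure regardless of which sensor type produced them. Whether to drop, flag, or impute bad values depends on the application; dropping is simply the easiest choice to show.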