官术网_书友最值得收藏!

Data cleaning

Data cleaning, also known as data cleansing or data scrubbing, is a process consisting of the following steps:

  1. Identifying inaccurate, incomplete, irrelevant, or corrupted data to remove it from further processing
  2. Parsing data, extracting information of interest, or validating whether a string of data is in an acceptable format
  3. Transforming data into a common encoding format, for example, UTF-8 or int32, time scale, or a normalized range
  4. Transforming data into a common data schema; for instance, if we collect temperature measurements from different types of sensors, we might want them to have the same structure
主站蜘蛛池模板: 广水市| 怀集县| 昌图县| 昭觉县| 荥阳市| 宁城县| 台湾省| 临安市| 叶城县| 宜昌市| 大足县| 兰州市| 丰宁| 紫云| 越西县| 湄潭县| 班玛县| 临湘市| 政和县| 伊川县| 黄龙县| 阿拉尔市| 淮北市| 平和县| 土默特左旗| 肇东市| 平潭县| 新巴尔虎左旗| 应用必备| 顺平县| 喀什市| 简阳市| 临清市| 苏尼特右旗| 万宁市| 新河县| 昭觉县| 通州区| 新干县| 贡山| 宁海县|