官术网_书友最值得收藏!

Data cleaning

Data cleaning, also known as data cleansing or data scrubbing, is a process consisting of the following steps:

  1. Identifying inaccurate, incomplete, irrelevant, or corrupted data to remove it from further processing
  2. Parsing data, extracting information of interest, or validating whether a string of data is in an acceptable format
  3. Transforming data into a common encoding format, for example, UTF-8 or int32, time scale, or a normalized range
  4. Transforming data into a common data schema; for instance, if we collect temperature measurements from different types of sensors, we might want them to have the same structure
主站蜘蛛池模板: 修水县| 佛冈县| 会昌县| 汉源县| 乐亭县| 嫩江县| 东至县| 温州市| 安多县| 潍坊市| 安平县| 建德市| 汽车| 隆昌县| 乐平市| 灵寿县| 太湖县| 兴仁县| 邵武市| 郑州市| 靖江市| 德江县| 宁安市| 阳朔县| 民勤县| 汉寿县| 开远市| 肇庆市| 武宁县| 金沙县| 加查县| 民乐县| 鞍山市| 汕头市| 临沂市| 项城市| 麟游县| 宽城| 那坡县| 静海县| 聊城市|