官术网_书友最值得收藏!

Data cleaning

Most datasets require this step, in which you get rid of errors, noise, and redundancies. We need our data to be accurate, complete, reliable, and unbiased, as there are lots of problems that may arise from using bad knowledge base, such as:

  • Inaccurate and biased conclusions
  • Increased error
  • Reduced generalizability, which is the model's ability to perform well over the unseen data that it didn't train on previously
主站蜘蛛池模板: 江油市| 道真| 从化市| 柳江县| 岳阳县| 新丰县| 仙居县| 伊吾县| 哈密市| 萝北县| 义马市| 金塔县| 庄河市| 牟定县| 清徐县| 武穴市| 扬中市| 健康| 承德县| 衡水市| 泌阳县| 始兴县| 黄山市| 安多县| 安阳县| 双牌县| 湖州市| 赤水市| 永定县| 长乐市| 临西县| 隆德县| 桂东县| 和田县| 兴城市| 阿拉善左旗| 平阴县| 松桃| 张掖市| 贵南县| 广灵县|