官术网_书友最值得收藏!

Data pre-processing

In this step, we apply some conversions to our data to make it consistent and concrete. There are lots of different conversions that you can consider while pre-processing your data:

  • Renaming (relabeling): This means converting categorical values to numbers, as categorical values are dangerous if used with some learning methods, and also numbers will impose an order between the values
  • Rescaling (normalization): Transforming/bounding continuous values to some range, typically [-1, 1] or [0, 1]
  • New features: Making up new features from the existing ones. For example, obesity-factor = weight/height
主站蜘蛛池模板: 惠来县| 西丰县| 晋州市| 安多县| 新龙县| 佛山市| 德保县| 沧州市| 新竹市| 富顺县| 盐亭县| 封丘县| 宜昌市| 宜黄县| 凤凰县| 格尔木市| 连江县| 达尔| 蓬莱市| 咸阳市| 深泽县| 柳江县| 五峰| 兴业县| 安泽县| 奉新县| 凯里市| 鹤庆县| 沛县| 双桥区| 天等县| 太白县| 霍州市| 绥棱县| 黑山县| 新和县| 贞丰县| 民勤县| 盐亭县| 安化县| 临清市|