官术网_书友最值得收藏!

Binning

Sometimes it's useful to separate feature values into several bins. For example, we may be only interested whether it rained on a particular day. Given the precipitation values, we can binarize the values, so that we get a true value if the precipitation value is not zero, and a false value otherwise. We can also use statistics to divide values into high, low, and medium bins.

The binning process inevitably leads to loss of information. However, depending on your goals this may not be an issue, and actually reduce the chance of overfitting. Certainly there will be improvements in speed and memory or storage requirements.

主站蜘蛛池模板: 竹北市| 漳浦县| 黄平县| 敖汉旗| 凤翔县| 安乡县| 清徐县| 阳高县| 潢川县| 天津市| 衡东县| 日土县| 巨野县| 扎赉特旗| 靖远县| 达州市| 怀来县| 贵州省| 固始县| 冕宁县| 吉林省| 永善县| 靖江市| 襄樊市| 永胜县| 海兴县| 拜城县| 河北区| 乌兰察布市| 安化县| 长沙县| 轮台县| 禄丰县| 黔西县| 大安市| 奈曼旗| 建德市| 辽宁省| 酉阳| 丰原市| 纳雍县|