Summary

In this chapter, we looked at using probabilistic linear models to predict a qualitative response with two generalized linear model methods: logistic regression and multivariate adaptive regression splines. We explored weight of evidence and information value as a technique for univariate feature selection. We covered how to find the probability threshold that minimizes classification error. We also began using performance metrics such as AUC, log-loss, and ROC charts to compare models both visually and statistically. These metrics proved more informative than accuracy alone, especially when class labels are highly imbalanced. In the next chapter, we'll cover regularization methods for feature selection and how they can be used in training your algorithms. We'll see how to create a dataset, learn about ridge regression, and dive deeper into feature selection.
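
As a compact reminder of the univariate feature-selection idea recapped above, the following is a minimal sketch of weight of evidence (WOE) and information value (IV) for a single categorical feature and a binary response. It is written in Python for illustration only and is not the chapter's own code. The function name `woe_iv`, the toy data, and the sign convention (WOE as the log of the event share over the non-event share) are assumptions. Other references flip the ratio, which changes the sign of WOE but not the IV.

```python
import math
from collections import Counter


def woe_iv(feature, target):
    """Return ({level: woe}, iv) for a categorical feature and a 0/1 target.

    Assumed convention: WOE = ln(share of events / share of non-events).
    """
    events = Counter(f for f, y in zip(feature, target) if y == 1)
    non_events = Counter(f for f, y in zip(feature, target) if y == 0)
    total_events = sum(events.values())
    total_non_events = sum(non_events.values())

    woe = {}
    iv = 0.0
    for level in sorted(set(feature)):
        if events[level] == 0 or non_events[level] == 0:
            # A level with only one class makes the log ratio undefined;
            # in practice such levels are merged or the counts are smoothed.
            raise ValueError(f"level {level!r} lacks one of the two classes")
        pct_event = events[level] / total_events
        pct_non_event = non_events[level] / total_non_events
        woe[level] = math.log(pct_event / pct_non_event)
        # Each level contributes (difference in shares) * WOE to the IV.
        iv += (pct_event - pct_non_event) * woe[level]
    return woe, iv


# Hypothetical toy data: level "a" is mostly events, level "b" mostly non-events.
feature = ["a", "a", "a", "a", "b", "b", "b", "b"]
target = [1, 1, 1, 0, 1, 0, 0, 0]
woe, iv = woe_iv(feature, target)
print(woe, iv)
```

A feature whose levels separate events from non-events yields a larger IV. Continuous features would first need to be binned, which this sketch leaves out.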
