官术网_书友最值得收藏!

Advanced Feature Selection in Linear Models

"I found that math got to be too abstract for my liking and computer science seemed concerned with little details--trying to save a microsecond or a kilobyte in a computation. In statistics I found a subject that combined the beauty of both math and computer science, using them to solve real-world problems."

This was quoted by Rob Tibshirani, Professor, Stanford University at:

https://statweb.stanford.edu/~tibs/research_page.html.

So far, we've examined the usage of linear models for both quantitative and qualitative outcomes with an emphasis on the techniques of feature selection, that is, the methods and techniques to exclude useless or unwanted predictor variables. We saw that the linear models can be quite effective in machine learning problems. However, newer techniques that have been developed and refined in the last couple of decades or so can improve predictive ability and interpretability above and beyond the linear models that we discussed in the preceding chapters. In this day and age, many datasets have numerous features in relation to the number of observations or, as it is called, high-dimensionality. If you've ever worked on a genomics problem, this will quickly become self-evident. Additionally, with the size of the data that we are being asked to work with, a technique like best subsets or stepwise feature selection can take inordinate amounts of time to converge even on high-speed computers. I'm not talking about minutes: in many cases, hours of system time are required to get a best subsets solution.

There is a better way in these cases. In this chapter, we will look at the concept of regularization where the coefficients are constrained or shrunk towards zero. There are a number of methods and permutations to these methods of regularization but we will focus on Ridge regression, Least Absolute Shrinkage and Selection Operator (LASSO), and finally, elastic net, which combines the benefit of both techniques into one.

主站蜘蛛池模板: 平顺县| 海兴县| 阿鲁科尔沁旗| 崇左市| 民权县| 莆田市| 田林县| 循化| 平泉县| 本溪市| 汕头市| 本溪市| 内黄县| 青海省| 和顺县| 茂名市| 股票| 南昌市| 保山市| 米林县| 个旧市| 玉树县| 清镇市| 都江堰市| 中牟县| 安溪县| 仪征市| 台北市| 麻阳| 海晏县| 确山县| 日照市| 中阳县| 米林县| 阳东县| 永定县| 邢台县| 平定县| 九龙坡区| 平泉县| 新化县|