官术网_书友最值得收藏!

Model selection

This step comes after selecting a proper subset of your input variables by using any dimensionality reduction technique. Choosing the proper subset of the input variable will make the rest of the learning process very simple.

In this step, you are trying to figure out the right model to learn.

If you have any prior experience with data science and applying learning methods to different domains and different kinds of data, then you will find this step easy as it requires prior knowledge of how your data looks and what assumptions could fit the nature of your data, and based on this you choose the proper learning method. If you don't have any prior knowledge, that's also fine because you can do this step by guessing and trying different learning methods with different parameter settings and choose the one that gives you better performance over the test set.

Also, initial data analysis and visualization will help you to make a good guess about the form of the distribution and nature of your data.

主站蜘蛛池模板: 五台县| 屯昌县| 德昌县| 商都县| 平利县| 平顺县| 资源县| 曲阳县| 四川省| 宿迁市| 荆门市| 萍乡市| 六安市| 广昌县| 徐汇区| 尚志市| 沁阳市| 任丘市| 临洮县| 达日县| 遂宁市| 鄂州市| 深州市| 博爱县| 措美县| 阿巴嘎旗| 浦县| 葫芦岛市| 鄂州市| 托里县| 满洲里市| 江都市| 射阳县| 崇仁县| 临海市| 德州市| 景德镇市| 调兵山市| 江门市| 尼勒克县| 南华县|