Modeling

Once data preparation is complete, the next phase is modeling. Here, you select an appropriate algorithm and use the data to train your model. There are a number of best practices to follow during this stage, and we will discuss them in detail, but the basic step is to split your data into training, validation, and testing sets. Splitting the data may seem counterintuitive, especially since more training data typically yields better models. As we'll see, however, holding some data back gives us better feedback on how the model will perform in the real world, and it helps us avoid the cardinal sin of modeling: overfitting, where a model memorizes its training data and then performs poorly on data it has not seen. We will talk more about this in later chapters.
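As a rough illustration of the three-way split described above, here is a minimal sketch in Python that uses only the standard library. The function name `split_dataset`, the 70/15/15 proportions, and the fixed seed are assumptions chosen for this example rather than anything prescribed by the text. In practice you would likely use a library helper instead.

```python
import random


def split_dataset(rows, train_frac=0.7, val_frac=0.15, seed=42):
    """Shuffle rows and split them into training, validation, and test lists.

    Hypothetical helper for illustration; the fractions are assumptions.
    Whatever is left after the training and validation slices becomes the test set.
    """
    if train_frac <= 0 or val_frac < 0 or train_frac + val_frac >= 1:
        raise ValueError("fractions must leave room for a non-empty test set")

    shuffled = list(rows)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)

    n_train = round(len(shuffled) * train_frac)
    n_val = round(len(shuffled) * val_frac)

    train = shuffled[:n_train]
    validation = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, validation, test


# Example usage with 100 stand-in records.
train, validation, test = split_dataset(range(100))
print(len(train), len(validation), len(test))
```

The model is fit on the training set only. The validation set is used to compare candidate models or settings, and the test set is kept aside for a final estimate of performance on unseen data.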
