官术网_书友最值得收藏!

Modeling

Once the data preparation is complete, the next phase is modeling. Here, you will be selecting an appropriate algorithm and using the data to train your model. There are a number of best practices to adhere to during this stage, and we will discuss them in detail, but the basic steps involve splitting your data into training, testing, and validation sets. This splitting up of the data may seem illogical—especially when more data typically yields better models—but as we'll see, doing this allows us to get better feedback on how the model will perform in the real world, and prevents us from the cardinal sin of modeling: overfitting. We will talk more about this in later chapters.

主站蜘蛛池模板: 额尔古纳市| 临夏县| 新化县| 上栗县| 元氏县| 辉县市| 阳原县| 齐河县| 龙陵县| 鲜城| 大田县| 枣阳市| 巨鹿县| 万载县| 赣榆县| 湖州市| 广昌县| 武义县| 石楼县| 乐亭县| 哈巴河县| 新巴尔虎左旗| 昌乐县| 乌兰县| 丰宁| 台江县| 和顺县| 深州市| 澄江县| 浦县| 鹤壁市| 富锦市| 闽清县| 岱山县| 民和| 北川| 阳西县| 兰溪市| 武鸣县| 刚察县| 常熟市|