官术网_书友最值得收藏!

Summary

In this chapter, we established the framework for the different data processing units that will be introduced in this book. There is a very good reason why the topics of model validation and overfitting are treated early on in this book: there is no point in building models and selecting algorithms if we do not have a methodology to evaluate their relative merits.

In this chapter, you were introduced to the following topics:

  • The concept of monadic transformation for implicit and explicit models
  • The versatility and cleanness of the cake pattern and mixin composition in Scala as an effective scaffolding tool for data processing
  • A robust methodology to validate machine learning models
  • The challenge in fitting models to both training and real-world data

The next chapter will address the problem of overfitting by identifying outliers and reducing noise in data.

主站蜘蛛池模板: 广河县| 白城市| 临江市| 宁阳县| 北辰区| 海兴县| 永定县| 黔西县| 新田县| 舒兰市| 望江县| 合川市| 中山市| 梁山县| 宜阳县| 商都县| 遵化市| 石嘴山市| 长沙市| 太谷县| 吉木乃县| 玉树县| 依兰县| 盐源县| 大石桥市| 西乡县| 济阳县| 钟祥市| 临颍县| 西和县| 石家庄市| 健康| 柏乡县| 科技| 无极县| 高阳县| 班玛县| 兴宁市| 沅陵县| 东阳市| 峨眉山市|