- Mastering Machine Learning with Spark 2.x
- Alex Tellez Max Pumperla Michal Malohlava
- 2021-07-02 18:46:06
Data science - an iterative process
The process flow of many big data projects is iterative: there is a lot of back-and-forth as you test new ideas, add new features, tweak hyper-parameters, and so on, all with a fail-fast attitude. The end result of these projects is usually a model that can answer the question being posed. Notice that we didn't say it answers the question accurately! A common pitfall for data scientists is failing to build a model that generalizes to new data: the model has overfit the training data and so gives poor results when it sees data it was not trained on. What counts as acceptable accuracy is extremely task-dependent and is usually dictated by business needs, with some sensitivity analysis done to weigh the costs and benefits of the model's outcomes. However, there are a few standard accuracy measures that we will go over throughout this book so that you can compare models and see how changes to a model affect its results.
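To make the overfitting pitfall concrete, here is a minimal sketch in plain Python. It is not taken from the book: the synthetic data, the 20% label noise, and the names `make_data`, `memorizer`, and `threshold` are all assumptions chosen for illustration. It compares a model that memorizes its training set with a much simpler rule, scoring both on the training data and on held-out data.

```python
import random


def make_data(n, rng, noise=0.2):
    """Generate (x, label) pairs: label is 1 when x > 0.5,
    but a fraction `noise` of the labels is flipped at random."""
    data = []
    for _ in range(n):
        x = rng.random()
        label = int(x > 0.5)
        if rng.random() < noise:
            label = 1 - label  # inject label noise
        data.append((x, label))
    return data


def accuracy(predict, data):
    """Fraction of examples for which predict(x) equals the label."""
    return sum(predict(x) == y for x, y in data) / len(data)


rng = random.Random(42)      # fixed seed so the run is repeatable
train = make_data(200, rng)  # data the models are built from
test = make_data(200, rng)   # held-out data the models never saw


def memorizer(x):
    """Overfit model: return the label of the nearest training point,
    so it reproduces the training labels, noise included."""
    return min(train, key=lambda p: abs(p[0] - x))[1]


def threshold(x):
    """Simple model: a single cut-off that ignores the noise."""
    return int(x > 0.5)


train_acc_mem = accuracy(memorizer, train)
test_acc_mem = accuracy(memorizer, test)
train_acc_thr = accuracy(threshold, train)
test_acc_thr = accuracy(threshold, test)

print(f"memorizer  train={train_acc_mem:.2f}  test={test_acc_mem:.2f}")
print(f"threshold  train={train_acc_thr:.2f}  test={test_acc_thr:.2f}")
```

The memorizer scores perfectly on the data it was built from, because each training point is its own nearest neighbour. The number that matters is its score on the held-out set, which is why models are typically compared on data they were not fit to.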