- Machine Learning with Swift
- Alexander Sosnovshchenko
- 2021-06-24 18:54:51
Data preprocessing
The useful information in the data is usually referred to as the signal. The pieces of data that represent errors of different kinds, along with irrelevant data, are known as noise. Errors can enter the data during measurement, during information transmission, or through human mistakes. The goal of data cleansing procedures is to increase the signal-to-noise ratio. During this stage, you will usually transform all data to one format, delete entries with missing values, and check suspicious outliers (they can be either noise or signal). It is widely believed among ML engineers that the data preprocessing stage usually consumes 90% of the time allocated for an ML project. Then algorithm tweaking consumes another 90% of the time. This statement is only partially a joke (about 10% of it). In Chapter 13, Best Practices, we are going to discuss common problems with the data and how to fix them.
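Two of the cleansing steps mentioned above, deleting entries with missing values and checking suspicious outliers, can be sketched in a few lines of Swift. This is only an illustration under stated assumptions: the function names `removeMissing`, `median(ofSorted:)`, and `suspiciousOutliers(in:k:)` are hypothetical, the sample readings are made up, and the 1.5 × IQR fence is one common rule of thumb rather than a method prescribed by the book.

```swift
// Hypothetical helper: drops missing (nil) and non-finite (NaN, infinity) entries.
func removeMissing(_ values: [Double?]) -> [Double] {
    return values.compactMap { $0 }.filter { $0.isFinite }
}

// Median of an already sorted, non-empty array.
func median(ofSorted xs: [Double]) -> Double {
    let n = xs.count
    return n % 2 == 1 ? xs[n / 2] : (xs[n / 2 - 1] + xs[n / 2]) / 2
}

// Hypothetical helper: returns values lying outside the interquartile-range fences.
// Assumption: k = 1.5 is the conventional rule of thumb, and k is non-negative.
// The flagged values are only *suspicious*; they may be noise or genuine signal.
func suspiciousOutliers(in values: [Double], k: Double = 1.5) -> [Double] {
    guard values.count >= 4 else { return [] }   // too few points to estimate quartiles
    let sorted = values.sorted()
    let half = sorted.count / 2
    // Quartiles as medians of the lower and upper halves
    // (the middle element is excluded when the count is odd).
    let q1 = median(ofSorted: Array(sorted[..<half]))
    let q3 = median(ofSorted: Array(sorted[(sorted.count - half)...]))
    let iqr = q3 - q1
    let low = q1 - k * iqr
    let high = q3 + k * iqr
    return values.filter { $0 < low || $0 > high }
}

// Made-up sample: temperature-like readings with two unusable entries and one spike.
let raw: [Double?] = [20.1, nil, 19.8, 20.4, .nan, 85.0, 20.0, 19.9]
let clean = removeMissing(raw)                  // six usable readings remain
let flagged = suspiciousOutliers(in: clean)     // contains only 85.0
```

Note that the sketch flags outliers instead of deleting them. Whether a flagged value is a measurement error or a rare but real event is a judgment that usually needs domain knowledge.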