- Advanced Machine Learning with R
- Cory Lesmeister Dr. Sunil Kumar Chinnamgari
- 220字
- 2021-06-24 14:24:32
Preparing and Understanding Data
Research consistently shows that machine learning and data science practitioners spend most of their time manipulating data and preparing it for analysis. Indeed, many find it the most tedious and least enjoyable part of their work. Numerous companies are offering solutions to the problem but, in my opinion, results at this point are varied. Therefore, in this first chapter, I shall endeavor to provide a way of tackling the problem that will ease the burden of getting your data ready for machine learning. The methodology introduced in this chapter will serve as the foundation for data preparation and for understanding many of the subsequent chapters. I propose that once you become comfortable with this tried and true process, it may very well become your favorite part of machine learning—as it is for me.
The following are the topics that we'll cover in this chapter:
- Overview
- Reading the data
- Handling duplicate observations
- Descriptive statistics
- Exploring categorical variables
- Handling missing values
- Zero and near-zero variance features
- Treating the data
- Correlation and linearity
- 用“芯”探核:龍芯派開發實戰
- 深入理解Spring Cloud與實戰
- 電腦軟硬件維修大全(實例精華版)
- 計算機組裝·維護與故障排除
- 硬件產品經理成長手記(全彩)
- 精選單片機設計與制作30例(第2版)
- 從零開始學51單片機C語言
- Visual Media Processing Using Matlab Beginner's Guide
- 筆記本電腦使用、維護與故障排除從入門到精通(第5版)
- BeagleBone Robotic Projects
- Blender Quick Start Guide
- Spring Cloud微服務和分布式系統實踐
- 電腦組裝與維護即時通
- Intel FPGA權威設計指南:基于Quartus Prime Pro 19集成開發環境
- Blender for Video Production Quick Start Guide