- Hands-On Ensemble Learning with R
- Prabhanjan Narayanachar Tattar
- 118字
- 2021-07-23 19:10:50
Pima Indians Diabetes
Diabetes is a health hazard, which is mostly incurable, and patients who are diagnosed with it have to adjust their lifestyles in order to cater to this condition. Based on variables such as pregnant
, glucose
, pressure
, triceps
, insulin
, mass
, pedigree
, and age
, the problem here is to classify the person as diabetic or not. Here, we have 768 observations. This dataset is drawn from the mlbench
package:
> data("PimaIndiansDiabetes") > set.seed(12345) > Train_Test <- sample(c("Train","Test"),nrow(PimaIndiansDiabetes),replace = TRUE, + prob = c(0.7,0.3)) > head(Train_Test) [1] "Test" "Test" "Test" "Test" "Train" "Train" > PimaIndiansDiabetes_Train <- PimaIndiansDiabetes[Train_Test=="Train",] > PimaIndiansDiabetes_TestX <- within(PimaIndiansDiabetes[Train_Test=="Test",], + rm(diabetes)) > PimaIndiansDiabetes_TestY <- PimaIndiansDiabetes[Train_Test=="Test","diabetes"] > PID_Formula <- as.formula("diabetes~.")
The five datasets described up to this point are classification problems. We look at one example each for regression, time series, survival, clustering, and outlier detection problems.
推薦閱讀
- Unreal Engine:Game Development from A to Z
- 精通MATLAB圖像處理
- 2018西門子工業(yè)專家會(huì)議論文集(上)
- Cloud Analytics with Microsoft Azure
- Google App Inventor
- 樂高創(chuàng)意機(jī)器人教程(中級(jí) 下冊(cè) 10~16歲) (青少年iCAN+創(chuàng)新創(chuàng)意實(shí)踐指導(dǎo)叢書)
- 網(wǎng)絡(luò)化分布式系統(tǒng)預(yù)測(cè)控制
- 奇點(diǎn)將至
- 從零開始學(xué)Java Web開發(fā)
- 強(qiáng)化學(xué)習(xí)
- 網(wǎng)絡(luò)脆弱性掃描產(chǎn)品原理及應(yīng)用
- 漢字錄入技能訓(xùn)練
- QTP自動(dòng)化測(cè)試實(shí)踐
- 單片機(jī)C51應(yīng)用技術(shù)
- Learning Couchbase