- Scala Machine Learning Projects
- Md. Rezaul Karim
- 201字
- 2021-06-30 19:05:33
Description of the dataset
The Orange Telecom's Churn Dataset, which consists of cleaned customer activity data (features), along with a churn label specifying whether a customer canceled the subscription, will be used to develop our predictive model. The churn-80 and churn-20 datasets can be downloaded from the following links, respectively:
- https://bml-data.s3.amazonaws.com/churn-bigml-80.csv
- https://bml-data.s3.amazonaws.com/churn-bigml-20.csv
However, as more data is often desirable for developing ML models, let's use the larger set (that is, churn-80) for training and cross-validation purposes, and the smaller set (that is, churn-20) for final testing and model performance evaluation.
Note that the latter set is only used to evaluate the model (that is for demonstration purposes). For a production ready environment, telecommunication companies can use their own dataset with necessary preprocessing and feature engineering. The dataset has the following schema:
- State: String
- Account length: Integer
- Area code: Integer
- International plan: String
- Voicemail plan: String
- Number email messages: Integer
- Total day minutes: Double
- Total day calls: Integer
- Total day charge: Double
- Total eve minutes: Double
- Total eve calls: Integer
- Total eve charge: Double
- Total night minutes: Double
- Total night calls: Integer
- Total night charge: Double
- Total intl minutes: Double
- Total intl calls: Integer
- Total intl charge: Double
- Customer service calls: Integer
- 電氣自動化專業英語(第3版)
- 火格局的時空變異及其在電網防火中的應用
- Visual C# 2008開發技術實例詳解
- 快學Flash動畫百例
- Hands-On Linux for Architects
- STM32G4入門與電機控制實戰:基于X-CUBE-MCSDK的無刷直流電機與永磁同步電機控制實現
- 工業機器人現場編程(FANUC)
- Chef:Powerful Infrastructure Automation
- 智能生產線的重構方法
- 精通數據科學:從線性回歸到深度學習
- Excel 2010函數與公式速查手冊
- Learn QGIS
- 青少年VEX IQ機器人實訓課程(初級)
- Creating ELearning Games with Unity
- Puppet 3 Beginner’s Guide