- Scala Machine Learning Projects
- Md. Rezaul Karim
- 142字
- 2021-06-30 19:05:29
Description of the dataset
A dataset from the Allstate Insurance company will be used, which consists of more than 300,000 examples with masked and anonymous data and consisting of more than 100 categorical and numerical attributes, thus being compliant with confidentiality constraints, more than enough for building and evaluating a variety of ML techniques.
The dataset is downloaded from the Kaggle website at https://www.kaggle.com/c/allstate-claims-severity/data. Each row in the dataset represents an insurance claim. Now, the task is to predict the value for the loss column. Variables prefaced with cat are categorical, while those prefaced with cont are continuous.
It is to be noted that the Allstate Corporation is the second largest insurance company in the United States, founded in 1931. We are trying to make the whole thing automated, to predict the cost, and hence the severity, of accident and damage claims.
- Photoshop CS4經(jīng)典380例
- Cloud Analytics with Microsoft Azure
- Visual C# 2008開發(fā)技術(shù)實(shí)例詳解
- CompTIA Network+ Certification Guide
- Learning Azure Cosmos DB
- Applied Data Visualization with R and ggplot2
- Cloud Security Automation
- Windows安全指南
- 大數(shù)據(jù)案例精析
- 傳感器與自動(dòng)檢測(cè)
- 計(jì)算機(jī)硬件技術(shù)基礎(chǔ)學(xué)習(xí)指導(dǎo)與練習(xí)
- FreeCAD [How-to]
- 智能小車機(jī)器人制作大全(第2版)
- 人工智能基礎(chǔ)
- 多媒體技術(shù)應(yīng)用教程