官术网_书友最值得收藏!

Description of the dataset

A dataset from the Allstate Insurance company will be used, which consists of more than 300,000 examples with masked and anonymous data and consisting of more than 100 categorical and numerical attributes, thus being compliant with confidentiality constraints, more than enough for building and evaluating a variety of ML techniques.

The dataset is downloaded from the Kaggle website at https://www.kaggle.com/c/allstate-claims-severity/data. Each row in the dataset represents an insurance claim. Now, the task is to predict the value for the loss column. Variables prefaced with cat are categorical, while those prefaced with cont are continuous.

It is to be noted that the Allstate Corporation is the second largest insurance company in the United States, founded in 1931. We are trying to make the whole thing automated, to predict the cost, and hence the severity, of accident and damage claims.

主站蜘蛛池模板: 泾源县| 罗城| 长丰县| 陆川县| 固始县| 平南县| 方城县| 文化| 都匀市| 威海市| 黔西| 宁南县| 南宫市| 四川省| 晋城| 徐水县| 瑞丽市| 义乌市| 互助| 应城市| 台南县| 杭锦旗| 禹州市| 木兰县| 永州市| 涞水县| 武乡县| 酉阳| 玛沁县| 莫力| 商南县| 土默特左旗| 冕宁县| 商丘市| 桐庐县| 霍邱县| 竹北市| 株洲市| 徐水县| 涿鹿县| 盘山县|