官术网_书友最值得收藏!

How it works...

We start by importing our dataset of PE header information from a collection of samples (step 1). This dataset consists of two classes of PE files: malware and benign. We then use plotly to create a nice-looking interactive 3D graph (step 1). We proceed to prepare our dataset for machine learning. Specifically, in step 2, we set X as the features and y as the classes of the dataset. Based on the fact that there are two classes, we aim to cluster the data into two groups that will match the sample classification. We utilize the K-means algorithm (step 3), about which you can find more information at: https://en.wikipedia.org/wiki/K-means_clustering. With a thoroughly trained clustering algorithm, we are ready to predict on the testing set. We apply our clustering algorithm to predict to which cluster each of the samples should belong (step 4). Observing our results in step 5, we see that clustering has captured a lot of the underlying information, as it was able to fit the data well.

主站蜘蛛池模板: 霍邱县| 略阳县| 黑山县| 潜江市| 衡阳县| 兴山县| 泾阳县| 大同县| 伊通| 离岛区| 珲春市| 偏关县| 永年县| 苏尼特左旗| 唐山市| 泾阳县| 缙云县| 稷山县| 延安市| 翁牛特旗| 凤阳县| 弥勒县| 泽库县| 阿荣旗| 惠水县| 增城市| 平阴县| 礼泉县| 维西| 塘沽区| 阳城县| 进贤县| 台山市| 蒙山县| 大新县| 马尔康县| 类乌齐县| 吉木萨尔县| 宜城市| 密云县| 东城区|