官术网_书友最值得收藏!

Scoring of unusualness

Once a model has been constructed, the likelihood of any future observed value can be found within the probability distribution. As described earlier, we had asked the question, "Is getting fifteen pieces of mail likely?". This question can now be empirically answered, depending on the model, with a number between zero (no possibility) and one (absolute certainty). ML will use the model to calculate this fractional value out to approximately 300 significant figures (which can be helpful when dealing with very low probabilities). Let's observe the following graph:

ML calculates the probability of the dip in value in this time series

Here, the probability of the observation of the actual value of 921 at this point in time was calculated to be 6.3634e-7 (or more commonly a mere 0.000063634% chance). This very small value is perhaps not that intuitive to most people. As such, ML will take this probability calculation, and via a process of quantile normalization, re-cast that observation on a severity scale between 0 and 100, where 100 is the highest level of unusualness possible for that particular dataset. In the preceding case, the probability calculation of 6.3634e-7 was normalized to a score of 94. This normalized score will come in handy later as a means by which to assess the severity of the anomaly for purposes of alerting and/or triage.

主站蜘蛛池模板: 辉南县| 社旗县| 寻乌县| 科技| 巴马| 庆元县| 延长县| 德州市| 富裕县| 武鸣县| 佛山市| 威远县| 巴里| 鞍山市| 云龙县| 泊头市| 龙井市| 大英县| 盐池县| 长海县| 屏东市| 固原市| 油尖旺区| 抚顺县| 特克斯县| 新津县| 黔江区| 松阳县| 内丘县| 佛教| 岚皋县| 河池市| 沂水县| 嘉兴市| 金平| 阳春市| 铁力市| 突泉县| 思茅市| 三河市| 仙游县|