官术网_书友最值得收藏!

Evaluation

Let's say you have a model with 99% accuracy in classifying brain tumors. Can you trust this model? No.

If your model had said that no-one has a brain tumor, it would still have 99%+ accuracy. Why?

Because luckily 99% or more of the population does not have a brain tumor!

To use our models for practical use, we need to look beyond accuracy. We need to understand what the model gets right or wrong in order to improve it. A minute spent understanding the confusion matrix will stop us from going ahead with such dangerous models.

Additionally, we will want to develop an intuition of what the model is doing underneath the black box optimization algorithms. Data visualization techniques such as t-SNE can assist us with this.

For continuously running NLP applications such as email spam classifiers or chatbots, we would want the evaluation of the model quality to happen continuously as well. This will help us ensure that the model's performance does not degrade with time.

主站蜘蛛池模板: 司法| 萝北县| 九江县| 镇沅| 宁海县| 镇平县| 仁布县| 唐山市| 靖宇县| 黔西| 凌海市| 大港区| 扎囊县| 镇康县| 从江县| 大悟县| 奉节县| 增城市| 青河县| 寻甸| 青铜峡市| 增城市| 商水县| 宣城市| 余江县| 灵山县| 察雅县| 酉阳| 泊头市| 龙泉市| 云和县| 合江县| 金溪县| 新民市| 思南县| 阳信县| 鄂尔多斯市| 霍山县| 邻水| 天长市| 商城县|