官术网_书友最值得收藏!

The softmax function

The softmax function is mainly used to handle classification problems and preferably used in the output layer, outputting the probabilities of the output classes. As seen earlier, while solving the binary logistic regression, we witnessed that the sigmoid function was able to handle only two classes. In order to handle multi-class we need a function that can generate values for all the classes and those values follow the rules of probability. This objective is fulfilled by the softmax function, which shrinks the outputs for each class between 0 and 1 and divides them by the sum of the outputs for all the classes:

For examples, , where x refers to four classes.

Then, the softmax function will gives results (rounded to three decimal places) as:

Thus, we see the probabilities of all the classes. Since the output of every classifier demands probabilistic values for all the classes, the softmax function becomes the best candidate for the outer layer activation function of the classifier.

主站蜘蛛池模板: 阜康市| 巴林左旗| 丽江市| 海丰县| 盈江县| 大埔区| 临武县| 甘谷县| 横山县| 平顺县| 饶河县| 阿拉善右旗| 独山县| 临西县| 玉树县| 彭阳县| 大石桥市| 积石山| 沾化县| 邹平县| 浦城县| 民权县| 临泽县| 德化县| 墨竹工卡县| 东乡| 鹤岗市| 合肥市| 石林| 博野县| 河源市| 衡南县| 建宁县| 峡江县| 天镇县| 宣武区| 桃园县| 天峻县| 鄂托克旗| 淅川县| 新野县|