官术网_书友最值得收藏!

Why do we use xavier initialization?

The following factors call for the application of xavier initialization:

  • If the weights in a network start very small, most of the signals will shrink and become dormant at the activation function in the later layers

  • If the weights start very large, most of the signals will massively grow and pass through the activation functions in the later layers

Thus, xavier initialization helps in generating optimal weights, such that the signals are within optimal range, thereby minimizing the chances of the signals getting neither too small nor too large.

The derivation of the preceding formula is beyond the scope of this book. Feel free to search here (http://andyljones.tumblr.com/post/110998971763/an-explanation-of-xavier-initialization) and go through the derivation for a better understanding.

主站蜘蛛池模板: 昂仁县| 长海县| 舒兰市| 黄梅县| 普定县| 宝兴县| 革吉县| 平谷区| 华安县| 四川省| 扎赉特旗| 河间市| 象州县| 江阴市| 遵化市| 新宾| 方山县| 安溪县| 广汉市| 上蔡县| 昌都县| 邵阳市| 和林格尔县| 陆河县| 项城市| 皋兰县| 海安县| 威海市| 万盛区| 宜兰市| 阳曲县| 清河县| 古丈县| 张掖市| 扎囊县| 沙洋县| 双辽市| 雅江县| 驻马店市| 株洲市| 榆社县|