官术网_书友最值得收藏!

  • Deep Learning with Keras
  • Antonio Gulli Sujit Pal
  • 74字
  • 2021-07-02 23:58:05

Increasing the size of batch computation

Gradient descent tries to minimize the cost function on all the examples provided in the training sets and, at the same time, for all the features provided in the input. Stochastic gradient descent is a much less expensive variant, which considers only BATCH_SIZE examples. So, let's see what the behavior is by changing this parameter. As you can see, the optimal accuracy value is reached for BATCH_SIZE=128:

主站蜘蛛池模板: 游戏| 昌乐县| 阳曲县| 遂平县| 舞阳县| 江津市| 盐城市| 绥宁县| 慈利县| 莒南县| 南京市| 隆回县| 兴和县| 龙井市| 灯塔市| 健康| 萨嘎县| 二手房| 灯塔市| 峨眉山市| 九龙城区| 竹山县| 新闻| 合川市| 临沂市| 和平区| 文昌市| 红安县| 洛扎县| 江华| 泉州市| 绵竹市| 云和县| 云南省| 阳山县| 会宁县| 太仆寺旗| 罗山县| 九龙城区| 吴旗县| 阳东县|