
Additional data generation using affine transformation

We will use the Keras ImageDataGenerator to generate additional data by applying affine transformations to the image pixel coordinates. The transformations that we will primarily use are rotation, translation, and scaling. If the spatial coordinate of a pixel is defined by $x = [x_1\ x_2]^T \in \mathbb{R}^2$, then the new coordinate of the pixel can be given by the following:

$$x' = Mx + b$$

Here, $M \in \mathbb{R}^{2 \times 2}$ is the affine transformation matrix, and $b = [b_1\ b_2]^T \in \mathbb{R}^2$ is a translation vector.

The term $b_1$ specifies the translation along one spatial dimension, while $b_2$ specifies the translation along the other.
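To make the roles of the matrix and the translation vector concrete, the following is a minimal sketch in plain Python that applies an affine map to a single pixel coordinate. The helper names `affine_transform` and `rotation_scale_matrix` are hypothetical and chosen for illustration only; this is not how ImageDataGenerator is implemented, and the rotation direction assumes the usual mathematical axis convention.

```python
import math


def affine_transform(point, M, b):
    """Apply the affine map M * point + b to a 2D coordinate."""
    x1, x2 = point
    return (
        M[0][0] * x1 + M[0][1] * x2 + b[0],
        M[1][0] * x1 + M[1][1] * x2 + b[1],
    )


def rotation_scale_matrix(angle_deg, scale):
    """Build a 2x2 matrix that rotates by angle_deg and scales uniformly.

    The rotation is counterclockwise when the second axis points up.
    """
    theta = math.radians(angle_deg)
    c, s = math.cos(theta), math.sin(theta)
    return [[scale * c, -scale * s],
            [scale * s, scale * c]]


# Pure translation: M is the identity, so only b moves the pixel.
identity = [[1.0, 0.0], [0.0, 1.0]]
shifted = affine_transform((10.0, 20.0), identity, (3.0, -2.0))
print(shifted)  # (13.0, 18.0)

# Rotation by 90 degrees with scale 2, followed by a translation of (1, 1).
M = rotation_scale_matrix(90.0, 2.0)
moved = affine_transform((1.0, 0.0), M, (1.0, 1.0))
print(tuple(round(v, 6) for v in moved))  # (1.0, 3.0)
```

With the identity matrix, the result differs from the input only by the translation vector, so each component of the vector shifts the pixel along one axis. Rotation and scaling are carried entirely by the matrix.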

These transformations are required because neural networks are not, in general, translation invariant, rotation invariant, or scale invariant. Pooling operations do provide some translation invariance, but it is generally not enough. A neural network does not treat an object at a specific location in one image and the same object at a translated location in another image as the same thing. That is why we need several instances of an image at different translated positions for the network to learn better. The same reasoning applies to rotation and scaling.
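The lack of translation invariance can be illustrated with a small sketch in plain Python. It assumes a toy 4x4 binary image and a single fixed weight template standing in for one unit of a fully connected layer. The helpers `translate` and `flat_dot` are hypothetical names introduced here, and the code only illustrates the idea; it does not reproduce what ImageDataGenerator does internally.

```python
def translate(image, dy, dx, fill=0):
    """Return a copy of a 2D list image shifted by dy rows and dx columns."""
    height, width = len(image), len(image[0])
    out = [[fill] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            nr, nc = r + dy, c + dx
            # Pixels shifted outside the frame are dropped.
            if 0 <= nr < height and 0 <= nc < width:
                out[nr][nc] = image[r][c]
    return out


def flat_dot(a, b):
    """Dot product of two images treated as flattened vectors."""
    return sum(x * y for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))


# A 2x2 object centered in a 4x4 image.
image = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]

# Several translated instances of the same image (the augmented set).
shifts = [(0, 0), (0, 1), (1, 0), (-1, -1)]
augmented = [translate(image, dy, dx) for dy, dx in shifts]

# Response of a fixed template (here, the original image) to each instance.
responses = [flat_dot(image, img) for img in augmented]
print(responses)  # [4, 2, 2, 1]
```

The object is fully visible in every translated copy, yet the fixed template responds differently to each one. Training on the translated instances gives the network a chance to learn weights that respond to the object wherever it appears.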
