- Practical Convolutional Neural Networks
- Mohit Sewak Md. Rezaul Karim Pradeep Pujari
- 414字
- 2021-06-24 18:58:53
History of CNNs
There have been numerous attempts to recognize pictures by machines for decades. It is a challenge to mimic the visual recognition system of the human brain in a computer. Human vision is the hardest to mimic and most complex sensory cognitive system of the brain. We will not discuss biological neurons here, that is, the primary visual cortex, but rather focus on artificial neurons. Objects in the physical world are three dimensional, whereas pictures of those objects are two dimensional. In this book, we will introduce neural networks without appealing to brain analogies. In 1963, computer scientist Larry Roberts, who is also known as the father of computer vision, described the possibility of extracting 3D geometrical information from 2D perspective views of blocks in his research dissertation titled BLOCK WORLD. This was the first breakthrough in the world of computer vision. Many researchers worldwide in machine learning and artificial intelligence followed this work and studied computer vision in the context of BLOCK WORLD. Human beings can recognize blocks regardless of any orientation or lighting changes that may happen. In this dissertation, he said that it is important to understand simple edge-like shapes in images. He extracted these edge-like shapes from blocks in order to make the computer understand that these two blocks are the same irrespective of orientation:

The vision starts with a simple structure. This is the beginning of computer vision as an engineering model. David Mark, an MIT computer vision scientist, gave us the next important concept, that vision is hierarchical. He wrote a very influential book named VISION. This is a simple book. He said that an image consists of several layers. These two principles form the basis of deep learning architecture, although they do not tell us what kind of mathematical model to use.
In the 1970s, the first visual recognition algorithm, known as the generalized cylinder model, came from the AI lab at Stanford University. The idea here is that the world is composed of simple shapes and any real-world object is a combination of these simple shapes. At the same time, another model, known as the pictorial structure model, was published from SRI Inc. The concept is still the same as the generalized cylinder model, but the parts are connected by springs; thus, it introduced a concept of variability. The first visual recognition algorithm was used in a digital camera by Fujifilm in 2006.
- Redis使用手冊
- 在你身邊為你設計Ⅲ:騰訊服務設計思維與實戰
- 數據分析實戰:基于EXCEL和SPSS系列工具的實踐
- Google Visualization API Essentials
- 從零開始學Hadoop大數據分析(視頻教學版)
- 區塊鏈通俗讀本
- 數據要素五論:信息、權屬、價值、安全、交易
- 數據驅動:從方法到實踐
- 大數據時代下的智能轉型進程精選(套裝共10冊)
- Lego Mindstorms EV3 Essentials
- Python數據分析與挖掘實戰(第3版)
- 云數據中心網絡與SDN:技術架構與實現
- 大數據技術原理與應用:概念、存儲、處理、分析與應用
- Access 2010數據庫程序設計實踐教程
- 改進的群智能算法及其應用