- Machine Learning with Go Quick Start Guide
- Michael Bironneau Toby Coleman
- 2021-06-24 13:34:00
Defining problem and objectives
Before any development begins, the problem to be solved must be defined, together with objectives that describe what a good outcome looks like, so that expectations are set. How the problem is formulated is very important, as it can mean the difference between an intractable problem and a simple solution. Defining the problem is also likely to involve a conversation about where the input data for any algorithm will come from.
The typical formulation of an ML problem takes the form "given X dataset, predict Y". The availability of data, or lack thereof, can affect the formulation of the problem, the solution, and its feasibility. For example, consider the problem "given a large labeled set of images of handwritten digits[18], predict the label of a previously unseen image". Deep learning algorithms have demonstrated that it is possible to achieve relatively high accuracy on this particular problem with little work on the part of the engineer, as long as the training dataset is sufficiently large[19]. If the training set is not large, the problem immediately becomes more difficult and requires careful selection of the algorithm to use. The size of the training set also affects the achievable accuracy and thus the set of attainable objectives.
Experiments performed by Michael Nielsen on the MNIST handwritten digit dataset show that increasing the training set from 1 labeled example per digit to 5 improved accuracy from around 40% to around 65% for most algorithms tested[20]. Using 10 examples per digit usually raised the accuracy by a further 5%.
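An experiment of this kind needs a way to cut a labeled dataset down to a fixed number of examples per digit. The following is a minimal sketch of that step in Go, not the code used in the cited experiments; the `Example` type and the `TakePerClass` function are hypothetical names introduced here for illustration.

```go
package main

import "fmt"

// Example is a hypothetical labeled training example:
// a flattened image and the digit it depicts.
type Example struct {
	Pixels []float64
	Label  int
}

// TakePerClass returns at most k examples for each label,
// preserving the order in which they appear in data.
func TakePerClass(data []Example, k int) []Example {
	counts := make(map[int]int)
	var out []Example
	for _, ex := range data {
		if counts[ex.Label] < k {
			out = append(out, ex)
			counts[ex.Label]++
		}
	}
	return out
}

func main() {
	// Toy dataset with two labels; pixel data omitted for brevity.
	data := []Example{
		{Label: 0}, {Label: 1}, {Label: 0}, {Label: 1}, {Label: 0},
	}
	small := TakePerClass(data, 1)
	fmt.Println(len(small)) // prints 2: one example for each of the two labels
}
```

Training the same algorithm on the outputs of `TakePerClass(data, 1)`, `TakePerClass(data, 5)`, and `TakePerClass(data, 10)` would give a rough picture of how accuracy depends on the amount of data available. If the input is sorted in some way, it may be worth shuffling it first so that the retained examples are not biased.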
If insufficient data is available to meet the project objectives, it is sometimes possible to boost performance by artificially expanding the dataset with slightly modified copies of existing examples. In the previously mentioned experiments, Nielsen observed that adding slightly rotated or translated images to the dataset improved performance by as much as 15%.
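As a rough illustration of dataset expansion, the sketch below generates translated copies of a grayscale image stored as a row-major slice. It covers translation only (rotation is omitted), and the function names `Translate` and `Augment`, the one-pixel shift size, and the zero fill for uncovered pixels are assumptions made for this example rather than details taken from the experiments.

```go
package main

import "fmt"

// Translate returns a copy of a row-major width x height image
// shifted by (dx, dy) pixels. Pixels with no source are left at zero.
func Translate(img []float64, width, height, dx, dy int) []float64 {
	out := make([]float64, len(img))
	for y := 0; y < height; y++ {
		for x := 0; x < width; x++ {
			sx, sy := x-dx, y-dy // source coordinates for this output pixel
			if sx >= 0 && sx < width && sy >= 0 && sy < height {
				out[y*width+x] = img[sy*width+sx]
			}
		}
	}
	return out
}

// Augment returns the original image followed by four copies,
// each shifted by one pixel right, left, down, and up.
func Augment(img []float64, width, height int) [][]float64 {
	shifts := [][2]int{{0, 0}, {1, 0}, {-1, 0}, {0, 1}, {0, -1}}
	out := make([][]float64, 0, len(shifts))
	for _, s := range shifts {
		out = append(out, Translate(img, width, height, s[0], s[1]))
	}
	return out
}

func main() {
	// A 3x3 toy image with a single bright pixel in the center.
	img := []float64{
		0, 0, 0,
		0, 1, 0,
		0, 0, 0,
	}
	for _, variant := range Augment(img, 3, 3) {
		fmt.Println(variant)
	}
}
```

Each generated variant keeps the label of the image it was derived from, so a dataset of n labeled images becomes one of 5n under these assumptions. Shifts should stay small enough that the digit remains recognizable.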