Example of feature engineering procedures – can anyone really predict the weather?

Consider a machine learning pipeline that was built to predict the weather. For the sake of simplicity in this introductory chapter, assume that our algorithm takes in atmospheric data directly from sensors and predicts one of two values: sun or rain. This is clearly a classification pipeline that can output only one of two answers. We will run the algorithm at the beginning of every day. If the algorithm outputs sun and the day is mostly sunny, the algorithm was correct; likewise, if it predicts rain and the day is mostly rainy, it was correct. In any other instance, the algorithm is considered incorrect. If we run the algorithm every day for a month, we obtain about 30 pairs of predicted and actual, observed weather, from which we can calculate the algorithm's accuracy. Perhaps the algorithm predicted correctly on 20 of the 30 days, giving it an accuracy of two out of three, or about 67%. Using this standardized metric, we can tweak our algorithm and see whether the accuracy goes up or down.
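The accuracy calculation described above can be sketched in a few lines of Python. This is only a minimal illustration: the `accuracy` helper and the `observed` and `predicted` lists are hypothetical and invented for this example, arranged so that 20 of the 30 daily predictions match the observed weather.

```python
def accuracy(predicted, observed):
    """Return the fraction of days on which the prediction matched reality."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not predicted:
        raise ValueError("need at least one day of data")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return correct / len(predicted)


# Hypothetical month of data (an assumption for illustration, not real
# measurements): 18 mostly sunny days followed by 12 mostly rainy days.
observed = ["sun"] * 18 + ["rain"] * 12

# Hypothetical predictions: 14 sunny days and 6 rainy days are called
# correctly, and the remaining 10 days are wrong.
predicted = ["sun"] * 14 + ["rain"] * 4 + ["rain"] * 6 + ["sun"] * 6

monthly_accuracy = accuracy(predicted, observed)
print(f"{monthly_accuracy:.0%}")  # prints "67%"
```

Because the metric is a single number computed the same way every time, we can change the pipeline, rerun it on the same days, and compare the two values directly.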

Of course, this is an oversimplification, but the idea is that any machine learning pipeline is essentially useless if we cannot evaluate its performance using a set of standard metrics. Feature engineering, as applied to improving machine learning, is therefore impossible without such an evaluation procedure. Throughout this book, we will revisit this idea of evaluation; for now, let's talk briefly about how, in general, we will approach it.

A topic in feature engineering will usually involve transforming our dataset (as per the definition of feature engineering). In order to say definitively whether a particular feature engineering procedure has helped our machine learning algorithm, we will follow the steps detailed in the following section.