官术网_书友最值得收藏!

Balanced dataset

The application allows you to record samples of different motion types. As you train the model, you may notice one interesting effect: to get accurate predictions, you need not only enough samples, but you also need the proportion of different classes in your dataset to be roughly equal. Think about it: if you have 100 samples of two classes (walk and run), and 99 of them belong to one class (walk), the classifier that delivers 99% accuracy may look like this:

func predict(x: [Double]) -> MotionType { 
   return .walk 
} 

But this is not what we want, obviously.

This observation lead us to the notion of the balanced data set; for most machine learning algorithms, you want the data set in which samples of different classes are represented equally frequently.

主站蜘蛛池模板: 祥云县| 县级市| 临颍县| 汶川县| 姜堰市| 乌拉特后旗| 渭南市| 乐陵市| 锡林郭勒盟| 同仁县| 吴川市| 鄱阳县| 财经| 咸宁市| 迁安市| 天台县| 宝兴县| 文水县| 迁西县| 滨州市| 永顺县| 尉犁县| 兴山县| 洛扎县| 建德市| 色达县| 昆明市| 天峻县| 绥棱县| 徐汇区| 全椒县| 随州市| 光泽县| 马边| 麻阳| 洛川县| 广州市| 富裕县| 岐山县| 牟定县| 桐梓县|