Getting to know your data

For many years, researchers argued about which matters more: data or algorithms. Today, ML specialists generally accept that data matters more. In most cases, we can assume that whoever has better data beats those with more advanced algorithms. Garbage in, garbage out: this rule holds true in ML more than anywhere else. To succeed in this domain, you need not only to have data, but also to know your data and know what to do with it.

ML datasets are usually composed of individual observations, called samples, cases, or data points. In the simplest case, each sample has several features.
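
To make the terminology concrete, here is a minimal sketch of a dataset in which each row is one sample and each column is one feature. The feature names, the values, and the helper functions `feature_value` and `feature_column` are hypothetical and chosen only for illustration; they do not come from any real dataset or library.

```python
# Hypothetical toy dataset: each row is one sample, each column is one feature.
feature_names = ["length_cm", "weight_kg", "color"]
samples = [
    [12.0, 0.8, "brown"],
    [30.5, 4.2, "black"],
    [18.3, 1.5, "white"],
]

n_samples = len(samples)
n_features = len(feature_names)

# In the simplest case, every sample carries exactly one value per feature.
assert all(len(row) == n_features for row in samples)


def feature_value(sample, name):
    """Return the value of the named feature for a single sample."""
    return sample[feature_names.index(name)]


def feature_column(name):
    """Return the named feature's values across all samples."""
    idx = feature_names.index(name)
    return [row[idx] for row in samples]


print(n_samples, "samples with", n_features, "features each")
print(feature_value(samples[1], "weight_kg"))
print(feature_column("color"))
```

Reading along a row gives everything known about one sample; reading down a column gives one feature across the whole dataset.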
