官术网_书友最值得收藏!

Domain knowledge

More often than data scientists would like to admit, machine learning models produce results that are obvious and intuitive. For instance, we once conducted an elaborate analysis of physicians, prescribing behavior to find out the strongest predictor of how many prescriptions a physician would write in the next quarter. We used a broad set of input variables such as the physicians locations, their specialties, hospital affiliations, prescribing history, and other data. In the end, the best performing model produced a result that we all knew very well. The strongest predictor of how many prescriptions a physician would write in the next quarter was the number of prescriptions the physician had written in the previous quarter! To filter out the truly meaningful variables and build a more robust model, we eventually had to engage someone who had extensive experience of working in the pharma industry. Machine learning models work best when produced in a hybrid approach—one that combines domain expertise along with the sophistication of models developed.

主站蜘蛛池模板: 临沭县| 鹤峰县| 舟曲县| 台湾省| 保康县| 府谷县| 六安市| 林甸县| 沧州市| 梁河县| 广东省| 连城县| 日喀则市| 田阳县| 凌海市| 汶川县| 女性| 广德县| 尼木县| 论坛| 南汇区| 太原市| 马山县| 湖北省| 资溪县| 田林县| 凌云县| 古丈县| 彩票| 德令哈市| 靖州| 白朗县| 耒阳市| 青川县| 阳城县| 炎陵县| 六盘水市| 双流县| 芜湖市| 克山县| 东阳市|