
Summary

You will find the techniques covered in this chapter valuable not only when first working with a new data set, but throughout the analytic journey, as patterns are investigated and the results are explored further.

Understanding the structure of the data in detail is critical before moving on to more sophisticated analytical methods, because those methods often reduce the relationships they find to a handful of summary statistics. The diagnostics accompanying these statistics provide a means of assessing how well they capture the patterns, but knowing in advance where issues are likely to arise helps focus the examination of the results.

The next chapter will expand on the topic of outliers touched on here and address the issue of missing values. Both situations occur regularly in real data, and several approaches can be used to detect them so that their impact on the analysis is minimized.
