官术网_书友最值得收藏!

Quality questions

Suppose there are concerns about the quality of the data to be, or being, consumed by the organization. As we eluded to earlier in this chapter, there are different types of data quality concerns such as what we called mechanical issues as well as statistical issues (and there are others).

Current trending examples of the most common statistical quality concerns include duplicate entries and misspellings, misclassification and aggregation, and changing meanings.

If management is questioning the validity of the total sales listed on a daily report or perhaps doesn't trust it because the majority of your customers are not legally able to drive in the United States, the number of the organizations repeat customers are declining, you have a quality issue:

Quality is a concern to both the data developer and the data scientist. A data developer focuses more on timing and formatting (the mechanics of the data), while the data scientist is more interested in the data's statistical quality (with priority given to issues with the data that may potentially impact the reliability of a particular study).

主站蜘蛛池模板: 滨州市| 鄢陵县| 信阳市| 镇远县| 茶陵县| 扶余县| 保康县| 桐庐县| 刚察县| 盐边县| 铁岭市| 象州县| 枞阳县| 中江县| 茌平县| 吴堡县| 兰考县| 武威市| 颍上县| 神农架林区| 黎川县| 鹤岗市| 南宁市| 绥阳县| 平罗县| 剑川县| 福泉市| 古田县| 砀山县| 黎城县| 翼城县| 交口县| 收藏| 晋宁县| 康定县| 含山县| 手机| 江津市| 莒南县| 临沂市| 米林县|