官术网_书友最值得收藏!

Discovering coding issues using frequencies

The frequency distribution for INCOME in the following screenshot demonstrates another reason why it is important to examine the pattern for individual data fields before diving into analytics more deeply. Navigate to Analyze | Descriptive Statistics | Frequencies, and select Respondents Income to build this table:

The values coded in the data are displayed in Figure 3 along with the associated value labels. This was done on the Edit | Options | Output screen by specifying values and labels in the dropdown at the lower left under to pivot table labeling.

People are often reluctant to divulge their income so surveys typically ask them to select an income category like the groupings in this table. Notice, however, that the groups (numbered 1 through 26) represent unequal bands of income. The groups coded 3 thru 7 represent a range of only $1,000, while group 15 has a range of $5,000, and group 20 covers $15,000.

In the previous chapter, one of the properties of variables that was discussed involved the level of measurement. Income is an example of a scale variable and for these fields, it is assumed that the intervals between values are equal, which is not the case in this instance. The information in this table can be evaluated even though the categories are not uniform in terms of the dollar range they represent, but before using this field for statistical modeling, it is important to adjust the coding to reflect the actual differences in income.

In a later chapter, techniques for modifying fields will be covered but it is by examining the data along with the coding scheme used that you discover where the need for change exists.

主站蜘蛛池模板: 玉溪市| 株洲县| 安宁市| 淮阳县| 津市市| 来安县| 平原县| 顺平县| 黔西县| 奈曼旗| 元阳县| 南宫市| 钟祥市| 临汾市| 夹江县| 西盟| 日土县| 罗源县| 隆林| 新蔡县| 安乡县| 富宁县| 汾阳市| 勃利县| 阜阳市| 盘锦市| 肥东县| 江永县| 瓮安县| 扬中市| 灌南县| 砚山县| 闽清县| 漳浦县| 开化县| 隆子县| 灵宝市| 罗田县| 福州市| 鄄城县| 敦煌市|