- Statistics for Data Science
- James D. Miller
- 2021-07-02 14:58:52
Categorical data
Earlier, we explained how variables in your data can be either independent or dependent. Another way to classify a variable is as a categorical variable. A categorical variable can take on one of a limited, and typically fixed, number of possible values, so each individual observation is assigned to a particular category.
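To make the "limited, fixed number of possible values" idea concrete, here is a minimal Python sketch. The `Sport` type and the `parse_sport` helper are hypothetical names introduced only for illustration; the three categories are borrowed from the sports example used later in this section.

```python
from enum import Enum


class Sport(Enum):
    """A categorical variable: only these three values are allowed."""
    TENNIS = "tennis"
    SOCCER = "soccer"
    FOOTBALL = "football"


def parse_sport(raw: str) -> Sport:
    """Assign a raw text value to one of the fixed categories.

    Raises ValueError if the value is not one of the known categories.
    """
    return Sport(raw.strip().lower())


# Each individual is assigned to exactly one category.
student_sport = parse_sport("Soccer")
print(student_sport.name)
```

A value outside the fixed set, such as `parse_sport("golf")`, raises a `ValueError`, which is one simple way to enforce that the variable stays categorical.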
Often, the meaning of collected data is unclear. Categorizing the data is one way a data scientist can give it meaning.
For example, suppose a numeric variable is collected and the values found are 4, 10, and 12. On their own, these numbers say little, but their meaning becomes clear once the values are categorized. Suppose that, based on an analysis of how the data was collected, we can categorize the data by noting that it describes university students, with the following number of players in each sport:
- 4 tennis players
- 10 soccer players
- 12 football players
Now, because we grouped the data into categories, the meaning becomes clear.
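The same step can be sketched in Python. This is only an illustration under the assumption that the order of the raw values matches the order of the categories; the variable names are hypothetical.

```python
# Raw values as collected: numbers with no obvious meaning.
raw_counts = [4, 10, 12]

# Category labels recovered from how the data was collected
# (assumed to be in the same order as the raw values).
sports = ["tennis", "soccer", "football"]

# Pairing each value with its category gives the numbers meaning.
players_by_sport = dict(zip(sports, raw_counts))

total_players = sum(players_by_sport.values())

for sport, count in players_by_sport.items():
    share = count / total_players
    print(f"{count} {sport} players ({share:.0%} of all players)")
```

Once the values carry category labels, simple summaries such as each sport's share of the total become possible.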
Other examples of categorized data include individual pet preferences (grouped by the type of pet) and vehicle ownership (grouped by the style of car owned).
So, categorical data, as the name suggests, is data grouped into one or more categories. Some data scientists refer to categories as sub-populations of data.
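As a rough sketch of the sub-population idea, the following Python snippet groups individual survey responses by pet preference. The names and responses are made-up illustrative data, not taken from any real data set.

```python
from collections import defaultdict

# Hypothetical survey responses: (respondent, preferred pet).
responses = [
    ("Ana", "dog"),
    ("Ben", "cat"),
    ("Caro", "dog"),
    ("Dev", "fish"),
]

# Group respondents into sub-populations, one per category.
sub_populations = defaultdict(list)
for name, pet in responses:
    sub_populations[pet].append(name)

# The size of each sub-population is the count for that category.
sizes = {pet: len(names) for pet, names in sub_populations.items()}
print(sizes)
```

Each key in `sub_populations` is a category, and the list stored under it is the sub-population of respondents who fall into that category.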