- Hands-On Data Science with SQL Server 2017
- Marek Chmel Vladimír Mu?n?
- 380字
- 2021-06-10 19:13:53
Math and statistics
Statistics and other math skills are essential in several phases of the data science project. Even in the beginning of data exploration, you'll be dividing the features of your data observations into categories:
- Categorical
- Numeric:
- Discrete
- Continuous
Categorical values describe the item and represent an attribute of the item. Imagine you have a dataset about cars: car brand would be a typical categorical value, and color would be another.
On the other side, we have numerical values that can be split into two different categories—discrete and continuous. Discrete values describe the amount of observations, such as how many people purchased a product, and so on. Continuous values have an infinite number of possible values and use real numbers for the representation. In a nutshell, discrete variables are like points plotted on a chart, and a continuous variable can be plotted as a line.
Another classification of the data is the measurement-level point of view. We can split data into two primary categories:
- Qualitative:
- Nominal
- Ordinal
- Quantitative:
- Interval
- Ratio
Nominal variables can't be ordered and only describe an attribute. An example would be the color of a product; this describes how the product looks, but you can't put any ordering scheme on the color saying that red is bigger than green, and so on. Ordinal variables describe the feature with a categorical value and provide an ordering system; for example: Education—elementary, high school, university degree, and so on.
With quantitative values, it's a different story. The major difference is that ratio has a true zero. Imagine the attribute was a length. If the length is 0, you know there's no length. But this does not apply to temperature, since there's an interval of possible values for the temperature, where 0°C or 0°F does not mean the beginning of the scale for the temperature (as absolute zero, or beginning of the scale is 273.15° C or -459.67° F). With °K, it would actually be a ratio type of the quantitative value, since the scale really begins with 0°K. So, as you can see, any number can be an interval or a ratio value, but it depends on the context!
- ArchiCAD 19:The Definitive Guide
- 腦動力:C語言函數速查效率手冊
- 大數據時代的數據挖掘
- Windows XP中文版應用基礎
- 21天學通Visual Basic
- 新手學電腦快速入門
- INSTANT Autodesk Revit 2013 Customization with .NET How-to
- 分數階系統分析與控制研究
- Hands-On Data Warehousing with Azure Data Factory
- 學練一本通:51單片機應用技術
- Photoshop CS5圖像處理入門、進階與提高
- 基于Proteus的單片機應用技術
- 步步驚“芯”
- 計算機組裝與維修實訓
- 人工智能云平臺:原理、設計與應用