- Statistics for Data Science
- James D. Miller
- 163字
- 2021-07-02 14:58:53
Statistical comparison
Simply put, when you hear the term statistical comparison, one is usually referring to the act of a data scientist performing a process of analysis to view the similarities or variances of two or more groups or populations (or recordsets).
As a data developer, one might be familiar with various utilities such as FC Compare, UltraCompare, or WinDiff, which aim to provide the developer with a line-by-line comparison of the contents of two or more (even binary) files.
In statistics (data science), this process of comparing is a statistical technique to compare populations or recordsets. In this method, a data scientist will conduct what is called an Analysis of Variance (ANOVA), compare categorical variables (within the recordsets), and so on.
ANOVA is an assortment of statistical methods that are used to analyze the differences among group means and their associated procedures (such as variations among and between groups, populations, or recordsets). This method eventually evolved into the Six Sigma dataset comparisons.
推薦閱讀
- Hands-On Deep Learning with Apache Spark
- 數據展現的藝術
- 大數據管理系統
- 軟件架構設計
- Getting Started with Oracle SOA B2B Integration:A Hands-On Tutorial
- 機器自動化控制器原理與應用
- 嵌入式Linux上的C語言編程實踐
- 電腦上網直通車
- 21天學通Visual Basic
- 基于多目標決策的數據挖掘方法評估與應用
- 網絡化分布式系統預測控制
- 21天學通Linux嵌入式開發
- Hands-On DevOps
- Getting Started with Tableau 2019.2
- Wireshark Revealed:Essential Skills for IT Professionals