- Statistics for Data Science
- James D. Miller
- 184字
- 2021-07-02 14:58:57
Contextual data issues
A lot of the previously mentioned data issues can be automatically detected and even corrected. The issues may have been originally caused by user entry errors, by corruption in transmission or storage, or by different definitions or understandings of similar entities in different data sources. In data science, there is more to think about.
During data cleaning, a data scientist will attempt to identify patterns within the data, based on a hypothesis or assumption about the context of the data and its intended purpose. In other words, any data that the data scientist determines to be either obviously disconnected with the assumption or objective of the data or obviously inaccurate will then be addressed. This process is reliant upon the data scientist's judgment and his or her ability to determine which points are valid and which are not.
- 后稀缺:自動化與未來工作
- Unreal Engine:Game Development from A to Z
- Project 2007項目管理實用詳解
- 教父母學會上網
- AutoCAD 2012中文版繪圖設計高手速成
- 學會VBA,菜鳥也高飛!
- Python:Data Analytics and Visualization
- 嵌入式操作系統原理及應用
- 寒江獨釣:Windows內核安全編程
- 人工智能:智能人機交互
- 自適應學習:人工智能時代的教育革命
- 項目實踐精解:C#核心技術應用開發
- Getting Started with Tableau 2019.2
- Cisco UCS Cookbook
- Linux Administration Cookbook