- Haskell Data Analysis Cookbook
- Nishant Shukla
- 170字
- 2021-12-08 12:43:34
Introduction

The conclusions drawn from data analysis are only as robust as the quality of the data itself. After obtaining raw text, the next natural step is to validate and clean it carefully. Even the slightest bias may risk the integrity of the results. Therefore, we must take great precautionary measures, which involve thorough inspection, to ensure sanity checks are performed on our data before we begin to understand it. This section should be the starting point for cleaning data in Haskell.
Real-world data often has an impurity that needs to be addressed before it can be processed. For example, extraneous whitespaces or punctuation could clutter data, making it difficult to parse. Duplication and data conflicts are another area of unintended consequences of reading real-world data. Sometimes it's just reassuring to know that data makes sense by conducting sanity checks. Some examples of sanity checks include matching regular expressions as well as detecting outliers by establishing a measure of distance. In this chapter, we will cover each of these topics.
- VMware View Security Essentials
- 自然語言處理實(shí)戰(zhàn):預(yù)訓(xùn)練模型應(yīng)用及其產(chǎn)品化
- Microsoft Exchange Server PowerShell Cookbook(Third Edition)
- 移動(dòng)界面(Web/App)Photoshop UI設(shè)計(jì)十全大補(bǔ)
- Windows內(nèi)核編程
- Cocos2d-x by Example:Beginner's Guide(Second Edition)
- OpenCV with Python Blueprints
- 網(wǎng)絡(luò)數(shù)據(jù)采集技術(shù):Java網(wǎng)絡(luò)爬蟲實(shí)戰(zhàn)
- Java EE 7 with GlassFish 4 Application Server
- Python 3 Object:oriented Programming(Second Edition)
- OpenCV Android開發(fā)實(shí)戰(zhàn)
- 大學(xué)計(jì)算機(jī)應(yīng)用基礎(chǔ)(Windows 7+Office 2010)(IC3)
- Python Social Media Analytics
- Hands-On Dependency Injection in Go
- 例說FPGA:可直接用于工程項(xiàng)目的第一手經(jīng)驗(yàn)