- Hands-On Data Science with Anaconda
- Dr. Yuxing Yan James Yan
- 167字
- 2021-06-25 21:08:48
Data Basics
In this chapter, we'll first discuss sources of open data, which includes the University of California at Irvine (UCI) Machine Learning Depository, the Bureau of Labor Statistics, the Census Bureau, Professor French's Data Library, and the Federal Reserve's Data Library. Then, we will show you several ways of inputting data, how to deal with missing values, sorting, choosing a subset, merging different datasets, and data output. For different languages, such as Python, R, and Julia, several relevant packages for data manipulation will be introduced as well. In particular, the Python pandas package will be discussed.
In this chapter, the following topics will be covered:
- Sources of data
- Introduction to the Python pandas package
- Several ways to inputting packages
- Introduction to the Quandl data delivery platform
- Dealing with missing data
- Sorting data, as well as how to slice, dice, and merge various datasets
- Introduction to Python packages: cbsodata and datadotword
- Introduction to R packages: dslabs, haven, and foreign
- Generating Python datasets
- Generating R datasets
推薦閱讀
- 構(gòu)建高質(zhì)量的C#代碼
- 21天學(xué)通PHP
- PowerShell 3.0 Advanced Administration Handbook
- 7天精通Dreamweaver CS5網(wǎng)頁設(shè)計與制作
- 并行數(shù)據(jù)挖掘及性能優(yōu)化:關(guān)聯(lián)規(guī)則與數(shù)據(jù)相關(guān)性分析
- 程序設(shè)計語言與編譯
- 返璞歸真:UNIX技術(shù)內(nèi)幕
- MCSA Windows Server 2016 Certification Guide:Exam 70-741
- JMAG電機電磁仿真分析與實例解析
- 運動控制系統(tǒng)應(yīng)用與實踐
- Godot Engine Game Development Projects
- 從零開始學(xué)Java Web開發(fā)
- Hands-On Business Intelligence with Qlik Sense
- EJB JPA數(shù)據(jù)庫持久層開發(fā)實踐詳解
- Windows 7來了