Data Basics

In this chapter, we'll first discuss sources of open data, including the University of California at Irvine (UCI) Machine Learning Repository, the Bureau of Labor Statistics, the Census Bureau, Professor French's Data Library, and the Federal Reserve's Data Library. Then, we will show you several ways of inputting data, how to deal with missing values, and how to sort data, choose a subset, merge different datasets, and output data. Several relevant data manipulation packages for Python, R, and Julia will be introduced as well, with particular attention to the Python pandas package.

In this chapter, the following topics will be covered:

  • Sources of data
  • Introduction to the Python pandas package
  • Several ways of inputting data
  • Introduction to the Quandl data delivery platform
  • Dealing with missing data
  • Sorting data, as well as how to slice, dice, and merge various datasets
  • Introduction to Python packages: cbsodata and datadotworld
  • Introduction to R packages: dslabs, haven, and foreign
  • Generating Python datasets
  • Generating R datasets
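
As a preview of the pandas operations listed above, here is a minimal sketch that handles a missing value, sorts, selects a subset, merges two datasets, and writes and re-reads the result. The two small tables, their column names, and their values are hypothetical, invented purely for illustration; they do not come from any of the data sources named in this chapter.

```python
import io

import numpy as np
import pandas as pd

# Hypothetical toy data, invented for illustration only
prices = pd.DataFrame({
    "ticker": ["IBM", "MSFT", "WMT", "GE"],
    "price": [140.0, np.nan, 95.5, 12.3],
})
sectors = pd.DataFrame({
    "ticker": ["IBM", "MSFT", "WMT"],
    "sector": ["Tech", "Tech", "Retail"],
})

# Dealing with missing data: drop rows whose price is NaN
clean = prices.dropna(subset=["price"])

# Sorting: highest price first
ranked = clean.sort_values("price", ascending=False)

# Choosing a subset: keep rows with a price above 50
expensive = ranked[ranked["price"] > 50]

# Merging: attach the sector to each remaining ticker
merged = pd.merge(expensive, sectors, on="ticker", how="left")

# Output and input: write CSV text to an in-memory buffer, then read it back
buffer = io.StringIO()
merged.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
reloaded = pd.read_csv(io.StringIO(csv_text))

print(reloaded)
```

An in-memory buffer is used instead of a file on disk so that the sketch runs without touching the filesystem; passing a file path to `to_csv` and `read_csv` works the same way.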