官术网_书友最值得收藏!

Summary

In this chapter, we learned what data munging is and why it is necessary for data science. Julia provides functionalities to facilitate data munging with the DataFrames.jl package, with features such as these:

  • NA: A missing value in Julia is represented by a specific data type, NA.
  • DataArray: DataArray provided in the DataFrames.jl provides features such as allowing us to store some missing values in an array.
  • DataFrame: DataFrame is 2-D data structure like spreadsheets. It is very similar to R or pandas's dataframes, and provides many functionalities to represent and analyze data. DataFrames has many features well suited for data analysis and statistical modeling.
  • A dataset can have different types of data in different columns.
  • Records have a relation with other records in the same row of different columns of the same length.
  • Columns can be labeled. Labeling helps us to easily become familiar with the data and access it without the need to remember their numerical indices.

We learned about importing data from a file using the readtable() function and exporting data to a file. The readtable() function provides flexibility when using many arguments.

We also explored joining of datasets, such as RDBMS tables. Julia provides various joins that we can exploit according to our use case.

We discussed the Split-Apply-Combine Strategy, one of the most widely used techniques deployed by data scientists, and why it is needed. We went through reshaping or pivoting data using stack and melt (stackdf, meltdf) functions and explored the various possibilities involved. We were also introduced to PooledDataArray and learned why it is required for efficient memory management.

We were introduced to web scraping, which is sometimes a must for a data scientist to gather data. We also used the Requests package to fetch an HTTP response.

主站蜘蛛池模板: 元氏县| 黄大仙区| 重庆市| 资中县| 博乐市| 积石山| 青川县| 万宁市| 通化市| 山阴县| 乌拉特后旗| 廉江市| 龙南县| 疏附县| 南安市| 昌都县| 乌兰察布市| 湛江市| 九江市| 轮台县| 林口县| 潜山县| 姚安县| 定襄县| 吉水县| 南木林县| 崇仁县| 华阴市| 德格县| 台东市| 永安市| 安乡县| 工布江达县| 蕲春县| 嘉鱼县| 九江市| 辉南县| 红桥区| 靖远县| 余庆县| 伊宁县|