官术网_书友最值得收藏!

Analyzing Different Datasets

We all love to talk about the weather. So, let's work with some weather-related datasets. The datasets contain approximately five years' worth of high-temporal resolution (hourly measurements) data for various weather attributes, such as temperature, humidity, air pressure, and so on. We'll analyze and compare the humidity and weather datasets.

Let's begin by implementing the following steps:

  1. Load the humidity dataset by using the following command:
df_hum <- read.csv("data/historical-hourly-weather-data/humidity.csv")
  1. Load the weather description dataset by using the following command:
df_desc <- read.csv("data/historical-hourly-weather-data/weather_description.csv")
  1. Compare the two datasets by using the str command.

The outcome will be the humidity levels of different cities, as follows:

The weather descriptions of different cities are shown as follows:

The different geometric objects that we will be working with in this chapter are as follows:

One-dimensional objects are used to understand and visualize the characteristics of a single variable, as follows:

  • Histogram
  • Bar chart

Two-dimensional objects are used to visualize the relationship between two variables, as follows:

  • Bar chart
  • Boxplot
  • Line chart
  • Scatter plot

Although geometric objects are also used in base R, they don't follow the structure of the Grammar of Graphics and have different naming conventions, as compared to ggplot2. This is an important distinction, which we will look at in detail later.

主站蜘蛛池模板: 和平区| 博野县| 通江县| 临武县| 通道| 右玉县| 镇原县| 遵义市| 万山特区| 东莞市| 桓台县| 盖州市| 河源市| 迁西县| 工布江达县| 贵德县| 阿拉善盟| 阿拉善左旗| 济宁市| 伊川县| 东城区| 榆社县| 和顺县| 彝良县| 丹寨县| 宁明县| 周至县| 淮滨县| 中牟县| 洛南县| 泰兴市| 炉霍县| 噶尔县| 明光市| 仁化县| 五大连池市| 陕西省| 涪陵区| 阳曲县| 柘荣县| 巫山县|