官术网_书友最值得收藏!

Using Hive

As opposed to relational data warehouses, nested data models have complex types such as array, map, and struct. We can partition tables based on the values of one or more columns with the PARTITIONED BY clause. Moreover, tables or partitions can be bucketed using CLUSTERED BY columns, and data can be sorted within that bucket via SORT BY columns:

  • Tables: They are very similar to RDBMS tables and contain rows and tables.
  • Partitions: Hive tables can have more than one partition. They are mapped to subdirectories and filesystems as well.
  • Buckets: Data can also be pided into buckets in Hive. They can be stored as files in partitions in the underlying filesystem.

The Hive query language provides the basic SQL-like operations. Here are few of the tasks that HQL can do easily:

  • Create and manage tables and partitions
  • Support various relational, arithmetic, and logical operators
  • Evaluate functions
  • Download the contents of a table to a local directory or the results of queries to the HDFS directory
主站蜘蛛池模板: 会理县| 涞水县| 大邑县| 济宁市| 台中县| 宁乡县| 靖边县| 乌恰县| 邯郸市| 友谊县| 保山市| 昭苏县| 康定县| 衡南县| 和龙市| 鄂尔多斯市| 四会市| 肇东市| 彰化市| 湘潭县| 怀宁县| 东光县| 西盟| 洞口县| 德兴市| 堆龙德庆县| 来凤县| 额尔古纳市| 日喀则市| 汉川市| 蛟河市| 邵阳市| 莱芜市| 沁水县| 卢氏县| 额敏县| 屏东县| 灵石县| 惠东县| 牟定县| 城市|