Spark SQL

Spark SQL lets you query structured and semi-structured data from within a Spark program, using either SQL or the DataFrame API. A DataFrame is similar to a table in a relational database. Spark SQL can be embedded in ordinary Spark programs and used alongside MLlib, which enables interoperability between the different Spark modules.

Spark SQL provides the DataFrame abstraction in several programming languages, such as Python, Java, and Scala, for working with structured datasets. It can also read and write data in various structured formats, including JSON, Hive tables, and Parquet. In addition, the data can be queried with SQL either inside the Spark program or from external tools that connect to Spark SQL through standard database connectors (JDBC/ODBC).
