官术网_书友最值得收藏!

External data with PolyBase

With data acquisition, we frequently face situations when data is not available in the SQL Server, and for our analysis, we usually import or query data from various other database platforms or other systems. SQL Server 2016 has introduced a new feature called PolyBase, which can help us with accessing external data from the SQL Server. PolyBase is able to access Hadoop-type file systems to query external data and to push the computation to Hadoop so that the SQL Server does not get overloaded while accessing large amounts of data. 

The great benefit of PolyBase is the unification of two very different worlds: structured data and unstructured data. Hadoop is a collection of open source utilities, which includes a distributed file system called hdfs. This data distribution is a challenge for data analysis, since the data is distributed and located in heterogeneous systems, which makes it very difficult to access and process from SQL Server. PolyBase allows you to interact between structured data, usually our tables in the SQL Server, and unstructured or semi-structured data, stored in the distributed file systems. PolyBase is not completely new to SQL Server; it was available as a component of the Parallel Data Warehouse from the Analytic Platform System tool, and it's just now been built into the SQL Server.

PolyBase is a feature that can be used to do the following:

  • Query data stored in Hadoop
  • Import data from Hadoop
  • Query data stored in Azure blob storage
  • Export data 
主站蜘蛛池模板: 嵊州市| 连城县| 盐源县| 五寨县| 河南省| 河北省| 贺州市| 崇明县| 松桃| 日喀则市| 老河口市| 大丰市| 屏边| 沂源县| 安庆市| 安达市| 济阳县| 依兰县| 乳山市| 张北县| 云霄县| 花垣县| 河南省| 达州市| 九龙县| 七台河市| 涟源市| 黑山县| 洛隆县| 麟游县| 双柏县| 甘谷县| 枣强县| 门头沟区| 承德县| 佳木斯市| 临西县| 昭通市| 荣昌县| 宣威市| 高淳县|