Introduction

In this chapter, we introduce working with data in JSON, CSV, and XML formats. This includes parsing this data and converting it to other formats, as well as storing it in relational databases, search engines such as Elasticsearch, and cloud storage including AWS S3. We also discuss creating distributed, large-scale scraping tasks using messaging systems including AWS Simple Queue Service (SQS). The goal is to provide both an understanding of the various forms of data you may retrieve and need to parse, and an introduction to the various backends where you can store the data you have scraped. Finally, we get a first introduction to some of the Amazon Web Services (AWS) offerings. By the end of the book we will be using AWS quite heavily, and this chapter serves as a gentle introduction.
