When is web scraping useful?

Suppose I have a shop selling shoes and want to keep track of my competitor's prices. I could go to my competitor's website each day and compare each shoe's price with my own; however, this would take a lot of time and would not scale well if I sold thousands of shoes or needed to check for price changes frequently. Or maybe I just want to buy a pair of shoes when they are on sale. I could come back and check the shoe website each day until I get lucky, but the shoes I want might not be on sale for months. These repetitive manual processes could instead be replaced with an automated solution using the web scraping techniques covered in this book.

In an ideal world, web scraping wouldn't be necessary and each website would provide an API to share data in a structured format. Indeed, some websites do provide APIs, but they typically restrict the data that is available and how frequently it can be accessed. Additionally, a website developer might change, remove, or restrict the backend API. In short, we cannot rely on APIs to access the online data we may want. Therefore, we need to learn about web scraping techniques.
