官术网_书友最值得收藏!

  • Web Scraping with Python
  • Richard Lawson
  • 259字
  • 2021-07-09 21:28:50

Is web scraping legal?

Web scraping is in the early Wild West stage, where what is permissible is still being established. If the scraped data is being used for personal use, in practice, there is no problem. However, if the data is going to be republished, then the type of data scraped is important.

Several court cases around the world have helped establish what is permissible when scraping a website. In Feist Publications, Inc. v. Rural Telephone Service Co., the United States Supreme Court decided that scraping and republishing facts, such as telephone listings, is allowed. Then, a similar case in Australia, Telstra Corporation Limited v. Phone Directories Company Pty Ltd, demonstrated that only data with an identifiable author can be copyrighted. Also, the European Union case, ofir.dk vs home.dk, concluded that regular crawling and deep linking is permissible.

These cases suggest that when the scraped data constitutes facts (such as business locations and telephone listings), it can be republished. However, if the data is original (such as opinions and reviews), it most likely cannot be republished for copyright reasons.

In any case, when you are scraping data from a website, remember that you are their guest and need to behave politely or they may ban your IP address or proceed with legal action. This means that you should make download requests at a reasonable rate and define a user agent to identify you. The next section on crawling will cover these practices in detail.

主站蜘蛛池模板: 临泽县| 柘荣县| 阿坝县| 德庆县| 龙门县| 古蔺县| 松桃| 绥宁县| 双江| 香河县| 南乐县| 彭州市| 深水埗区| 闽侯县| 双流县| 和平县| 达州市| 威远县| 平武县| 叶城县| 江城| 吉木乃县| 招远市| 牡丹江市| 五莲县| 项城市| 诸城市| 旌德县| 沽源县| 邯郸县| 剑河县| 徐闻县| 吕梁市| 兴和县| 嘉鱼县| 辽中县| 宜宾市| 买车| 格尔木市| 那坡县| 华亭县|