
3 Alternative Data for Finance – Categories and Use Cases

The previous chapter covered working with market and fundamental data, which have been the traditional drivers of trading strategies. In this chapter, we'll fast-forward to the recent emergence of a broad range of much more diverse data sources as fuel for discretionary and algorithmic strategies. Their heterogeneity and novelty have inspired the label of alternative data and created a rapidly growing provider and service industry.

Behind this trend is a familiar story: propelled by the explosive growth of the internet and mobile networks, digital data continues to grow exponentially amid advances in the technology to process, store, and analyze new data sources. The exponential growth in the availability of, and ability to manage, more diverse digital data has, in turn, been a critical force behind the dramatic performance improvements of machine learning (ML) that are driving innovation across industries, including the investment industry.

The scale of the data revolution is extraordinary: the past 2 years alone have witnessed the creation of 90 percent of all data that exists in the world today, and by 2020, each of the 7.7 billion people worldwide is expected to produce 1.7 MB of new information every second of every day. Usage lags far behind: back in 2012, only 0.5 percent of all data was ever analyzed and used, whereas 33 percent is deemed to have value by 2020. The gap between data availability and usage is likely to narrow quickly as global investments in analytics are set to rise beyond $210 billion by 2020, while the value creation potential is several times higher.

This chapter explains how individuals, business processes, and sensors produce alternative data. It also provides a framework to navigate and evaluate the proliferating supply of alternative data for investment purposes. It demonstrates the workflow, from acquisition to preprocessing and storage, using Python for data obtained through web scraping to set the stage for the application of ML. It concludes by providing examples of sources, providers, and applications.

This chapter will cover the following topics:

  • Which new sources of information have been unleashed by the alternative data revolution
  • How individuals, business processes, and sensors generate alternative data
  • Evaluating the burgeoning supply of alternative data used for algorithmic trading
  • Working with alternative data in Python, such as by scraping the internet
  • Important categories and providers of alternative data
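
As a preview of the acquisition, preprocessing, and storage workflow, the following is a minimal sketch rather than the chapter's actual code. It assumes the page has already been downloaded, so a hard-coded HTML snippet stands in for the HTTP response. The table contents, the `CellParser` class, and the `bookings` table name are hypothetical and chosen only for illustration. It uses only the Python standard library.

```python
import sqlite3
from html.parser import HTMLParser

# Hypothetical page content; a real workflow would download this over HTTP.
SAMPLE_HTML = """
<table>
  <tr><th>restaurant</th><th>bookings</th></tr>
  <tr><td> Cafe A </td><td>1,204</td></tr>
  <tr><td>Bistro B</td><td>87</td></tr>
</table>
"""


class CellParser(HTMLParser):
    """Collect the text of each <td> cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_endtag(self, tag):
        if tag == "td":
            self._in_cell = False
        elif tag == "tr":
            # Header rows contain only <th> cells, so they stay empty and are skipped.
            if self._row:
                self.rows.append(self._row)
            self._row = None

    def handle_data(self, data):
        if self._in_cell:
            self._row[-1] += data


def parse_rows(html):
    """Acquisition + preprocessing: extract cells, trim names, parse counts."""
    parser = CellParser()
    parser.feed(html)
    return [(name.strip(), int(count.replace(",", ""))) for name, count in parser.rows]


def store(rows, conn):
    """Storage: persist the cleaned rows in a SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS bookings (restaurant TEXT, bookings INTEGER)"
    )
    conn.executemany("INSERT INTO bookings VALUES (?, ?)", rows)
    conn.commit()


rows = parse_rows(SAMPLE_HTML)
conn = sqlite3.connect(":memory:")  # in-memory database keeps the sketch self-contained
store(rows, conn)
total = conn.execute("SELECT SUM(bookings) FROM bookings").fetchone()[0]
print(rows, total)
```

Keeping parsing, cleaning, and storage in separate functions means the hard-coded snippet can later be swapped for a downloaded page without touching the rest of the pipeline.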

You can find the code samples for this chapter and links to additional resources in the corresponding directory of the GitHub repository. The notebooks include color versions of the images.
