官术网_书友最值得收藏!

Finding or observing data

Data can be found or observed in many places. An obvious data source is the internet. With an increase in social media usage, and with mobile phones penetrating deeper as mobile data plans become cheaper or even offer unlimited data, there has been an exponential rise in data consumed by users.

Now, online streaming platforms have emerged—the following diagram shows that the hours spent on consuming video data is also growing rapidly:

To get data from the internet, there are multiple options, as shown in the following list:

  • Bulk downloads from websites such as Wikipedia, IMDb, and the Million Song Dataset (which can be found here: https://labrosa.ee.columbia.edu/millionsong/).
  • Accessing the data through APIs (such as Google, Twitter, Facebook, and YouTube).
  • It is okay to scrape public, non-sensitive, and anonymized data. Be sure to check the terms and conditions and to fully reference the information.

The main drawbacks of the data collected is that it takes time and space to accumulate the data, and it covers only what happened; for instance, intentions and internal and external motivations are not collected. Finally, such data might be noisy, incomplete, inconsistent, and may even change over time.

Another option is to collect measurements from sensors such as inertial and location sensors in mobile devices, environmental sensors, and software agents monitoring key performance indicators.

主站蜘蛛池模板: 灌云县| 都兰县| 武汉市| 伊吾县| 凤凰县| 全椒县| 革吉县| 丽水市| 兰州市| 七台河市| 绥江县| 门源| 高尔夫| 南靖县| 茶陵县| 刚察县| 浮山县| 杭锦旗| 祁门县| 家居| 通州市| 宜春市| 闸北区| 鸡西市| 扎兰屯市| 乡宁县| 弥渡县| 呼玛县| 清苑县| 林周县| 六盘水市| 永济市| 屏边| 和田市| 金坛市| 九寨沟县| 屏南县| 沙河市| 体育| 南部县| 东阳市|