- Machine Learning With Go
- Daniel Whitenack
- 2021-07-08 10:37:32
Distributions
A distribution is a representation of how often values appear within a dataset. Suppose, for instance, that as a data scientist you are tracking the daily sales of a certain product or service, and you have a long list of these daily sales numbers (which you could represent as a vector or as part of a matrix). These sales numbers are part of our dataset, and they include one day with sales of $121, another day with sales of $207, and so on.
One of the sales numbers we have accumulated will be the lowest, and one will be the highest. The rest of the sales numbers fall somewhere in between (at least if we assume no exact duplicates). The following image represents these low, high, and in-between values of sales along a line:

This is a distribution of sales, or at least one representation of it. Note that the distribution has areas where the numbers are dense and areas where they are sparse. Note also that the numbers tend to cluster near the center of the distribution.
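To make the idea concrete, here is a minimal sketch in Go that finds the lowest and highest values in a list of daily sales and counts how many values fall into equal-width ranges between them. The sales figures are made up for illustration (only $121 and $207 come from the text), and the helper names `minMax` and `binCounts` are hypothetical, not part of any library.

```go
package main

import (
	"fmt"
	"strings"
)

// minMax returns the smallest and largest values in xs.
// It assumes xs is non-empty.
func minMax(xs []float64) (lo, hi float64) {
	lo, hi = xs[0], xs[0]
	for _, x := range xs[1:] {
		if x < lo {
			lo = x
		}
		if x > hi {
			hi = x
		}
	}
	return lo, hi
}

// binCounts splits the range [min, max] of xs into n equal-width bins
// and returns how many values land in each bin.
func binCounts(xs []float64, n int) []int {
	counts := make([]int, n)
	lo, hi := minMax(xs)
	width := (hi - lo) / float64(n)
	for _, x := range xs {
		i := 0 // if every value is identical, everything goes in the first bin
		if width > 0 {
			i = int((x - lo) / width)
			if i >= n {
				i = n - 1 // the maximum value belongs to the last bin
			}
		}
		counts[i]++
	}
	return counts
}

func main() {
	// Hypothetical daily sales in dollars; only 121 and 207 appear in the text.
	sales := []float64{121, 207, 163, 158, 171, 149, 166, 187, 160, 135}

	lo, hi := minMax(sales)
	fmt.Printf("lowest: $%.0f, highest: $%.0f\n", lo, hi)

	// Print one row of stars per bin as a rough text histogram.
	for i, c := range binCounts(sales, 4) {
		fmt.Printf("bin %d: %s\n", i, strings.Repeat("*", c))
	}
}
```

Bins with more stars correspond to the denser areas of the line, and bins with few stars to the sparse areas. The number of bins is a free choice here; too few hides the shape of the distribution, while too many leaves most bins nearly empty.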