- Learning pandas(Second Edition)
- Michael Heydt
- 224字
- 2021-07-02 20:36:58
Retrieval
Once you have an idea you must then find data to try and support your hypothesis. This data can come from within your organization or from external data providers. This data normally is provided as archived data or can be provided in real-time (although pandas is not well known for being a real-time data processing tool).
Data is often very raw, even if obtained from data sources that you have created or from within your organization. Being raw means that the data can be disorganized, may be in various formats, and erroneous; relative to supporting your analysis, it may be incomplete and need manual augmentation.
There is a lot of free data in the world. Much data is not free and actually costs significant amounts of money to obtain. Some is freely available with public APIs, and the others by subscription. Data you pay for is often cleaner, but this is not always the case.
In either case, pandas provides a robust and easy-to-use set of tools for retrieving data from various sources and that may be in many different formats. pandas also gives us the ability to not only retrieve data, but to also provide an initial structuring of the data via pandas data structures without needing to manually create complex coding, which may be required in other tools or programming languages.
- Web前端開發(fā)技術:HTML、CSS、JavaScript(第3版)
- 數(shù)據(jù)結(jié)構(gòu)簡明教程(第2版)微課版
- 青少年Python編程入門
- JavaScript:Moving to ES2015
- Mastering Drupal 8 Views
- Jupyter數(shù)據(jù)科學實戰(zhàn)
- 程序是怎樣跑起來的(第3版)
- 領域驅(qū)動設計:軟件核心復雜性應對之道(修訂版)
- C++反匯編與逆向分析技術揭秘(第2版)
- 21天學通C++(第5版)
- Solutions Architect's Handbook
- jQuery技術內(nèi)幕:深入解析jQuery架構(gòu)設計與實現(xiàn)原理
- Hands-On Dependency Injection in Go
- PHP 7 Programming Blueprints
- R語言數(shù)據(jù)分析從入門到實戰(zhàn)