- Python:Data Analytics and Visualization
- Phuong Vo.T.H Martin Czygan Ashish Kumar Kirthi Raman
- 2021-07-09 18:51:46
Summary
In this chapter we have looked at ways to manipulate data frames, from cleaning and filtering to grouping, aggregation, and reshaping. Pandas makes many common operations very easy, and more complex operations, such as pivoting or grouping by multiple attributes, can often be expressed as one-liners as well. Cleaning and preparing data is an essential part of data exploration and analysis.
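As a reminder of what such one-liners can look like, here is a minimal sketch. The data frame and its column names (region, product, units) are invented for illustration and do not come from the chapter's data sets.

```python
import pandas as pd

# Hypothetical sample data, invented for illustration only.
sales = pd.DataFrame({
    "region": ["north", "north", "south", "south", "north", "south"],
    "product": ["a", "b", "a", "b", "a", "b"],
    "units": [3, 5, 2, 7, 4, 1],
})

# Grouping by multiple attributes and aggregating, in one line.
totals = sales.groupby(["region", "product"])["units"].sum()

# Pivoting the same data into a region-by-product table, also in one line.
table = sales.pivot_table(index="region", columns="product",
                          values="units", aggfunc="sum")

print(totals)
print(table)
```

Both results hold the same sums; the grouped version is a series with a two-level index, while the pivoted version spreads the products across columns.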
The next chapter gives a brief overview of machine learning algorithms, which apply the results of data analysis to make decisions or build helpful products.
Practice exercises
Exercise 1: Cleaning: In the section about filtering, we used the Europe Brent Crude Oil Spot Price, which can be found as an Excel document on the internet. Take this Excel spreadsheet and try to convert it into a CSV document that is ready to be imported with Pandas.
Hint: There are many ways to do this. We used a small tool called xls2csv.py, and we were able to load the resulting CSV file with a helper method:

    import datetime
    import pandas as pd

    def convert_date(s):
        # Strip the parentheses and split the remaining text on commas.
        parts = s.replace("(", "").replace(")", "").split(",")
        # Fewer than six fields: fall back to a placeholder date.
        if len(parts) < 6:
            return datetime.date(1970, 1, 1)
        # Otherwise treat the fields as (year, month, day, hour, minute, second).
        return datetime.datetime(*[int(p) for p in parts])

    df = pd.read_csv("RBRTEd.csv", sep=',', names=["date", "price"],
                     converters={"date": convert_date}).dropna()
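To try the helper without the real file, the sketch below feeds it a few rows from memory. The row layout (a quoted, parenthesized date tuple followed by a price) is an assumption inferred from what convert_date strips and splits; the actual output of xls2csv.py may differ, and the prices are made up.

```python
import datetime
import io

import pandas as pd


def convert_date(s):
    # Same helper as in the hint: parse "(y, m, d, H, M, S)" into a datetime.
    parts = s.replace("(", "").replace(")", "").split(",")
    if len(parts) < 6:
        return datetime.date(1970, 1, 1)
    return datetime.datetime(*[int(p) for p in parts])


# Made-up rows in the assumed layout; not real Brent prices.
sample = io.StringIO(
    '"(2015, 1, 2, 0, 0, 0)",55.5\n'
    '"(2015, 1, 5, 0, 0, 0)",51.0\n'
)

df = pd.read_csv(sample, sep=",", names=["date", "price"],
                 converters={"date": convert_date}).dropna()
print(df)

# A value that does not split into six fields falls back to 1970-01-01.
fallback = convert_date("unknown")
print(fallback)
```

Because malformed dates all map to the same placeholder, filtering out rows dated 1970-01-01 afterwards is one way to discard them.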
Exercise 2: Take a data set that is important for your work or, if you do not have one at hand, a data set that interests you and is available online. Ask one or two questions about the data in advance. Then use cleaning, filtering, grouping, and plotting techniques to answer your questions.