目錄(206章)
倒序
- 封面
- 版權信息
- Credits
- Preface
- Part 1. Module 1
- Chapter 1. Introducing Data Analysis and Libraries
- Data analysis and processing
- An overview of the libraries in data analysis
- Summary
- Chapter 2. NumPy Arrays and Vectorized Computation
- NumPy arrays
- Array functions
- Data processing using arrays
- Linear algebra with NumPy
- NumPy random numbers
- Summary
- Chapter 3. Data Analysis with Pandas
- An overview of the Pandas package
- The Pandas data structure
- The essential basic functionality
- Indexing and selecting data
- Computational tools
- Working with missing data
- Advanced uses of Pandas for data analysis
- Summary
- Chapter 4. Data Visualization
- The matplotlib API primer
- Exploring plot types
- Legends and annotations
- Plotting functions with Pandas
- Additional Python data visualization tools
- Summary
- Chapter 5. Time Series
- Time series primer
- Working with date and time objects
- Resampling time series
- Downsampling time series data
- Upsampling time series data
- Time zone handling
- Timedeltas
- Time series plotting
- Summary
- Chapter 6. Interacting with Databases
- Interacting with data in text format
- Interacting with data in binary format
- Interacting with data in MongoDB
- Interacting with data in Redis
- Summary
- Chapter 7. Data Analysis Application Examples
- Data munging
- Data aggregation
- Grouping data
- Summary
- Chapter 8. Machine Learning Models with scikit-learn
- An overview of machine learning models
- The scikit-learn modules for different models
- Data representation in scikit-learn
- Supervised learning – classification and regression
- Unsupervised learning – clustering and dimensionality reduction
- Measuring prediction performance
- Summary
- Part 2. Module 2
- Chapter 1. Getting Started with Predictive Modelling
- Introducing predictive modelling
- Applications and examples of predictive modelling
- Python and its packages – download and installation
- Python and its packages for predictive modelling
- IDEs for Python
- Summary
- Chapter 2. Data Cleaning
- Reading the data – variations and examples
- Various methods of importing data in Python
- The read_csv method
- Use cases of the read_csv method
- Case 2 – reading a dataset using the open method of Python
- Case 3 – reading data from a URL
- Case 4 – miscellaneous cases
- Basics – summary dimensions and structure
- Handling missing values
- Creating dummy variables
- Visualizing a dataset by basic plotting
- Summary
- Chapter 3. Data Wrangling
- Subsetting a dataset
- Generating random numbers and their usage
- Grouping the data – aggregation filtering and transformation
- Random sampling – splitting a dataset in training and testing datasets
- Concatenating and appending data
- Merging/joining datasets
- Summary
- Chapter 4. Statistical Concepts for Predictive Modelling
- Random sampling and the central limit theorem
- Hypothesis testing
- Chi-square tests
- Correlation
- Summary
- Chapter 5. Linear Regression with Python
- Understanding the maths behind linear regression
- Making sense of result parameters
- Implementing linear regression with Python
- Model validation
- Handling other issues in linear regression
- Summary
- Chapter 6. Logistic Regression with Python
- Linear regression versus logistic regression
- Understanding the math behind logistic regression
- Implementing logistic regression with Python
- Model validation and evaluation
- Model validation
- Summary
- Chapter 7. Clustering with Python
- Introduction to clustering – what why and how?
- Mathematics behind clustering
- Implementing clustering using Python
- Fine-tuning the clustering
- Summary
- Chapter 8. Trees and Random Forests with Python
- Introducing decision trees
- Understanding the mathematics behind decision trees
- Implementing a decision tree with scikit-learn
- Understanding and implementing regression trees
- Understanding and implementing random forests
- Summary
- Chapter 9. Best Practices for Predictive Modelling
- Best practices for coding
- Best practices for data handling
- Best practices for algorithms
- Best practices for statistics
- Best practices for business contexts
- Summary
- Appendix A. A List of Links
- Part 3. Module 3
- Chapter 1. A Conceptual Framework for Data Visualization
- Data information knowledge and insight
- The transformation of data
- Data visualization history
- How does visualization help decision-making?
- Visualization plots
- Summary
- Chapter 2. Data Analysis and Visualization
- Why does visualization require planning?
- The Ebola example
- A sports example
- Creating interesting stories with data
- Perception and presentation methods
- Some best practices for visualization
- Visualization tools in Python
- Interactive visualization
- Summary
- Chapter 3. Getting Started with the Python IDE
- The IDE tools in Python
- Visualization plots with Anaconda
- Interactive visualization packages
- Summary
- Chapter 4. Numerical Computing and Interactive Plotting
- NumPy SciPy and MKL functions
- Scalar selection
- Slicing
- Array indexing
- Other data structures
- Visualization using matplotlib
- The visualization example in sports
- Summary
- Chapter 5. Financial and Statistical Models
- The deterministic model
- The stochastic model
- The threshold model
- An overview of statistical and machine learning
- Creating animated and interactive plots
- Summary
- Chapter 6. Statistical and Machine Learning
- Classification methods
- Understanding linear regression
- Linear regression
- Decision tree
- The Bayes theorem
- The Na?ˉve Bayes classifier
- The Na?ˉve Bayes classifier using TextBlob
- Viewing positive sentiments using word clouds
- k-nearest neighbors
- Logistic regression
- Support vector machines
- Principal component analysis
- k-means clustering
- Summary
- Chapter 7. Bioinformatics Genetics and Network Models
- Directed graphs and multigraphs
- The clustering coefficient of graphs
- Analysis of social networks
- The planar graph test
- The directed acyclic graph test
- Maximum flow and minimum cut
- A genetic programming example
- Stochastic block models
- Summary
- Chapter 8. Advanced Visualization
- Computer simulation
- Summary
- Appendix B. Go Forth and Explore Visualization
- An overview of conda
- Packages installed with Anaconda
- Packages websites
- About matplotlib
- Bibliography
- Index 更新時間:2021-07-09 18:52:29
推薦閱讀
- 基于LabWindows/CVI的虛擬儀器設計與應用
- Mobile DevOps
- Mastering D3.js
- Expert AWS Development
- 網絡綜合布線技術
- 80x86/Pentium微型計算機原理及應用
- Docker Quick Start Guide
- 3D Printing for Architects with MakerBot
- 計算機系統結構
- 單片機C語言應用100例
- 運動控制系統
- Learning ServiceNow
- 生物3D打印:從醫療輔具制造到細胞打印
- Mastering MongoDB 3.x
- 21天學通Linux嵌入式開發
- PostgreSQL 10 High Performance
- 傳感技術基礎與技能實訓
- 從機器學習到無人駕駛
- Learning Couchbase
- 深度學習之模型優化:核心算法與案例實踐
- 傳感器原理及應用(第二版)
- Arduino創意機器人入門:基于Mixly
- Hands-On Edge Analytics with Azure IoT
- 單片機數據通信及測控應用技術詳解
- Practical Data Science Cookbook
- Alexa Skills Projects
- 微機原理與接口技術(基于32位機)
- Java ME嵌入式程序設計
- R Web Scraping Quick Start Guide
- Practical Site Reliability Engineering