Table of Contents (103 chapters)
- Cover page
- Mastering Python for Data Science
- Credits
- About the Author
- About the Reviewers
- www.PacktPub.com
- Support files, eBooks, discount offers, and more
- Preface
- What this book covers
- What you need for this book
- Who this book is for
- Conventions
- Reader feedback
- Customer support
- Chapter 1. Getting Started with Raw Data
- The world of arrays with NumPy
- Empowering data analysis with pandas
- Data cleansing
- Data operations
- Summary
- Chapter 2. Inferential Statistics
- Various forms of distribution
- A z-score
- A p-value
- One-tailed and two-tailed tests
- Type 1 and Type 2 errors
- A confidence interval
- Correlation
- Z-test vs T-test
- The F distribution
- The chi-square distribution
- The chi-square test of independence
- ANOVA
- Summary
- Chapter 3. Finding a Needle in a Haystack
- What is data mining?
- Presenting an analysis
- Studying the Titanic
- Summary
- Chapter 4. Making Sense of Data through Advanced Visualization
- Controlling the line properties of a chart
- Creating multiple plots
- Playing with text
- Styling your plots
- Box plots
- Heatmaps
- Scatter plots with histograms
- A scatter plot matrix
- Area plots
- Bubble charts
- Hexagon bin plots
- Trellis plots
- A 3D plot of a surface
- Summary
- Chapter 5. Uncovering Machine Learning
- Different types of machine learning
- Decision trees
- Linear regression
- Logistic regression
- The naive Bayes classifier
- The k-means clustering
- Hierarchical clustering
- Summary
- Chapter 6. Performing Predictions with a Linear Regression
- Simple linear regression
- Multiple regression
- Training and testing a model
- Summary
- Chapter 7. Estimating the Likelihood of Events
- Logistic regression
- Summary
- Chapter 8. Generating Recommendations with Collaborative Filtering
- Recommendation data
- User-based collaborative filtering
- Item-based collaborative filtering
- Summary
- Chapter 9. Pushing Boundaries with Ensemble Models
- The census income dataset
- Decision trees
- Random forests
- Summary
- Chapter 10. Applying Segmentation with k-means Clustering
- The k-means algorithm and its working
- The k-means clustering with countries
- Clustering the countries
- Summary
- Chapter 11. Analyzing Unstructured Data with Text Mining
- Preprocessing data
- Creating a wordcloud
- Word and sentence tokenization
- Parts of speech tagging
- Stemming and lemmatization
- The Stanford Named Entity Recognizer
- Performing sentiment analysis on world leaders using Twitter
- Summary
- Chapter 12. Leveraging Python in the World of Big Data
- What is Hadoop?
- Python MapReduce
- File handling with Hadoopy
- Pig
- Python with Apache Spark
- Summary
- Index
Last updated: 2021-07-16 20:14:41