目錄(103章)
倒序
- coverpage
- Mastering Python for Data Science
- Credits
- About the Author
- About the Reviewers
- www.PacktPub.com
- Support files eBooks discount offers and more
- Preface
- What this book covers
- What you need for this book
- Who this book is for
- Conventions
- Reader feedback
- Customer support
- Chapter 1. Getting Started with Raw Data
- The world of arrays with NumPy
- Empowering data analysis with pandas
- Data cleansing
- Data operations
- Summary
- Chapter 2. Inferential Statistics
- Various forms of distribution
- A z-score
- A p-value
- One-tailed and two-tailed tests
- Type 1 and Type 2 errors
- A confidence interval
- Correlation
- Z-test vs T-test
- The F distribution
- The chi-square distribution
- The chi-square test of independence
- ANOVA
- Summary
- Chapter 3. Finding a Needle in a Haystack
- What is data mining?
- Presenting an analysis
- Studying the Titanic
- Summary
- Chapter 4. Making Sense of Data through Advanced Visualization
- Controlling the line properties of a chart
- Creating multiple plots
- Playing with text
- Styling your plots
- Box plots
- Heatmaps
- Scatter plots with histograms
- A scatter plot matrix
- Area plots
- Bubble charts
- Hexagon bin plots
- Trellis plots
- A 3D plot of a surface
- Summary
- Chapter 5. Uncovering Machine Learning
- Different types of machine learning
- Decision trees
- Linear regression
- Logistic regression
- The naive Bayes classifier
- The k-means clustering
- Hierarchical clustering
- Summary
- Chapter 6. Performing Predictions with a Linear Regression
- Simple linear regression
- Multiple regression
- Training and testing a model
- Summary
- Chapter 7. Estimating the Likelihood of Events
- Logistic regression
- Summary
- Chapter 8. Generating Recommendations with Collaborative Filtering
- Recommendation data
- User-based collaborative filtering
- Item-based collaborative filtering
- Summary
- Chapter 9. Pushing Boundaries with Ensemble Models
- The census income dataset
- Decision trees
- Random forests
- Summary
- Chapter 10. Applying Segmentation with k-means Clustering
- The k-means algorithm and its working
- The k-means clustering with countries
- Clustering the countries
- Summary
- Chapter 11. Analyzing Unstructured Data with Text Mining
- Preprocessing data
- Creating a wordcloud
- Word and sentence tokenization
- Parts of speech tagging
- Stemming and lemmatization
- The Stanford Named Entity Recognizer
- Performing sentiment analysis on world leaders using Twitter
- Summary
- Chapter 12. Leveraging Python in the World of Big Data
- What is Hadoop?
- Python MapReduce
- File handling with Hadoopy
- Pig
- Python with Apache Spark
- Summary
- Index 更新時間:2021-07-16 20:14:41
推薦閱讀
- UNIX編程藝術
- Reporting with Visual Studio and Crystal Reports
- Spring 5企業級開發實戰
- 程序員面試筆試寶典
- Learning RabbitMQ
- 深入淺出PostgreSQL
- Selenium Testing Tools Cookbook(Second Edition)
- R數據科學實戰:工具詳解與案例分析
- Python計算機視覺和自然語言處理
- Java Web開發基礎與案例教程
- Java程序設計及應用開發
- Tkinter GUI Programming by Example
- JavaScript程序設計基礎教程(慕課版)
- 區塊鏈原理、設計與應用
- Performance Testing with JMeter 3(Third Edition)
- CorelDRAW X6中文版應用教程(第二版)
- PostGIS Cookbook
- Mahout實踐指南
- Node.js入門指南
- HTML5與CSS3權威指南(第2版·下冊)
- Java核心技術卷I基礎知識(原書第9版)
- The Ruby Workshop
- R語言入門與實踐
- 客戶驅動的產品開發
- Learning Object-Oriented Programming
- Team Foundation Server 2015 Customization
- Learning System Center App Controller
- Learn Kotlin Programming(Second Edition)
- Microsoft Dynamics CRM 2011 Reporting
- 從零開始學C程序設計