- Mastering Machine Learning on AWS
- Dr. Saket S.R. Mengle, Maximo Gurmendez
- 2021-06-24 14:23:16
Summary
In this chapter, we explained why ML is a crucial tool in a data scientist's repertoire. We discussed what a structured ML dataset looks like and how to identify the types of features it contains.
We took a deep dive into the Naive Bayes classification algorithm and studied how it applies Bayes' theorem. We learned that, using Bayes' theorem, we can estimate the probability of each possible outcome given the values of the features, and then select the outcome with the highest probability.
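As a reminder of that decision rule, here is a minimal sketch in plain Python. The class names (`source_a`, `source_b`), the two features, and all probability values are hypothetical, chosen only to make the arithmetic easy to follow. They are not taken from the chapter's dataset.

```python
from math import prod

def naive_bayes_posteriors(priors, likelihoods, observed):
    """Return P(class | observed features), treating features as independent given the class."""
    # Unnormalized score per class: P(class) * product of P(feature value | class)
    scores = {
        label: priors[label]
        * prod(likelihoods[label][feature][value] for feature, value in observed.items())
        for label in priors
    }
    # The sum of the scores is the evidence term, the denominator in Bayes' theorem
    total = sum(scores.values())
    return {label: score / total for label, score in scores.items()}

# Hypothetical priors and per-class feature likelihoods (illustration only)
priors = {"source_a": 0.4, "source_b": 0.6}
likelihoods = {
    "source_a": {"has_link": {True: 0.8, False: 0.2}, "has_hashtag": {True: 0.5, False: 0.5}},
    "source_b": {"has_link": {True: 0.25, False: 0.75}, "has_hashtag": {True: 0.1, False: 0.9}},
}

observed = {"has_link": True, "has_hashtag": False}
posteriors = naive_bayes_posteriors(priors, likelihoods, observed)

# Select the class with the highest posterior probability
prediction = max(posteriors, key=posteriors.get)
```

With these made-up numbers, the unnormalized scores are 0.4 × 0.8 × 0.5 = 0.16 for `source_a` and 0.6 × 0.25 × 0.9 = 0.135 for `source_b`, so `source_a` is selected even though its prior is lower.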
We also worked through an example based on a Twitter dataset. We hope that you learned how to think about a text classification problem and how to build a Naive Bayes classification model to predict the source of a tweet. We showed how the algorithm can be implemented in SageMaker and in Apache Spark. This code base should give you a starting point for similar text classification problems in the future. Because the implementation uses SageMaker services and Spark, it can scale to datasets that are gigabytes or terabytes in size.
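To make the text classification idea concrete without any cloud services, the following is a small sketch in plain Python of a word-count Naive Bayes classifier with add-one (Laplace) smoothing. It is not the SageMaker or Spark code from the chapter. The function names `train` and `predict`, the labels `source_a` and `source_b`, and the four example tweets are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict

def train(tweets):
    """tweets: list of (text, source) pairs. Returns the counts needed for prediction."""
    doc_counts = Counter()               # number of tweets per source
    word_counts = defaultdict(Counter)   # word frequencies per source
    for text, source in tweets:
        doc_counts[source] += 1
        word_counts[source].update(text.lower().split())
    vocab = {word for counts in word_counts.values() for word in counts}
    return doc_counts, word_counts, vocab

def predict(model, text):
    """Return the source with the highest log posterior score for the given text."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_source, best_score = None, -math.inf
    for source in doc_counts:
        # Log prior plus summed log likelihoods; logs avoid underflow on long texts
        score = math.log(doc_counts[source] / total_docs)
        total_words = sum(word_counts[source].values())
        for word in text.lower().split():
            if word not in vocab:
                continue  # skip words never seen during training
            # Add-one smoothing so an unseen word/source pair never yields log(0)
            score += math.log((word_counts[source][word] + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_source, best_score = source, score
    return best_source

# Hypothetical training tweets (illustration only)
training = [
    ("great rally tonight thank you", "source_a"),
    ("thank you for the great crowd", "source_a"),
    ("we need healthcare for all", "source_b"),
    ("healthcare is a right for all", "source_b"),
]
model = train(training)
```

Calling `predict(model, "thank you great crowd")` picks `source_a`, and `predict(model, "healthcare for all")` picks `source_b`. A real pipeline would add proper tokenization and a held-out evaluation set, and at scale the counting step is what a distributed framework parallelizes.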
In later chapters, we will look at how to deploy ML models on production clusters.