官术网_书友最值得收藏!

Introduction

In the previous chapter, you learned about various extraction methods, such as tokenization, stemming, lemmatization, and stop-word removal, which are used to extract features from unstructured text. We also discussed Bag-of-Words and Term Frequency-Inverse Document Frequency (TF-IDF).

In this chapter, you will learn how to use these extracted features to develop machine learning models. These models are capable of solving real-world problems such as detecting whether sentiments carried by texts are positive or negative, predicting whether emails are spam or not, and so on. We will also cover concepts such as supervised and unsupervised learning, classifications and regressions, the sampling and splitting of data, along with evaluating the performance of a model in depth. This chapter also discusses how to load and save these models for future use.

主站蜘蛛池模板: 麻城市| 沅江市| 永城市| 贡山| 乡城县| 报价| 威远县| 渝中区| 松江区| 龙门县| 清丰县| 金湖县| 澎湖县| 永安市| 青龙| 嘉善县| 房产| 资溪县| 定兴县| 丰顺县| 遂昌县| 三都| 潼南县| 广宗县| 宜黄县| 家居| 山阴县| 安阳市| 吉安市| 大同市| 卫辉市| 聊城市| 浮梁县| 潼关县| 宜章县| 莆田市| 邵武市| 衡阳市| 双流县| 昔阳县| 买车|