官术网_书友最值得收藏!

Summary

In this chapter, you got a feel for the broader things we need to make the project work. We saw the steps that are involved in this process by using a text classification example. We saw how to prepare text for machine learning with scikit-learn. We saw Logistic Regression for ML. We also saw a confusion matrix, which is a quick and powerful tool for making sense of results in all machine learning, beyond NLP.

We are just getting started. From here on out, we will dive deeper into each of these steps and see what other methods exist out there. In the next chapter, we will look at some common methods for text cleaning and extraction. Since this is what we will spend up to 80% of our total time on, it's worth the time and energy learning it. 

主站蜘蛛池模板: 瓮安县| 白玉县| 青铜峡市| 辉县市| 芜湖市| 泸定县| 什邡市| 沾化县| 崇礼县| 乡宁县| 贵港市| 长阳| 高碑店市| 巨鹿县| 曲麻莱县| 彭阳县| 阜平县| 外汇| 漯河市| 句容市| 墨脱县| 祁门县| 萨嘎县| 偏关县| 甘孜| 桐梓县| 渝中区| 竹山县| 宣武区| 冷水江市| 遂平县| 龙岩市| 封丘县| 清远市| 石首市| 安达市| 曲阜市| 芮城县| 宜黄县| 天峨县| 朝阳市|