官术网_书友最值得收藏!

Summary

In this chapter, you got a feel for the broader things we need to make the project work. We saw the steps that are involved in this process by using a text classification example. We saw how to prepare text for machine learning with scikit-learn. We saw Logistic Regression for ML. We also saw a confusion matrix, which is a quick and powerful tool for making sense of results in all machine learning, beyond NLP.

We are just getting started. From here on out, we will dive deeper into each of these steps and see what other methods exist out there. In the next chapter, we will look at some common methods for text cleaning and extraction. Since this is what we will spend up to 80% of our total time on, it's worth the time and energy learning it. 

主站蜘蛛池模板: 龙海市| 讷河市| 巴楚县| 漾濞| 怀化市| 乳源| 建水县| 德清县| 江达县| 白河县| 河北区| 内黄县| 临漳县| 潢川县| 大港区| 郯城县| 阿勒泰市| 江津市| 河西区| 凤庆县| 南江县| 东安县| 克山县| 通榆县| 天津市| 聊城市| 汾西县| 高唐县| 临澧县| 合肥市| 栖霞市| 荣成市| 桃源县| 兴宁市| 韶关市| 松滋市| 洛阳市| 三都| 汨罗市| 抚顺县| 图们市|