官术网_书友最值得收藏!

Summary

In this chapter, we looked at the various steps that are needed to build a natural language vocabulary. These play the most critical role in preprocessing any natural language data. Data preprocessing is probably one of the most important aspects of any machine learning application, and the same applies to NLP as well. When performed properly, these steps help with the machine learning aspects that generally occur after preprocessing the data, consequently providing better results most of the time compared with scenarios where no preprocessing is involved.

In the next chapter, we will use the techniques discussed in this chapter to preprocess data and subsequently build mathematical representations of text that can be understood by machine learning algorithms.

主站蜘蛛池模板: 讷河市| 五指山市| 喀什市| 晋州市| 陵川县| 泊头市| 彰化县| 百色市| 台山市| 兰考县| 正镶白旗| 两当县| 类乌齐县| 临澧县| 金堂县| 肥乡县| 明溪县| 水富县| 肇东市| 巴东县| 凤翔县| 高雄市| 抚州市| 岑溪市| 天祝| 溧阳市| 金阳县| 平凉市| 南涧| 松滋市| 商城县| 汝南县| 蕲春县| 绥芬河市| 嘉祥县| 仁化县| 鸡东县| 布尔津县| 若羌县| 永靖县| 阿瓦提县|