- Natural Language Processing with Python Quick Start Guide
- Nirant Kasliwal
- 2021-06-10 18:36:36
Pre-processing
Simple changes at the pre-processing or data-cleaning stage can often give you noticeably better results. For instance, converting your entire corpus to lowercase can reduce the number of unique words (your vocabulary size) by a significant fraction.
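As a rough illustration of the lowercasing effect, the sketch below counts unique whitespace-separated tokens before and after lowercasing. The tiny `corpus` and the helper `vocabulary` are made up for this example and are not from the book; a real pipeline would likely use a proper tokenizer.

```python
# Toy corpus, invented for illustration only.
corpus = [
    "The cat sat on the mat",
    "the Cat chased THE dog",
]


def vocabulary(docs, lowercase=False):
    """Return the set of unique whitespace-separated tokens in docs."""
    vocab = set()
    for doc in docs:
        if lowercase:
            doc = doc.lower()
        vocab.update(doc.split())
    return vocab


raw_vocab = vocabulary(corpus)
lower_vocab = vocabulary(corpus, lowercase=True)

print(len(raw_vocab), len(lower_vocab))
```

On this toy input, "The", "the" and "THE" collapse into a single entry, as do "cat" and "Cat", so the lowercased vocabulary is smaller than the raw one.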
If your numeric representation of words is skewed by word frequency, it sometimes helps to normalize and/or scale that representation. The laziest hack is to simply divide each value by the frequency of the corresponding word.
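One possible reading of the "divide by the frequency" hack, assuming the numeric representation is a plain bag-of-words count, is sketched below: each word's count in a document is divided by that word's total count across the corpus. The toy `corpus` and the helper `scaled_counts` are hypothetical names chosen for this example, not the book's code.

```python
from collections import Counter

# Toy corpus, invented for illustration only.
corpus = ["the cat sat on the mat", "the dog sat", "the cat ran"]
tokenized = [doc.split() for doc in corpus]

# Total number of times each word occurs across the whole corpus.
corpus_freq = Counter(tok for doc in tokenized for tok in doc)


def scaled_counts(tokens, corpus_freq):
    """Divide each word's in-document count by its corpus-wide frequency."""
    counts = Counter(tokens)
    return {word: count / corpus_freq[word] for word, count in counts.items()}


doc_vector = scaled_counts(tokenized[0], corpus_freq)
print(doc_vector)
```

Here the very frequent word "the" ends up with a smaller weight than the rare word "mat", even though it appears more often in the first document. More principled schemes such as TF-IDF follow the same intuition.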