- Natural Language Processing with Python Quick Start Guide
- Nirant Kasliwal
- 77字
- 2021-06-10 18:36:36
Pre-processing
Simple changes in data pre-processing or the data cleaning stage can quite often give you dramatically better results. For instance, making sure that your entire corpus is in lowercase can help you reduce the number of unique words (your vocabulary size) by a significant fraction.
If your numeric representation of words is skewed by the word frequency, sometimes it helps to normalize and/or scale the same. The laziest hack is to simply divide by the frequency.
推薦閱讀
- Dynamics 365 Application Development
- 程序員考試案例梳理、真題透解與強化訓練
- VSTO開發(fā)入門教程
- 精通API架構(gòu):設(shè)計、運維與演進
- Mastering Kali Linux for Web Penetration Testing
- Mastering JavaScript High Performance
- Mastering Android Game Development
- Android玩家必備
- Learning PHP 7
- 愛上micro:bit
- HTML5權(quán)威指南
- ArcGIS for Desktop Cookbook
- Android嵌入式系統(tǒng)程序開發(fā):基于Cortex-A8(第2版)
- Python無監(jiān)督學習
- C Primer Plus(第6版)中文版【最新修訂版】