- Natural Language Processing with Python Quick Start Guide
- Nirant Kasliwal
- 129字
- 2021-06-10 18:36:38
Bread and butter – most common tasks
There are several well-known text cleaning ideas. They have all made their way into the most popular tools today such as NLTK, Stanford CoreNLP, and spaCy. I like spaCy for two main reasons:
- It's an industry-grade NLP, unlike NLTK, which is mainly meant for teaching.
- It has good speed-to-performance trade-off. spaCy is written in Cython, which gives it C-like performance with Python code.
spaCy is actively maintained and developed, and incorporates the best methods available for most challenges.
By the end of this section, you will be able to do the following:
- Understand tokenization and do it manually yourself using spaCy
- Understand why stop word removal and case standardization works, with spaCy examples
- Differentiate between stemming and lemmatization, with spaCy lemmatization examples
推薦閱讀
- Intel Galileo Essentials
- Spring 5.0 By Example
- Python數據分析入門與實戰
- 算法訓練營:入門篇(全彩版)
- 人臉識別原理及算法:動態人臉識別系統研究
- The DevOps 2.4 Toolkit
- C#程序設計
- Mastering Linux Network Administration
- Flutter跨平臺開發入門與實戰
- Terraform:多云、混合云環境下實現基礎設施即代碼(第2版)
- 智能搜索和推薦系統:原理、算法與應用
- Illustrator CC平面設計實戰從入門到精通(視頻自學全彩版)
- 大學計算機基礎
- NGUI for Unity
- Python高性能編程(第2版)