- Natural Language Processing Fundamentals
- Sohom Ghosh Dwight Gunning
- 133字
- 2021-06-11 13:42:29
Introduction
In the previous chapter, we learned about the concepts of Natural Language Processing (NLP) and text analytics. We also looked at various pre-processing steps in brief. In this chapter, we will learn how to deal with text data whose formats are mostly unstructured. Unstructured data cannot be represented in a tabular format. Therefore, it is essential to convert it into numeric features because most machine learning algorithms are capable of dealing only with numbers. More emphasis will be put on steps such as tokenization, stemming, lemmatization, and stop-word removal. You will also learn about two popular methods for feature extraction: bag of words and Term Frequency-Inverse Document Frequency, as well as various methods for creating new features from existing features. Finally, you will become familiar with how text data can be visualized.
- Deep Learning Quick Reference
- 21天學通Visual C++
- Moodle Course Design Best Practices
- INSTANT Drools Starter
- 工業控制系統測試與評價技術
- 面向對象程序設計綜合實踐
- 在實戰中成長:Windows Forms開發之路
- 多媒體制作與應用
- Applied Data Visualization with R and ggplot2
- Hands-On Data Warehousing with Azure Data Factory
- 格蠹匯編
- LMMS:A Complete Guide to Dance Music Production Beginner's Guide
- Machine Learning Algorithms(Second Edition)
- 漢字錄入技能訓練
- 計算機組裝與維修實訓