官术网_书友最值得收藏!

What this book covers

Chapter 1, Tokenizing Text and WordNet Basics, covers the basics of tokenizing text and using WordNet.

Chapter 2, Replacing and Correcting Words, discusses various word replacement and correction techniques. The recipes cover the gamut of linguistic compression, spelling correction, and text normalization.

Chapter 3, Text Classification, describes a way to categorize documents or pieces of text and, by examining the word usage in a piece of text, classifiers decide what class label should be assigned to it.

主站蜘蛛池模板: 古田县| 申扎县| 兴隆县| 株洲市| 陈巴尔虎旗| 哈尔滨市| 周至县| 繁昌县| 津南区| 新密市| 保德县| 横峰县| 会宁县| 龙山县| 灵丘县| 孟津县| 凯里市| 承德县| 博野县| 榆林市| 天台县| 嘉禾县| 潮州市| 北流市| 云龙县| 思南县| 广饶县| 兴山县| 上高县| 沾化县| 兖州市| 磐石市| 饶河县| 上杭县| 四会市| 高青县| 鹿泉市| 慈利县| 凉城县| 宁晋县| 贵定县|