官术网_书友最值得收藏!

Selecting the best N-grams

The number of different N-grams grows exponentially in N. Even for a fixed tiny N, such as N=3, there are 256x256x256=16,777,216 possible N-grams. This means that the number of N-grams features is impracticably large. Consequently, we must select a smaller subset of N-grams that will be of most value to our classifiers. In this section, we show three different methods for selecting the topmost informative N-grams.

主站蜘蛛池模板: 江北区| 巴彦淖尔市| 鄂托克前旗| 静宁县| 称多县| 庄浪县| 光泽县| 敖汉旗| 方山县| 南岸区| 卢龙县| 台南市| 百色市| 灯塔市| 河源市| 武邑县| 孝昌县| 鄂伦春自治旗| 常山县| 同仁县| 南召县| 界首市| 韶山市| 英吉沙县| 旺苍县| 河北区| 陈巴尔虎旗| 沭阳县| 扎鲁特旗| 平和县| 获嘉县| 同心县| 咸阳市| 桃园市| 台北市| 会东县| 冀州市| 元阳县| 蓬溪县| 武夷山市| 化隆|