- Hands-On Python Natural Language Processing
- Aman Kedia Mayank Rasu
- 142字
- 2021-06-18 18:28:56
Phonemes, graphemes, and morphemes
Before we start looking at the steps for building vocabulary, we need to understand phonemes, graphemes, and morphemes:
- Phonemes can be thought of as the speech sounds, made by the mouth or unit of sound, that can differentiate one word from another in a language.
- Graphemes are groups of letters of size one or more that can represent these individual sounds or phonemes. The word spoon consists of five letters that actually represent four phonemes, identified by the graphemes s, p, oo, and n.
- A morpheme is the smallest meaningful unit in a language. The word unbreakable is composed of three morphemes:
- un—a bound morpheme signifying not
- break—the root morpheme
- able—a free morpheme signifying can be done
Now, let's delve into some practical aspects that form the base of every NLP-based system.
推薦閱讀
- 21天學(xué)通PHP
- 工業(yè)機(jī)器人產(chǎn)品應(yīng)用實(shí)戰(zhàn)
- 基于LabWindows/CVI的虛擬儀器設(shè)計(jì)與應(yīng)用
- 走入IBM小型機(jī)世界
- 數(shù)據(jù)運(yùn)營之路:掘金數(shù)據(jù)化時(shí)代
- 自動(dòng)檢測(cè)與轉(zhuǎn)換技術(shù)
- 精通Excel VBA
- Hands-On Cybersecurity with Blockchain
- AWS Administration Cookbook
- 中國戰(zhàn)略性新興產(chǎn)業(yè)研究與發(fā)展·工業(yè)機(jī)器人
- 過程控制系統(tǒng)
- 網(wǎng)絡(luò)服務(wù)器搭建與管理
- 單片機(jī)技術(shù)項(xiàng)目化原理與實(shí)訓(xùn)
- MATLAB-Simulink系統(tǒng)仿真超級(jí)學(xué)習(xí)手冊(cè)
- 機(jī)床電氣控制與PLC