官术网_书友最值得收藏!

Example – text classification workflow

The preceding process is fairly generic. What would it look like for one of the most common natural language applications text classification?

The following flow diagram was built by Microsoft Azure, and is used here to explain how their own technology fits directly into our workflow template. There are several new words that they have introduced to feature engineering, such as unigrams, TF-IDF, TF, n-grams, and so on:

The main steps in their flow diagram are as follows:

  1. Step 1: Data preparation
  2. Step 2: Text pre-processing
  3. Step 3: Feature engineering:
    • Unigrams TF-IDF extraction
    • N-grams TF extraction
  4. Step 4: Train and evaluate models
  5. Step 5: Deploy trained models as web services

This means that it's time to stop talking and start programming. Let's quickly set up the environment first and then we will work on building our first text classification system in 30 lines of code or less.

主站蜘蛛池模板: 鲁甸县| 靖远县| 上虞市| 长武县| 吉木萨尔县| 蒙城县| 龙里县| 海林市| 腾冲县| 依安县| 革吉县| 托克托县| 达孜县| 柘城县| 唐海县| 凉城县| 嵩明县| 德令哈市| 灯塔市| 淮阳县| 阿克陶县| 延寿县| 颍上县| 海南省| 思南县| 平利县| 土默特左旗| 乌海市| 三江| 呼伦贝尔市| 皋兰县| 蒲江县| 信宜市| 怀柔区| 万源市| 绥德县| 玛多县| 绵阳市| 视频| 张掖市| 闽清县|