官术网_书友最值得收藏!

Features

When we are talking about features in the context of ML , what we mean is some characteristic property of the object or phenomenon we are investigating.

Other names for the same concept you'll see in some publications are explanatory variable, independent variable, and predictor.

Features are used to distinguish objects from each other and to measure the similarity between them.

For instance:

  • If the objects of our interest are books, features could be a title, page count, author's name, a year of publication, genre, and so on
  • If the objects of interest are images, features could be intensities of each pixel
  • If the objects are blog posts, features could be language, length, or presence of some terms
It's useful to imagine your data as a spreadsheet table. In this case, each sample (data point) would be a row, and each feature would be a column. For example, Table 1.1 shows a tiny dataset of books consisting of four samples where each has eight features.

Table 1.1: an example of a ML dataset (dummy books):

主站蜘蛛池模板: 山东省| 双流县| 淮滨县| 新建县| 健康| 乡宁县| 新安县| 通渭县| 大安市| 孟津县| 泸州市| 团风县| 涟水县| 兰西县| 营山县| 温宿县| 吐鲁番市| 藁城市| 廊坊市| 库伦旗| 宁晋县| 台东县| 泊头市| 绥德县| 定陶县| 瑞昌市| 斗六市| 钟山县| 鹤庆县| 麻江县| 灌云县| 信阳市| 丽江市| 汾西县| 廉江市| 宣汉县| 宜章县| 拉萨市| 融水| 砀山县| 鄱阳县|