官术网_书友最值得收藏!

Chapter 2. Integrity and Inspection

This chapter will cover the following recipes:

  • Trimming excess whitespace
  • Ignoring punctuation and specific characters
  • Coping with unexpected or missing input
  • Validating records by matching regular expressions
  • Lexing and parsing an e-mail address
  • Deduplication of nonconflicting data items
  • Deduplication of conflicting data items
  • Implementing a frequency table using Data.List
  • Implementing a frequency table using Data.MultiSet
  • Computing the Manhattan distance
  • Computing the Euclidean distance
  • Comparing scaled data using the Pearson correlation coefficient
  • Comparing sparse data using cosine similarity
主站蜘蛛池模板: 邮箱| 岗巴县| 班戈县| 射洪县| 正阳县| 平度市| 怀来县| 遵化市| 驻马店市| 扶余县| 贺兰县| 五常市| 普格县| 壶关县| 铜鼓县| 贡嘎县| 宜兴市| 龙川县| 江安县| 蒙自县| 乌审旗| 孟州市| 北流市| 微博| 久治县| 专栏| 旬阳县| 库车县| 神木县| 丰都县| 安顺市| 建宁县| 蓝山县| 安庆市| 东丰县| 石狮市| 高清| 平顶山市| 怀宁县| 福海县| 奉化市|